Data Engineer II
Bengaluru, Karnataka, India
Applications have closed
MediBuddy
MediBuddy is one of the best (cashless) healthcare providers in India. At MediBuddy you can book Health check packages, online lab tests, online medicines, online doctor consultation, teleconsultation, dental consultation and many more. You can...Location: Bengaluru,Karnataka,India
Data Engineer
Data @ MediBuddy?
Data function at MediBuddy is designed to Empower all the users to make key decisions using data. We believe in democratizing data so that people can independently explore the data to make informed decisions and derive insights to enhance customer experience at MediBuddy.
MediBuddy is a matrixed organization which is driven by business and executed by pods and supported by various functions. Each pod will be working on a specific problem statement which is aligned with the business objective of the specific business unit. The pods are staffed with people from different functional areas. A pod is fundamentally driven by business and executes problems independently of others.
Tech Stacks:
BI Tool - Superset
Databases - Druid, Trino, Redshift, Postgres, MySql, MSSQL
Tools - OpenSearch, Spark, Custom ETL pipeline
As a Data Engineer, you will play a pivotal role in developing innovative data-driven solutions @MediBuddy.
What will you do at Medibuddy ?
Develop, maintain, and run the data platform responsible for ETL (Extract, Transform, Load), dataset management, and data catalog.
Maintenance of versioned datasets to enable faster data analytics for product facing features
Ensuring quality of data, analytics pipeline reliability, and data stack efficiency.
Drive continuous adoption and integration of relevant and latest technologies into the data platform
Work and collaborate with cross functional team to deliver the required data sets
What makes you a match for us?
At least 3+ years of experience as a Data Engineer dealing with large complex data workflows and real-time data pipelines.
Hands-on experience with Python, SQL, data warehouse design, implementation and maintenance.
Demonstrated experience in data modeling & ETL development.
Data Warehousing experience with databases like Redshift, etc.
Ability to understand basic query profiles and execution plans. Experience in query performance tuning is a plus.
Coding proficiency in at least one modern programming language (Python, Scala, etc)
Experience with Big Data Technologies (Presto, Hadoop, Hive, Spark, Airflow, etc.)
Experience in large-scale data warehousing projects using Redshift, S3, etc.
Good to have
Experience with AWS Glue, Airflow, EMR, CDC.
Experience with data modeling, data warehousing, data lake supporting analytics for BI
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS AWS Glue Big Data CX Data Analytics Data pipelines Data warehouse Data Warehousing ETL Hadoop MS SQL MySQL OpenSearch Pipelines PostgreSQL Python Redshift Scala Spark SQL Superset
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.