Lead Data Engineer

IND - Maharashtra - Pune (Wework)

MSD

At MSD, we're following the science to tackle some of the world's greatest health threats. Get a glimpse of how we work to improve lives.

Job Description

Lead Data Engineer

At our company, we leverage analytics and technology as we invent for life on behalf of patients around the world. We are seeking people with a passion for using data, analytics, and insights to drive decision-making that will allow us to tackle some of the world’s greatest health threats.

Within our commercial Insights, Analytics, and Data organization, we are transforming to better power decision-making across our end-to-end commercialization process, from business development to late lifecycle management. As part of this transformation, we are seeking a dynamic talent for the role of Data Engineer.


For the Data Engineer role, we are looking for a professional with experience in designing, developing, and maintaining data pipelines. We intend to make data reliable, governed, secure, and available for analytics within the organization. As part of a team, this role will be responsible for data management across a broad range of activities, such as data ingestion into cloud data lakes and warehouses, quality control, metadata management, and orchestration of machine learning models. We are also forward-looking and plan to bring innovations like data mesh and data fabric into our ecosystem of tools and processes.
 

Primary Responsibilities:

  • Play a key role in the success and growth of the Data Engineering team by mentoring and playing a leadership role within the team
  • Drive innovation within Data Engineering by playing a lead role in technology decisions for the future of our data science, analysis, and reporting needs
  • Work with business partners and software engineers to gather, understand, and bridge definitions and requirements
  • Lead the design and development of highly complex and critical data projects with strict timelines
  • Improve team efficiency and effectiveness by implementing data tools (self-service, data quality, etc.)
  • Design, develop, and maintain data pipelines to extract data from a variety of sources and populate the data lake and data warehouse
  • Develop the various data transformation rules and data modeling capabilities
  • Collaborate with Data Analysts, Data Scientists, and Machine Learning Engineers to identify and transform data for ingestion, exploration, and modeling
  • Work with the data governance team to implement data quality checks and maintain data catalogs
  • Use orchestration, logging, and monitoring tools to build resilient pipelines
  • Use test-driven development methodology when building ELT/ETL pipelines
  • Understand and apply concepts like data lake, data warehouse, lakehouse, data mesh, and data fabric where relevant
  • Develop data models for cloud data warehouses like Redshift and Snowflake
  • Develop pipelines to ingest data into cloud data warehouses
  • Understand and be able to use different database types, such as relational, document, graph, and key/value
  • Analyze data using SQL
  • Use serverless AWS services like Glue, Lambda, Step Functions
  • Use Terraform code to deploy on AWS
  • Containerize Python code using Docker
  • Use Git for version control and understand various branching strategies
  • Build pipelines to work with large datasets using PySpark
  • Develop proof of concepts using Jupyter Notebooks
  • Work as part of an agile team
  • Create technical documentation as needed

Education:

  • Bachelor’s Degree or equivalent experience in a relevant field such as Mathematics, Computer Science, Engineering, Artificial Intelligence, etc.
     

Required Experience and Skills:

  • 9+ years of total experience
  • Good experience with AWS services like S3, ECS, Fargate, Glue, Step Functions, CloudWatch, Lambda, and EMR
  • SQL
  • Proficient in Python, PySpark
  • Good with Git, Docker, Terraform
  • Ability to work in cross functional teams

Preferred Experience and Skills:

  • Any AWS developer or architect certification
  • Agile development methodology


Our Human Health Division maintains a “patient first, profits later” ideology. The organization is comprised of sales, marketing, market access, digital analytics and commercial professionals who are passionate about their role in bringing our medicines to our customers worldwide.  

Search Firm Representatives Please Read Carefully 
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company.  No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails. 

Employee Status:

Regular

Relocation:

VISA Sponsorship:

Travel Requirements:

Flexible Work Arrangements:

Hybrid

Shift:

Valid Driving License:

Hazardous Material(s):


Required Skills:

Business Intelligence (BI), Database Administration, Data Engineering, Data Management, Data Modeling, Data Visualization, Information Management, Information Technology (IT) Infrastructure, Network Infrastructures, Software Development


Preferred Skills:

Architecture Development, Artificial Intelligence (AI), Business Case Development, Business Development, Business Model Development, Business Process Development, Business Reporting, Business Transformation, Creativity (Inactive), Data Lake, Data Quality Assessments, Data Reporting, Data Transformation, Data Warehouse Development, Data Warehouse Software, Decision Making, Development Design, Digital Analytics, Documentations, Innovation, Leadership, Management Development, Marketing, Metadata Management

Job Posting End Date:

09/28/2024

*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.


Tags: Agile Architecture AWS Business Intelligence Computer Science Data governance Data management Data pipelines Data quality Data visualization Data warehouse Docker ECS ELT Engineering ETL Git Jupyter Lambda Machine Learning Mathematics ML models Pipelines PySpark Python Redshift Snowflake SQL Step Functions Terraform

Perks/benefits: Career development Relocation support Team events

Region: Asia/Pacific
Country: India