Data Engineer
IND - Maharashtra - Pune (Wework)
MSD
At MSD, we're following the science to tackle some of the world's greatest health threats. Get a glimpse of how we work to improve lives.Job Description
Data Engineer
At our company we are leveraging analytics and technology, as we invent for life on behalf of patients around the world. We are seeking those who have a passion for using data, analytics, and insights to drive decision making, that will allow us to tackle some of the world’s greatest health threats.
Within our commercial Insights, Analytics, and Data organization we are transforming to better power decision-making across our end-to-end commercialization process, from business development to late lifecycle management. As we endeavor, we are seeking a dynamic talent for the role of Data Engineer
For the Data Engineer role, we are looking for professional with experience in designing, developing, and maintaining data pipelines. We intend to make data reliable, governed, secure and available for analytics within the organization. As part of a team this role will be responsible for data management with a broad range of activities like data ingestion to cloud data lakes and warehouses, quality control, metadata management and orchestration of machine learning models. We are also forward looking and plan to bring innovations like data mesh and data fabric into our ecosystem of tools and processes.
Primary Responsibilities:
· Design, develop and maintain data pipelines to extract data from a variety of sources and populate data lake and data warehouse
· Develop the various data transformation rules and data modeling capabilities
· Collaborate with Data Analyst, Data Scientists, Machine Learning Engineers to identify and transform data for ingestion, exploration, and modeling
· Work with data governance team and implement data quality checks and maintain data catalogs
· Use Orchestration, logging, and monitoring tools to build resilient pipelines
· Use test driven development methodology when building ELT/ETL pipelines
· Develop pipelines to ingest data into cloud data warehouses
· Analyze data using SQL
· Use serverless AWS services like Glue, Lambda, StepFunctions
· Use Terraform Code to deploy on AWS
· Containerize Python code using Docker
· Use Git for version control and understand various branching strategies
· Build pipelines to work with large datasets using PySpark
· Develop proof of concepts using Jupyter Notebooks
· Work as part of an agile team
· Create technical documentation as needed
Education:
· Bachelor’s Degree or equivalent experience in a relevant field such as Engineering (preferably computer engg.), Computer Science
Required Experience and Skills:
· 4-8 years of relevant experience
· Good experience with AWS services like S3, ECS, Fargate, Glue,
StepFunctions, CloudWatch, Lambda, EMR
· SQL
· Proficient in Python, PySpark
· Good with Git, Docker, Terraform
· Ability to work in cross functional teams
Preferred Experience and Skills
· Any AWS developer or architect certification
· Agile development methodology
Our Human Health Division maintains a “patient first, profits later” ideology. The organization is comprised of sales, marketing, market access, digital analytics and commercial professionals who are passionate about their role in bringing our medicines to our customers worldwide.
Who we are …
We are known as Merck & Co., Inc., Rahway, New Jersey, USA in the United States and Canada and MSD everywhere else. For more than a century, we have been inventing for life, bringing forward medicines and vaccines for many of the world's most challenging diseases. Today, our company continues to be at the forefront of research to deliver innovative health solutions and advance the prevention and treatment of diseases that threaten people and animals around the world.
What we look for …
Imagine getting up in the morning for a job as important as helping to save and improve lives around the world. Here, you have that opportunity. You can put your empathy, creativity, digital mastery, or scientific genius to work in collaboration with a diverse group of colleagues who pursue and bring hope to countless people who are battling some of the most challenging diseases of our time. Our team is constantly evolving, so if you are among the intellectually curious, join us—and start making your impact today.
We are proud to be a company that embraces the value of bringing diverse, talented, and committed people together. The fastest way to breakthrough innovation is when diverse ideas come together in an inclusive environment. We encourage our colleagues to respectfully challenge one another’s thinking and approach problems collectively. We are an equal opportunity employer, committed to fostering an inclusive and diverse workplace.
Current Employees apply HERE
Current Contingent Workers apply HERE
Search Firm Representatives Please Read Carefully
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company. No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails.
Employee Status:
RegularRelocation:
VISA Sponsorship:
Travel Requirements:
Flexible Work Arrangements:
HybridShift:
Valid Driving License:
Hazardous Material(s):
Required Skills:
Business Intelligence (BI), Database Administration, Data Engineering, Data Management, Data Modeling, Data Visualization, Information Management, Information Technology (IT) Infrastructure, Network Infrastructures, Software DevelopmentPreferred Skills:
Architecture Development, Architecture Development, Azure Data Factory, Bathymetric Survey, Business Case Development, Business Development, Business Model Development, Business Process Development, Business Transformation, Data Analysis, Data Extraction, Data Infrastructure, Data Mapping, Data Monitoring, Data Quality Assessments, Data Research, Data Science, Data Transformation, Data Warehouse Development, Data Warehouse Software, Decision Making, Development Design, Digital Analytics, Documentations, Geospatial Data {+ 7 more}Job Posting End Date:
11/21/2024*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Architecture AWS Azure Business Intelligence Computer Science Data analysis Data governance Data management Data pipelines Data quality Data visualization Data warehouse Docker ECS ELT Engineering ETL Git Jupyter Lambda Machine Learning ML models Pipelines PySpark Python Research SQL Terraform
Perks/benefits: Career development Relocation support Startup environment Team events
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.