Sr Data Engineer
India - Hyderabad
Amgen
Amgen is committed to unlocking the potential of biology for patients suffering from serious illnesses by discovering, developing, manufacturing and delivering innovative human therapeutics.
Career Category
Information Systems
Job Description
Join Amgen’s Mission of Serving Patients
At Amgen, if you feel like you’re part of something bigger, it’s because you are. Our shared mission—to serve patients living with serious illnesses—drives all that we do.
Since 1980, we’ve helped pioneer the world of biotech in our fight against the world’s toughest diseases. With our focus on four therapeutic areas (Oncology, Inflammation, General Medicine, and Rare Disease), we reach millions of patients each year. As a member of the Amgen team, you’ll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller, happier lives.
Our award-winning culture is collaborative, innovative, and science-based. If you have a passion for challenges and the opportunities that lie within them, you’ll thrive as part of the Amgen team. Join us and transform the lives of patients while transforming your career.
What you will do
About the role
You will play a key role on the Operations Generative AI (GenAI) Product team, delivering cutting-edge, innovative GenAI solutions across the Process Development functions (Drug Substance, Drug Product, Attribute Sciences, and Combination Products) within Operations.
Role Description:
The Sr Data Engineer supports GenAI solutions across the Process Development functions (Drug Substance, Drug Product, Attribute Sciences, and Combination Products) within Operations. The role is responsible for designing, building, and maintaining data solutions, and for analyzing and interpreting data to provide actionable insights that drive business decisions. This includes working with large datasets, developing reports, supporting and implementing data governance initiatives, and visualizing data so that it is accessible, reliable, and efficiently managed. The ideal candidate has strong technical skills, experience with big data technologies, and a deep understanding of data architecture and ETL processes.
Roles & Responsibilities:
Design, develop, and maintain data solutions for data generation, collection, and processing.
Be a key team member who assists in the design and development of data pipelines.
Create data pipelines and ensure data quality by implementing ETL processes to migrate and deploy data across systems.
Contribute to the design, development, and implementation of data pipelines, ETL/ELT processes, and data integration solutions.
Take ownership of data pipeline projects from inception to deployment, managing scope, timelines, and risks.
Collaborate with multi-functional teams to understand data requirements and design solutions that meet business needs.
Develop and maintain data models, data dictionaries, and other documentation to ensure data accuracy and consistency.
Implement data security and privacy measures to protect sensitive data.
Leverage cloud platforms (AWS preferred) to build scalable and efficient data solutions.
Develop solutions for handling unstructured data in AI pipelines.
Collaborate with Data Architects, Business SMEs, and Data Scientists to design and develop end-to-end data pipelines that meet fast-paced business needs across geographic regions.
Identify and resolve complex data-related challenges.
Adhere to standard processes for coding, testing, and designing reusable code and components.
Explore new tools and technologies that will help to improve ETL platform performance.
Participate in sprint planning meetings and provide estimations on technical implementation.
Collaborate and communicate effectively with product teams.
What we expect of you
We are all different, yet we all use our unique contributions to serve patients.
Basic Qualifications:
Master’s degree with 4 - 6 years of experience in Computer Science, IT or related field OR
Bachelor’s degree with 6 - 8 years of experience in Computer Science, IT or related field OR
Diploma with 10 - 12 years of experience in Computer Science, IT or related field.
Must-Have Skills:
Hands-on experience with big data technologies and platforms, such as Databricks and Apache Spark (PySpark, SparkSQL), including workflow orchestration and performance tuning of big data processing.
Proficiency in data analysis tools (e.g., SQL) and experience with data visualization tools.
Experience with software engineering best practices, including but not limited to version control (Git, Subversion, etc.), CI/CD (Jenkins, Maven, etc.), automated unit testing, and DevOps.
Excellent problem-solving skills and the ability to work with large, complex datasets.
Strong understanding of data governance frameworks, tools, and standard methodologies.
Experience in implementing Retrieval-Augmented Generation (RAG) pipelines, integrating retrieval mechanisms with language models.
Strong programming skills in Python and familiarity with deep learning frameworks such as PyTorch or TensorFlow.
Experience in processing and leveraging unstructured data for GenAI applications.
Preferred Qualifications:
Experience with ETL tools such as Apache Spark, and with Python packages for data processing and machine learning model development.
Strong understanding of data modeling, data warehousing, and data integration concepts.
Knowledge of Python/R, Databricks.
Knowledge of vector databases, including implementation and optimization.
Professional Certifications:
Certified Data Engineer / Data Analyst (preferred on Databricks or cloud environments).
Machine Learning Certification (preferred on Databricks or cloud environments).
SAFe for Teams certification (preferred).
Soft Skills:
Excellent analytical and troubleshooting skills.
Strong verbal and written communication skills.
Ability to work effectively with global, virtual teams.
High degree of initiative and self-motivation.
Ability to manage multiple priorities successfully.
Team-oriented, with a focus on achieving team goals.
What you can expect of us
As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way.
In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards.
Equal opportunity statement
Amgen is an Equal Opportunity employer and will consider you without regard to your race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status.
We will ensure that individuals with disabilities are provided with reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation.
Apply now and make a lasting impact with the Amgen team.
careers.amgen.com
As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease.
Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law.