Sr Data Scientist
India - Hyderabad
Amgen
Amgen is committed to unlocking the potential of biology for patients suffering from serious illnesses by discovering, developing, manufacturing and delivering innovative human therapeutics.Career Category
Information SystemsJob Description
What you will do
Let’s do this. Let’s change the world. In this vital role you will responsible for developing interconnected business information models and ontologies that capture real-world meaning of data by studying the business, our data, and the industry. With a focus on pharmaceutical industry-specific data, including Clinical, Operations, and Chemistry, Manufacturing, and Controls (CMC), this role involves creating robust semantic models based on data-centric principles to realize a connected data ecosystem that empowers consumers. The Information Modeler drives seamless cross-functional data interoperability, enables efficient decision-making, and supports digital transformation in pharmaceutical operations.
Roles & Responsibilities:
- Lead conversations with business collaborators to elucidate semantic models of pharmaceutical business concepts, aligned definitions, and relationships. Negotiate and debate across collaborators to drive alignment and create system-independent information models, taking a data-centric approach aligned with business data domains.
- Develop comprehensive business information models and ontologies that capture industry-specific concepts, including CMC, Clinical, and Operations data.
- Facilitate whiteboarding sessions with business subject matter experts to elicit knowledge, drive interoperability across pharmaceutical domains, and interface between data producers and consumers.
- Educate peers on the practical use and differentiating value of Linked Data and FAIR+ data principles. Champion standards for master data & reference data.
- Formalize data models in RDF as OWL and SHACL ontologies that interoperate with each other and with relevant industry standards like FHIR and IDMP for healthcare data exchange.
- Build a broad semantic knowledge graph that threads data together across end-to-end business processes and enables the transformation to data-centricity and new ways of working
- Apply pragmatic semantic abstraction to simplify diverse pharmaceutical and healthcare data patterns effectively.
What we expect of you
We are all different, yet we all use our unique contributions to serve patients.
Basic Qualifications:
- Doctorate degree OR
- Master’s degree and 4 to 6 years of Data Science experience OR
- Bachelor’s degree and 6 to 8 years of Data Science experience OR
- Diploma and 10 to 12 years of Data Science experience
Preferred Qualifications:
About amgen
Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 40 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today.
About the role
You will play a key role in a regulatory submission content automation initiative which will modernize and digitize the regulatory submission process, positioning Amgen as a leader in regulatory innovation. The initiative uses state-of-the-art technologies, including Generative AI, Structured Content Management, and integrated data to automate the creation, review, and approval of regulatory content.
Role Description:
The Sr Data Scientist is responsible for developing interconnected business information models and ontologies that capture real-world meaning of data by studying the business, our data, and the industry. With a focus on pharmaceutical industry-specific data, including Clinical, Operations, and Chemistry, Manufacturing, and Controls (CMC), this role involves creating robust semantic models based on data-centric principles to realize a connected data ecosystem that empowers consumers. The Information Modeler drives seamless cross-functional data interoperability, enables efficient decision-making, and supports digital transformation in pharmaceutical operations.
Roles & Responsibilities:
- Lead conversations with business partners to elucidate semantic models of pharmaceutical business concepts, aligned definitions, and relationships. Negotiate and debate across partners to drive alignment and create system-independent information models, taking a data-centric approach aligned with business data domains.
- Develop comprehensive business information models and ontologies that capture industry-specific concepts, including CMC, Clinical, and Operations data.
- Facilitate whiteboarding sessions with business subject matter experts to elicit knowledge, drive interoperability across pharmaceutical domains, and interface between data producers and consumers.
- Educate peers on the practical use and differentiating value of Linked Data and FAIR+ data principles. Champion standards for master data & reference data.
- Formalize data models in RDF as OWL and SHACL ontologies that interoperate with each other and with relevant industry standards like FHIR and IDMP for healthcare data exchange.
- Build a broad semantic knowledge graph that threads data together across end-to-end business processes and enables the transformation to data-centricity and new ways of working
- Apply pragmatic semantic abstraction to simplify diverse pharmaceutical and healthcare data patterns effectively.
Basic Qualifications and Experience:
- Doctorate Degree OR
- Master’s degree with 4 to 6 years of experience in Data Science OR
- Bachelor’s degree with 6 to 8 years of experience in Data Science OR
- Diploma with 10 to 12 years of experience in Data Science
Functional Skills:
Must-Have Skills:
- Proven ability to lead and develop successful teams.
- Strong problem-solving, analytical, and critical thinking skills to address complex data challenges.
- Deep understanding of pharmaceutical industry data, including CMC, Process Development, Manufacturing, Engineering Quality, Supply Chain, and Operations.
- Advanced skills in semantic modeling, RDF, OWL, SHACL, and ontology development in TopBraid and/or Protégé.
- Demonstrated experience creating knowledge graphs with semantic RDF technologies (e.g. Stardog, AllegroGraph, GraphDB, Neptune) and testing models with real data.
- Highly proficient with RDF, SPARQL, Linked Data concepts, and interacting with triple stores.
- Highly proficient at facilitating, capturing, and organizing collaborative discussions through tools such as Miro, Lucidspark, Lucidchart, and Confluence.
- Expertise in FAIR data principles and their application in healthcare and pharmaceutical data models.
Good-to-Have Skills:
- Experience in regulatory data modeling and compliance requirements in the pharmaceutical domain.
- Familiarity with pharmaceutical lifecycle data (PLM), including product development and regulatory submissions.
- Knowledge of supply chain and operations data modeling in the pharmaceutical industry.
- Proficiency in integrating data from various sources, such as LIMS, EDC systems, and MES.
- Hands-on data analysis and wrangling experience including SQL-based data transformation and solving integration challenges arising from differences in data structure, meaning, or terminology
- Expertise in FHIR data standards and their application in healthcare and pharmaceutical data models.
Soft Skills:
- Exceptional interpersonal, business analysis, facilitation, and communication skills.
- Ability to interpret complex regulatory and operational requirements into data models.
- Analytical thinking for problem-solving in a highly regulated environment.
- Adaptability to manage and prioritize multiple projects in a dynamic setting.
- Strong appreciation for customer- and user-centric product design thinking.
What you can expect of us
As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way.
In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards.
Apply now
for a career that defies imagination
Objects in your future are closer than they appear. Join us.
careers.amgen.com
.* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Biology Chemistry Confluence Data analysis Engineering Generative AI Pharma RDF SQL Testing
Perks/benefits: Career development Competitive pay
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.