Data Scientist - Real World Data - data42
Dublin (NOCC), Ireland
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Novartis
Working together, we can reimagine medicine to improve and extend people’s lives.Job Description Summary
Are you passionate about the intersection of data, technology and science, and excited by the potential of Real-World Data (RWD) and AI? Do you thrive in collaborative environments and aspire to contribute to the discovery of groundbreaking medical insights? If so, join the data42 team at Novartis!At Novartis, we reimagine medicine by leveraging state-of-the-art analytics and our extensive internal and external data resources. Our data42 platform grants access to high-quality, multi-modal preclinical and clinical data, along with RWD, creating the optimal environment for developing advanced AI/ML models and generating health insights. Our global team of data scientists and engineers utilizes this platform to uncover novel insights and guide drug development decisions.
As an RWD SME / RWE Execution Data Scientist, you will focus on executing innovative methodologies and AI models to mine RWD on the data42 platform. You will be the go-to authority for leveraging diverse RWD modalities patterns crucial to understanding patient populations, biomarkers, and drug targets, accelerating the development of life-changing medicines.
Job Description
Duties and Responsibilities:
- Collaborate with R&D stakeholders to co-create and implement innovative, repeatable, scalable and automated data and technology solutions in line with data42 strategy.
- Be a data Subject Matter Expert (SME), understand Real World Data (RWD) of different modalities, vocabularies (LOINC, ICD, HCPCS etc.), non-traditional RWD (Patient reported outcomes, Wearables and Mobile Health Data) and where and how they can be used, including in conjunction with clinical data, omics data, pre-clinical data, and commercial data.
- Contribute to data strategy implementation such as Federated Learning, tokenization, data quality frameworks, regulatory requirements (submission data to HL7 FHIR formats conversion, Sentinel initiative), conversion to common data models and standards (OMOP, FHIR, SEND etc.), FAIR principles and integration with enterprise catalog
- Define and execute advanced integrated and scalable analytical approaches and research methodologies (including industry trends) in support of exploratory and regulatory use of AI models for RWD analysis across the Research Development Commercial continuum by facilitating research questions.
- Stay current with emerging applications and trends, driving the development of advanced analytic capabilities for data42 across the Real-world evidence generation lifecycle, from ideation to study design and execution.
- Demonstrate high agility working across various cross-located and cross-functional associates across business domains (commercial, Development, Biomedical Research) or Therapeutic area divisions for our priority disease areas to execute complex and critical business problems with quantified business impact/ROI.
Ideal Candidate Profile:
- PhD or MSc. in a quantitative discipline (e.g., but not restricted to Computer Science, Physics, Statistics, Epidemiology) with proven expertise in artificial Intelligence / Machine Learning.
- 8+ years of relevant experience in Data Science (or 4+ years post-qualification in case of PhD).
- Extensive experience in Statistical and Machine Learning techniques: Regression, Classification, Clustering, Design of Experiments, Monte Carlo Simulations, Statistical Inference, Feature Engineering, Time Series Forecasting, Text Mining, and Natural Language Processing, LLMs, and multi-modal Generative AI.
- Good to have skills: Stochastic models, Bayesian Models, Markov Chains, Optimization techniques including, Dynamic Programming Deep Learning techniques on structured and unstructured data, Recommender Systems.
- Proficiency in tools and packages: Python, R(optional), SQL; exposure to dashboard or web-app building using PowerBI, R-Shiny, Flask, open source or proprietary software and packages is an advantage.
- Knowledge in data standards e.g. OHDSI OMOP, and other data standards, FHIR HL7 for regulatory, and best practices.
- Good to have: Foundry, big data programming, working knowledge of executing data science on AWS, DataBricks or SnowFlake
- Strong in Matrix collaboration environments with good communication and collaboration skills with country/ regional/ global stakeholders in an individual contributor capacity.
Novartis is committed to building an outstanding, inclusive work environment and diverse teams representative of the patients and communities we serve.
Skills Desired
Classification Systems, Clinical Trials, Computer Data Storage, Computer Programming, Cross-Functional Teams, Data Analysis, Data Structures, Initiative, Programming Languages, Reporting, Statistical Analysis* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: AWS Bayesian Big Data Classification Clustering Computer Science Data analysis Databricks Data quality Data strategy Deep Learning Engineering Feature engineering Flask Generative AI HCPCS HL7 LLMs LOINC Machine Learning ML models Monte Carlo NLP OMOP Open Source PhD Physics Power BI Python R R&D Recommender systems Research Snowflake SQL Statistics Unstructured data
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.