Staff Data Engineer #4009
Menlo Park, CA
GRAIL is focused on improving lives by developing pioneering technologies to detect cancer early. As a member of our team, you will help manage the end-to-end data lifecycle, ensuring data integrity, reliability, and compliance in a regulated environment. You will work closely with cross-functional teams including lab scientists, data scientists, biostatisticians, medical directors, and software engineers to create critical datasets and data solutions that drive our product pipeline.
We are seeking a Staff Data Engineer to develop, optimize, and manage GRAIL’s data lifecycle from sample ingestion to analysis, ensuring compliance with regulatory and clinical standards. You will partner with cross-functional teams to ensure that data solutions are high-quality, scalable, and aligned with our regulatory requirements, including FDA and other global health authorities.
This is a hybrid role and requires you to be onsite at least 2 days a week in Menlo Park, CA.
We are seeking a Staff Data Engineer to develop, optimize, and manage GRAIL’s data lifecycle from sample ingestion to analysis, ensuring compliance with regulatory and clinical standards. You will partner with cross-functional teams to ensure that data solutions are high-quality, scalable, and aligned with our regulatory requirements, including FDA and other global health authorities.
This is a hybrid role and requires you to be onsite at least 2 days a week in Menlo Park, CA.
Responsibilities:
- Lead the design, development, and optimization of scalable ETL pipelines and data configurations to support the ingestion, transformation, and analysis of clinical and research datasets, ensuring alignment with regulatory and product requirements.
- Collaborate with data scientists, biostatisticians, and clinical teams to understand and address the data needs of various programs, including clinical trials, research studies, and regulatory submissions.
- Ensure data integrity, traceability, and quality through robust validation procedures, ensuring compliance with FDA guidelines and other regulatory requirements.
- Proactively identify new technologies, methodologies, and processes to address evolving data management challenges within a regulated biotechnology environment.
- Manage the generation and maintenance of metadata, data navigation tools, and documentation to support operational objectives and streamline study processes.
- Support study operations by ensuring that datasets are structured to meet clinical, scientific, and regulatory milestones, including data locks, submissions, and monitoring.
Preferred Qualifications:
- BS/MS in a quantitative scientific field (Computer Science, Engineering, Mathematics, Statistics, Bioinformatics, etc.) with 8+ years of experience in data engineering, ideally within a regulated environment such as biotechnology, pharmaceuticals, medical devices, or healthcare.
- Strong understanding of ETL processes, data pipeline development, and database management, with proven experience delivering data solutions in support of clinical or regulatory requirements.
- Expertise in SQL and Python or R
- Experience working with cloud-based data platforms (AWS, Azure, Google Cloud) with a strong understanding of compliance frameworks (e.g., HIPAA, 21 CFR Part 11, GDPR).
- Excellent problem-solving skills with a track record of ensuring data quality and integrity across complex datasets.
- Demonstrated success working in cross-functional, collaborative teams, with the ability to translate user requirements into scalable, high-quality data solutions.
- 3+ years of experience working in a regulated industry (biotechnology, medical devices, healthcare) with knowledge of compliance and regulatory requirements for data management.
- Proven experience in data lifecycle management for clinical trials, including understanding of regulatory submissions (e.g., FDA PMA, IDMC reports).
- Familiarity with tools like Apache Airflow and DBT for data pipeline orchestration in regulated environments.
Highly Desired Qualifications
Job stats:
0
0
0
Categories:
Engineering Jobs
Leadership Jobs
Tags: Airflow AWS Azure Bioinformatics Computer Science Data management Data quality dbt Engineering ETL GCP Google Cloud Mathematics Pharma Pipelines Python R Research SQL Statistics
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Principal Data Scientist jobsPrincipal Data Engineer jobsData Scientist II jobsStaff Data Scientist jobsBI Developer jobsData Manager jobsJunior Data Analyst jobsResearch Scientist jobsData Science Manager jobsBusiness Data Analyst jobsLead Data Analyst jobsSenior AI Engineer jobsData Engineer III jobsData Science Intern jobsSr. Data Scientist jobsData Specialist jobsSoftware Engineer II jobsData Analyst Intern jobsSoftware Engineer, Machine Learning jobsJunior Data Engineer jobsData Analyst II jobsBI Analyst jobsSenior Data Scientist, Performance Marketing jobsSr Data Engineer jobsPrincipal Software Engineer jobs
Economics jobsSnowflake jobsLinux jobsHadoop jobsComputer Vision jobsOpen Source jobsJavaScript jobsMLOps jobsPhysics jobsBanking jobsRDBMS jobsKafka jobsAirflow jobsNoSQL jobsData Warehousing jobsScala jobsR&D jobsGoogle Cloud jobsKPIs jobsStreaming jobsData warehouse jobsClassification jobsGitHub jobsOracle jobsCX jobs
SAS jobsPostgreSQL jobsScikit-learn jobsData Mining jobsScrum jobsE-commerce jobsPandas jobsTerraform jobsDistributed Systems jobsPySpark jobsLooker jobsBigQuery jobsRobotics jobsJira jobsIndustrial jobsJenkins jobsUnstructured data jobsdbt jobsRedshift jobsReact jobsData strategy jobsMicroservices jobsMySQL jobsPharma jobsNumPy jobs