Senior Data Engineer, Health & Bioinformatics

Remote

Gradient AI



This is a fully remote opportunity

Gradient AI:    

Gradient AI is revolutionizing Group Health and P&C insurance with AI-powered solutions that help insurers predict risk more accurately, improve profitability, and automate underwriting and claims. Our SaaS platform taps into one of the industry’s largest data lakes—tens of millions of policies and claims—to deliver deep, actionable insights. Trusted by leading carriers, MGAs, TPAs, and self-insured employers, Gradient AI has grown rapidly since our founding in 2018. Backed by $56M in Series C funding, we're scaling fast—and it's an exciting time to join the team. 

About the Role:    

We are seeking a Senior Data Engineer with a deep understanding of healthcare data (claims, clinical, patient-reported outcomes, or bioinformatics data) to design, build, and manage data pipelines for our health insurance clients. The ideal candidate is a lifelong learner with a proven ability to interpret complex health-related datasets. This role requires expertise in using Airflow to orchestrate ETL pipelines that move healthcare data across systems efficiently and reliably. Subject matter expertise in health and bioinformatics data is essential for success in this role.

How you will make an impact: 

  • Design, build, and implement data systems to support ML and AI models for our health insurance clients, ensuring strict compliance with healthcare data privacy and security regulations (e.g., HIPAA). 
  • Develop tools for extracting, processing, and profiling diverse healthcare data sources, including EHRs, medical claims, pharmacy data, and genomic data. 
  • Collaborate with data scientists to transform large volumes of health-related and bioinformatics data into modeling-ready formats, prioritizing data quality, integrity, and reliability in healthcare applications. 
  • Build and maintain infrastructure for the extraction, transformation, and loading (ETL) of data from a variety of sources using SQL, AWS, and healthcare-specific big data technologies and analytics platforms. 
  • Apply health and bioinformatics subject matter expertise to ensure data pipelines meet the unique requirements of health, medical, and bioinformatics data processing - including translating complex medical and biological concepts into actionable data requirements. 

Skills needed to succeed:

  • BS in Computer Science, Bioinformatics, or another quantitative discipline, with 7+ years of experience working with and interpreting health, medical, and bioinformatics data, including real-world healthcare datasets.
  • Subject matter expertise (SME) in health and bioinformatics data, with a strong grasp of the complexities and challenges of processing medical and biological information. 
  • Knowledge of healthcare data standards (e.g., FHIR, HL7) and a solid understanding of healthcare data privacy and security regulations (such as HIPAA) are highly desirable. 
  • Proficiency in Python and SQL within a professional environment. 
  • Hands-on knowledge of big data tools like Apache Spark (PySpark), Databricks, Snowflake, or similar platforms.
  • Skilled in using data orchestration frameworks such as Airflow, Dagster, or Prefect. 
  • Comfortable working within cloud computing environments, preferably AWS, along with Linux systems. 
  • Ability to work with and visualize health or medical data. Insurtech industry exposure is a plus.

What We Offer:  

  • A fun, team-oriented startup culture.  
  • Generous stock options - we all get to own a piece of what we’re building.  
  • Unlimited vacation days.  
  • Flexible schedule that supports working from home.  
  • Full benefits package, including medical, dental, vision, 401k, paid paternal leave, and more.
  • Ample opportunities to learn and take on new responsibilities.  

We are an equal opportunity employer. 


