Staff Data Engineer

US Remote

H1

H1 helps life sciences, pharma, health plans and digital health organizations identify key providers and prescribers, accelerate clinical trials, and advance patient care everywhere.

View all jobs at H1

Apply now Apply later

At H1, we believe access to the best healthcare information is a basic human right. Our mission is to provide a platform that can optimally inform every doctor interaction globally. This promotes health equity and builds needed trust in healthcare systems. To accomplish this our teams harness the power of data and AI-technology to unlock groundbreaking medical insights and convert those insights into action that result in optimal patient outcomes and accelerates an equitable and inclusive drug development lifecycle.  Visit h1.co to learn more about us.
Data Engineering is responsible for the development and delivery of our most important asset - our data. Looking across thousands of data sources from across the globe, the data engineering team is responsible for making sense out of that data to create the world’s most extensive and comprehensive knowledge base of healthcare stakeholders and the ecosystem they influence. It is our job to ensure that only accurate, normalized data flows through to our customers, and at a velocity that keeps up with the changes in the real world.

WHAT YOU'LL DO AT H1As a Staff Data Engineer on the H1DN Team, you will be a hands-on technical leader, directly contributing to the development of scalable and efficient data architectures. You will collaborate closely with two IC team members, guiding them while actively participating in the work. This role centers on data ingestion and enrichment workflows, ensuring seamless integration of client data from various sources (CSV, Parquet, JSON, APIs) and addressing scalability, data quality, and standardization challenges.
You will:- Lead the development of new features within our client data ingestion platform, transforming, standardizing, and enriching with H1 dataset to meet business needs, including customizable solutions for high-stakes client integrations.- Focus on optimizing infrastructure to deliver product- and client-ready insights efficiently, ensuring enriched data integrates seamlessly into broader pipelines.- Collaborate with clients, subject matter experts, and product teams to drive critical integrations and shape the evolution of data workflows.- Establish best practices for data quality, system reliability, and scalable processing to support growing datasets.- Mentor and guide engineers on the team, promoting best practices and fostering a culture of technical excellence.- Advocate for engineering improvements, including scalable designs, quality assurance, and technical documentation standards.- Serve as a cultural leader within the engineering team, promoting high standards of excellence and continuous improvement in engineering practices.- Ensure the projects you work on deliver clear end-user impact, align with strategic goals, and are accountable for meaningful outcomes.

ABOUT YOUYou have strong hands-on technical skills and substantial experience in data engineering, with a proven track record of building and maintaining scalable data systems and pipelines. As a proactive and visionary technical leader, you excel at solving complex data engineering challenges and driving innovative solutions. 
- Proven ability to lead the development of complex data workflows, applying business logic for data enrichment and resolving challenges with creative solutions- Expertise in building and scaling data infrastructure, including integrating with core platforms.- Strong experience addressing data quality challenges and implementing robust validation mechanisms.- Self-motivated with the ability to manage tasks and projects independently.- Able to understand and align with broader organizational goals and strategies.- Proactively identifies potential risks and mitigates them early in the project lifecycle.- Passionate about mentoring junior engineers and fostering a collaborative, high-performing team culture.

REQUIREMENTS - 8+ years of experience in data engineering, specializing in building scalable data pipelines and enrichment processes, with a strong track record of working with large datasets, including ingestion, transformation, and optimization- Proficiency in Spark, Python, and SQL for building scalable data processing pipelines.- Hands-on experience with Kubernetes for container orchestration and deployment.- Strong background in AWS, including services such as S3, Lambda, ECS, and RDS for data infrastructure.- Strong SQL skills, including the ability to write optimized complex queries for  large datasets using advanced SQL operators  such as GROUP BY, HAVING, window functions, and complex joins.- Experience with EMR and Databricks to optimize large-scale data workflows.- In-depth understanding of optimizing LLM usage in production, with experience integrating LLMs into real-world applications and applying LLM-powered insights within data pipelines or customer-facing solutions

COMPENSATIONThis role pays $155,000 to $175,000 per year, based on experience, in addition to stock options.
Anticipated role close date: 05/28/2025

H1 OFFERS- Full suite of health insurance options, in addition to generous paid time off- Pre-planned company-wide wellness holidays- Retirement options- Health & charitable donation stipends- Impactful Business Resource Groups- Flexible work hours & the opportunity to work from anywhere- The opportunity to work with leading biotech and life sciences companies in an innovative industry with a mission to improve healthcare around the globe

H1 is proud to be an equal opportunity employer that celebrates diversity and is committed to creating an inclusive workplace with equal opportunity for all applicants and teammates. Our goal is to recruit the most talented people from a diverse candidate pool regardless of race, color, ancestry, national origin, religion, disability, sex (including pregnancy), age, gender, gender identity, sexual orientation, marital status, veteran status, or any other characteristic protected by law. H1 is committed to working with and providing access and reasonable accommodation to applicants with mental and/or physical disabilities. If you require an accommodation, please reach out to your recruiter once you've begun the interview process. All requests for accommodations are treated discreetly and confidentially, as practical and permitted by law.
Apply now Apply later
Job stats:  1  0  0

Tags: APIs Architecture AWS CSV Databricks Data pipelines Data quality ECS Engineering Excel JSON Kubernetes Lambda LLMs Parquet Pipelines Python Spark SQL

Perks/benefits: Equity / stock options Flex hours Flex vacation Insurance

Regions: Remote/Anywhere North America
Country: United States

More jobs like this