Director/Sr. Director - Discovery Data Team Lead Engineer, Molecule Discovery

US: USA Remote, United States

Eli Lilly and Company

Lilly is a medicine company turning science into healing to make life better for people around the world.

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.

The Discovery Data Team (DDT) is accelerating molecule discovery through the integration of high-throughput lab data, next-generation sequencing (NGS), lab automation, and machine learning. We’re championing scalable, cloud-native infrastructure to power data pipelines and APIs that unify experimental and computational datasets across the molecule discovery lifecycle and modalities.


We’re seeking a Discovery Data Team Lead Engineer to design and implement robust, scalable infrastructure for ingesting and processing scientific datasets, especially NGS and experimental workflows, from lab instruments, ELNs, and cloud storage systems. You’ll play a key role in leading and shaping the technical and engineering strategy, collaborating closely with scientific and Tech@Lilly teams. You will also lead the strategy to build data pipelines, APIs, and workflow orchestration platforms across AWS and modern data technologies. As the first engineer on the DDT, you’ll work closely with bench scientists, computational scientists, bioinformaticians, and Tech@Lilly on several data initiatives, leading the technical strategy, influencing stakeholders, and informing leadership on the technical roadmaps.

Key Responsibilities:

  • Serve as a technical lead and data architect within the Discovery Data Team in Molecule Discovery
  • Serve as a thought partner to the DDT head on engineering and technical strategy for projects
  • Influence cross-functional partners and drive the technical design of new data products and pipelines
  • Lead a team of engineers to catalyze and execute on data initiatives in molecule discovery
  • Build and scale cloud-native infrastructure to support data ingestion, processing, and retrieval for molecule discovery and sequencing workflows.
  • Develop workflows using Nextflow for NGS data processing and integrate them into larger data pipeline systems.
  • Integrate and extract data from lab instruments and ELNs (e.g., Benchling, Signals) and route them into structured data lakes or databases.
  • Develop and maintain APIs using FastAPI to interface between data sources, pipelines, and downstream applications.
  • Design and implement data pipelines using Airflow, PostgreSQL, Spark, and columnar storage (e.g., Parquet files, Redshift).
  • Deploy, monitor, and optimize infrastructure on AWS, including services like Lambda, Batch, S3, and EC2.
  • Build secure, scalable APIs for data sharing and querying between storage systems and data consumers.
  • Work cross-functionally with bioinformaticians, data scientists, and lab informatics teams to enable seamless scientific data workflows

Basic Requirements:

  • Bachelor's degree or higher in engineering, computer science, or a related scientific field
  • 10+ years of work experience leading engineering teams and working in cloud infrastructure or DevOps roles, with a strong focus on strategic leadership and data systems

Additional Skills/Preferences:

  • Familiarity with columnar data formats and scalable storage architectures (e.g., data lakes, Redshift, Parquet).
  • Excellent problem-solving skills and ability to troubleshoot complex issues.
  • Strong communication and collaboration skills.
  • Experience with Nextflow or similar workflow languages for NGS or scientific data processing.
  • Strong hands-on experience with AWS services, especially Lambda, Batch, S3, and container orchestration.
  • Proficiency with Python and frameworks like FastAPI for developing APIs.
  • Experience with scientific data systems and ELNs like Benchling and Signals.
  • Strong understanding of data pipeline orchestration (Airflow), distributed compute (Spark), and data modeling for scientific datasets.
  • Experience developing solutions using agile methodologies (e.g., Scrum) and tools (e.g., JIRA).
  • Experience working with lab instrumentation data extraction and integration into cloud data stores.
  • Background in bioinformatics, molecular biology, or a related life sciences field.
  • Experience in regulated or GxP-compliant environments.
  • Knowledge of scientific computing environments and HPC systems.
  • Familiarity with workflow containerization (Docker, Singularity) and CI/CD pipelines.

Why Join Us?

  • Be part of a mission-driven, cutting-edge data team advancing scientific discovery through modern data and infrastructure tools.
  • Solve challenging technical problems with real-world impact at one of the biggest healthcare companies in the world
  • Competitive salary, stock options, and excellent benefits package

Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form (https://careers.lilly.com/us/en/workplace-accommodation) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.

Lilly is proud to be an EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status.


Our employee resource groups (ERGs) offer strong support networks for their members and are open to all employees. Our current groups include: Africa, Middle East, Central Asia Network, Black Employees at Lilly, Chinese Culture Network, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ+ Allies), Veterans Leadership Network (VLN), Women’s Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups.

Actual compensation will depend on a candidate’s education, experience, skills, and geographic location. The anticipated wage for this position is $154,500 - $242,000.

Full-time equivalent employees also will be eligible for a company bonus (depending, in part, on company and individual performance). In addition, Lilly offers a comprehensive benefit program to eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision and prescription drug benefits; flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities). Lilly reserves the right to amend, modify, or terminate its compensation and benefit programs in its sole discretion and Lilly’s compensation practices and guidelines will apply regarding the details of any promotion or transfer of Lilly employees.

#WeAreLilly

Tags: Agile Airflow APIs Architecture AWS Bioinformatics Biology CI/CD Computer Science Data pipelines DevOps Docker EC2 Engineering FastAPI HPC Jira Lambda Machine Learning Parquet Pipelines PostgreSQL Python Redshift Scrum Spark

Perks/benefits: Career development Competitive pay Equity / stock options Flex hours Flex vacation Health care Insurance Medical leave Salary bonus

Regions: Remote/Anywhere North America
Country: United States
