Data Engineer

US: Indianapolis IN Tech Center North

Eli Lilly and Company

Lilly is a medicine company turning science into healing to make life better for people around the world.

View all jobs at Eli Lilly and Company

Apply now Apply later

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.

Join our R&D IT team as a Data Engineer and drive innovation in drug discovery by building cutting-edge data solutions in a leading pharmaceutical company. 

The R&D IT team is actively looking for a skilled Data Engineer with a passion for harnessing data to fuel innovation in pharmaceutical research. Are you someone who thrives in building robust data pipelines, enjoys working with cutting-edge technologies, and has a keen eye for detail in ensuring data quality and integrity? 

What You’ll Be Doing:

As a Data Engineer in our R&D IT team, you'll be at the forefront of transforming vast amounts of research data into actionable insights that drive drug discovery. You'll design and implement advanced data pipelines and cloud-based solutions, ensuring our researchers have the high-quality data they need to develop life-saving treatments. Your work will impact the future of healthcare innovation. 

How You’ll Succeed:

  • Build Robust Data Pipelines: You will design and implement scalable data pipelines that efficiently process and integrate large datasets from various sources. Success means creating reliable, high-performance systems that support our research objectives.

  • Ensure Data Quality and Integrity: By applying best practices in data governance, validation, and monitoring, you'll ensure that the data is accurate, complete, and consistent. Your attention to detail will be crucial in maintaining the trustworthiness of our data assets.

  • Collaborate Across Teams: You will work closely with data scientists, researchers, and IT professionals to understand their data needs and translate them into effective technical solutions. Strong communication and collaboration skills will be key to your success.

  • Leverage Cloud Technologies: By utilizing cloud platforms and services, you’ll build and maintain scalable, secure, and cost-effective data infrastructure. Your ability to innovate and optimize in the cloud will set you apart.

  • Proactively Solve Problems: You’ll anticipate potential challenges, identify bottlenecks, and troubleshoot issues before they impact the research process. Being proactive and solution-oriented will make you a valuable asset to the team.

What You Should Bring:
The successful candidate will leverage their knowledge of modern data management, cloud architecture, strong problem solving, and software development skills to -

  • Foster collaborations with scientists across Indianapolis and San Diego, to design and implement workflow automation.

  • Design data architecture, drive the creation of analysis-ready data products by leveraging informatics, data engineering and data science skills.

  • Ensure that the potential business value from data products and capabilities are captured, optimized, and recognized. Enable timely and efficient data updates, queries, and data mining processes.

  • Identify best practices and implement solutions that help to better extract, transform, and load (ETL) the data into either cloud-based or local database systems.

  • Automate key services and tasks across on-prem and cloud systems to increase efficiency and scalability.

  • Migrating on-prem software solutions and related data to the cloud.

  • Collaborate with computational scientists to scale-up novel informatics pipeline for efficient data processing and visualization

  • Drive the evaluation of external and internal data platforms/services/products with focus on business value, capabilities, and sustainability.

Your Basic Qualifications:

  • Bachelor's degree in computer science, computer engineering, data science or related field

  • Knowledge in the pharmaceutical or life sciences domain

  • 3+ years of experience in processing, organizing, integrating, analyzing and visualizing complex data and information from disparate sources

​Additional Skills:

  • Benchling Experience is a plus

  • Extensive hands-on experience in languages such as Python or R, and related informatics code packages and tools (e.g Pandas, Plotty, Jupyter NoteBook)

  • Experience in cloud (preferably AWS) development, deployment, Docker container management.

  • Experienced in data modeling, SQL and NoSQL databases

  • Experience in GitHub and API development 

  • Experience in Linux OS and shell scripting

  • Experience in Airflow, building pipeline for data ingestion and data harmonization.

  • Ability to work independently to analyze customer requirements and translating to technology spec.

  • Ability to communicate and operate effectively across our multi-disciplinary, global environment and take own initiatives.

  • Demonstrated track record of agile learning and strong problem-solving skills.

​Additional Information:

  • Role located in Indianapolis, In and would require relocation

Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form (https://careers.lilly.com/us/en/workplace-accommodation) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.

Lilly is an EEO/Affirmative Action Employer and does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status.

Our employee resource groups (ERGs) offer strong support networks for their members and help our company develop talented individuals for future leadership roles. Our current groups include: Africa, Middle East, Central Asia Network, African American Network, Chinese Culture Network, Early Career Professionals, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinos at Lilly, PRIDE (LGBTQ + Allies), Veterans Leadership Network, Women’s Network, Working and Living with Disabilities. Learn more about all of our groups.

#WeAreLilly

Apply now Apply later
  • Share this job via
  • 𝕏
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  3  0  0
Category: Engineering Jobs

Tags: Agile Airflow API Development APIs Architecture AWS Computer Science Data governance Data management Data Mining Data pipelines Data quality Docker Drug discovery Engineering ETL GitHub Jupyter Linux NoSQL Pandas Pharma Pipelines Python R R&D Research Shell scripting SQL

Perks/benefits: Career development Relocation support

Region: North America
Country: United States

More jobs like this