Principal Data Engineer

United States - Alameda : 2901 Harbor Bay Parkway, United States

Full Time Senior-level / Expert USD 128K - 256K

Abbott

Innovative medical devices and health care solutions for cardiovascular health, diabetes management, diagnostic testing, nutrition, chronic pain and more.

View all jobs at Abbott

Apply now Apply later

Posted 4 days ago

Abbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-changing technologies spans the spectrum of healthcare, with leading businesses and products in diagnostics, medical devices, nutritionals and branded generic medicines. Our 114,000 colleagues serve people in more than 160 countries.

JOB DESCRIPTION:

Working at Abbott

At Abbott, you can do work that matters, grow, and learn, care for yourself and family, be your true self and live a full life. You’ll also have access to:

Career development with an international company where you can grow the career you dream of.

Free medical coverage for employees* via the Health Investment Plan (HIP) PPO

An excellent retirement savings plan with high employer contribution

Tuition reimbursement, the Freedom 2 Save student debt program and FreeU education benefit - an affordable and convenient path to getting a bachelor’s degree.

A company recognized as a great place to work in dozens of countries around the world and named one of the most admired companies in the world by Fortune.

A company that is recognized as one of the best big companies to work for as well as a best place to work for diversity, working mothers, female executives, and scientists.

About Lingo

Meet Lingo, a new biosensing technology that provides users a window into their body. Lingo tracks key biomarkers – such as glucose, ketones, and lactate – to help people make better decisions about their health and nutrition. Biowearable technology will digitize, decentralize and democratize healthcare, enabling consumers to take control of their own health.

The Opportunity

This position works out of our Alameda location in the Lingo Division. We are seeking a Principal Data Engineer with deep expertise in Apache Spark to join our team. You will play a critical role in helping us deliver clinically validated, regulatory grade digital health products and solutions in preventive care using Bio-Wearables. As a senior leader in data engineering, you will help design, develop, and optimize a large-scale real-time data processing system. You will work closely with data analysts, scientists, system designers, and software developers to build scalable data services, pipelines and interfaces to support our mission. This is a hands-on technical role requiring strong problem-solving skills and deep knowledge of distributed systems.

What You’ll Work On

Design, implement, and optimize Apache Spark jobs to define, schedule, monitor, and control data processes.
Develop, test and deploy algorithms for large-scale machine learning in a production environment.
Optimize Spark jobs to maximize speed, scalability and compliance with data-use regulations.
Manage robust data pipelines: acquisition, transformation, and processing workflows.
Perform advanced data processing and exploratory analysis • Build and deploy machine learning models using Spark or MapReduce, and present results through intuitive visualizations.
Collaborate with other spark developers and back-end data engineers to design and develop interactive Spark pipelines.
Develop and maintain REST APIs to manage Spark jobs and facilitate data workflows.
Ensure seamless integration of pipelines with distributed file and database systems.

Required Qualifications

Bachelor's degree in Computer Science, Engineering, or a related field.
5+ years of experience in building and optimizing data processing pipelines.
Expertise in Apache Spark, including Spark SQL, RDD, DataFrame, Dataset APIs, and PySpark.
Proficiency in programming languages such as Python, and Java (Scala is a plus).
In-depth understanding of Spark internals and real-time streaming technology (e.g., Kafka, KSQL, etc.)
Strong experience with big data processing tools (Experience with Apache Druid is a huge plus).
Familiarity with ETL tools and expertise in managing post-loading transformations.
Advanced knowledge of distributed file systems like HDFS, S3, or Ceph.
Experience with cloud platforms such as AWS or Azure for building Apache Spark clusters.
Understanding of machine learning algorithms and their application in large-scale data environments.
Familiarity with data visualization tools (e.g., Tableau, Power BI, or others).
Strong communication and collaboration skills, with a proven ability to work cross functionally with analysts, scientists and developers.

Apply Now

* Participants who complete a short wellness assessment qualify for FREE coverage in our HIP PPO medical plan. Free coverage applies in the next calendar year.

Learn more about our health and wellness benefits, which provide the security to help you and your family live full lives:  www.abbottbenefits.com

Follow your career aspirations to Abbott for diverse opportunities with a company that can help you build your future and live your best life. Abbott is an Equal Opportunity Employer, committed to employee diversity.

Connect with us at www.abbott.com, on Facebook at www.facebook.com/Abbott and on Twitter @AbbottNews.

The base pay for this position is

$128,000.00 – $256,000.00

In specific locations, the pay range may vary from the range posted.