Principal Data Engineer
United States - Alameda : 2901 Harbor Bay Parkway, United States
Full Time Senior-level / Expert USD 128K - 256K
Abbott
Innovative medical devices and health care solutions for cardiovascular health, diabetes management, diagnostic testing, nutrition, chronic pain and more.
JOB DESCRIPTION:
Working at Abbott
At Abbott, you can do work that matters, grow, and learn, care for yourself and family, be your true self and live a full life. You’ll also have access to:
Career development with an international company where you can grow the career you dream of.
Free medical coverage for employees* via the Health Investment Plan (HIP) PPO
An excellent retirement savings plan with high employer contribution
Tuition reimbursement, the Freedom 2 Save student debt program and FreeU education benefit - an affordable and convenient path to getting a bachelor’s degree.
A company recognized as a great place to work in dozens of countries around the world and named one of the most admired companies in the world by Fortune.
A company that is recognized as one of the best big companies to work for as well as a best place to work for diversity, working mothers, female executives, and scientists.
About Lingo
Meet Lingo, a new biosensing technology that provides users a window into their body. Lingo tracks key biomarkers – such as glucose, ketones, and lactate – to help people make better decisions about their health and nutrition. Biowearable technology will digitize, decentralize and democratize healthcare, enabling consumers to take control of their own health.
The Opportunity
This position works out of our Alameda location in the Lingo Division. We are seeking a Principal Data Engineer with deep expertise in Apache Spark to join our team. You will play a critical role in helping us deliver clinically validated, regulatory grade digital health products and solutions in preventive care using Bio-Wearables. As a senior leader in data engineering, you will help design, develop, and optimize a large-scale real-time data processing system. You will work closely with data analysts, scientists, system designers, and software developers to build scalable data services, pipelines and interfaces to support our mission. This is a hands-on technical role requiring strong problem-solving skills and deep knowledge of distributed systems.
What You’ll Work On
Design, implement, and optimize Apache Spark jobs to define, schedule, monitor, and control data processes.
Develop, test and deploy algorithms for large-scale machine learning in a production environment.
Optimize Spark jobs to maximize speed, scalability and compliance with data-use regulations.
Manage robust data pipelines: acquisition, transformation, and processing workflows.
Perform advanced data processing and exploratory analysis • Build and deploy machine learning models using Spark or MapReduce, and present results through intuitive visualizations.
Collaborate with other spark developers and back-end data engineers to design and develop interactive Spark pipelines.
Develop and maintain REST APIs to manage Spark jobs and facilitate data workflows.
Ensure seamless integration of pipelines with distributed file and database systems.
Required Qualifications
Bachelor's degree in Computer Science, Engineering, or a related field.
5+ years of experience in building and optimizing data processing pipelines.
Expertise in Apache Spark, including Spark SQL, RDD, DataFrame, Dataset APIs, and PySpark.
Proficiency in programming languages such as Python, and Java (Scala is a plus).
In-depth understanding of Spark internals and real-time streaming technology (e.g., Kafka, KSQL, etc.)
Strong experience with big data processing tools (Experience with Apache Druid is a huge plus).
Familiarity with ETL tools and expertise in managing post-loading transformations.
Advanced knowledge of distributed file systems like HDFS, S3, or Ceph.
Experience with cloud platforms such as AWS or Azure for building Apache Spark clusters.
Understanding of machine learning algorithms and their application in large-scale data environments.
Familiarity with data visualization tools (e.g., Tableau, Power BI, or others).
Strong communication and collaboration skills, with a proven ability to work cross functionally with analysts, scientists and developers.
* Participants who complete a short wellness assessment qualify for FREE coverage in our HIP PPO medical plan. Free coverage applies in the next calendar year.
Learn more about our health and wellness benefits, which provide the security to help you and your family live full lives: www.abbottbenefits.com
Follow your career aspirations to Abbott for diverse opportunities with a company that can help you build your future and live your best life. Abbott is an Equal Opportunity Employer, committed to employee diversity.
Connect with us at www.abbott.com, on Facebook at www.facebook.com/Abbott and on Twitter @AbbottNews.
The base pay for this position is
$128,000.00 – $256,000.00In specific locations, the pay range may vary from the range posted.
JOB FAMILY:
Product Development
DIVISION:
LNGO Lingo
LOCATION:
United States > Alameda : 2901 Harbor Bay Parkway
ADDITIONAL LOCATIONS:
WORK SHIFT:
Standard
TRAVEL:
No
MEDICAL SURVEILLANCE:
No
SIGNIFICANT WORK ACTIVITIES:
Continuous sitting for prolonged periods (more than 2 consecutive hours in an 8 hour day), Keyboard use (greater or equal to 50% of the workday)Abbott is an Equal Opportunity Employer of Minorities/Women/Individuals with Disabilities/Protected Veterans.
EEO is the Law link - English: http://webstorage.abbott.com/common/External/EEO_English.pdf
EEO is the Law link - Espanol: http://webstorage.abbott.com/common/External/EEO_Spanish.pdf
Tags: APIs AWS Azure Big Data Computer Science Data pipelines Data visualization Distributed Systems Engineering ETL HDFS Java Kafka Machine Learning ML models Pipelines Power BI PySpark Python Scala Security Spark SQL Streaming Tableau
Perks/benefits: Career development Health care Team events Wellness
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.