Data Engineer - ML Platforms

Irving-750 West John Carpenter

CVS Health

America's leading health solutions company, CVS Health® provides advanced health care from pharmacy services and health plans to health and wellness.

Bring your heart to CVS Health. Every one of us at CVS Health shares a single, clear purpose: Bringing our heart to every moment of your health. This purpose guides our commitment to deliver enhanced human-centric health care for a rapidly changing world. Anchored in our brand — with heart at its center — our purpose sends a personal message that how we deliver our services is just as important as what we deliver.
 
Our Heart At Work Behaviors™ support this purpose. We want everyone who works at CVS Health to feel empowered by the role they play in transforming our culture and accelerating our ability to innovate and deliver solutions to make health care more personal, convenient and affordable.

Position Summary

  • Analyzes complex data structures from various data sources and designs large-scale data engineering pipelines.
  • Implements data ingestion pipelines using APIs or third-party tools, or writes custom code, to ingest high-volume data into the cloud environment.
  • Collaborates with cross-functional teams to understand business requirements and translate them into technical specifications.
  • Develops large-scale data structures and pipelines to collect, organize, and standardize data that helps generate insights and address reporting needs.
  • Implements data quality checks and validation processes to ensure the accuracy, completeness, and consistency of the data.
  • Documents data engineering processes, workflows, and systems for reference and knowledge-sharing purposes.
  • Uses strong programming skills in SQL, PySpark, Python, Java, or another major language to build robust data pipelines and dynamic systems.
  • Works as a team player with team members on business solutions and their implementation.
  • Continuously learns and adopts business subject matter expertise and emerging algorithms and techniques.
  • Analyzes and optimizes pipelines to reduce cost while maintaining accuracy.
  • Creates workflow efficiencies, raises the bar for coding, and works with distributed computing and model training.
  • Designs and implements advanced ML algorithms and models while ensuring smooth operation and maintenance of existing applications.


Required Qualifications

  • 3+ years of building cloud-native analytical products in GCP, Azure, or AWS
  • Sound knowledge of at least one cloud technology is a must, preferably Google Cloud Platform (GCP)
  • 3+ years of hands-on experience working in scalable distributed computation frameworks like Spark
  • 3+ years of Data & Analytics related software development experience in designing & developing ML pipelines, metadata frameworks, reusable components and/or platforms.
  • 3+ years of hands-on experience working in a DevOps/MLOps model, with familiarity with industry deployment best practices using CI/CD
  • Knowledge of programming languages such as SQL, Python, PySpark, or Java/Scala
  • Experience with bash shell scripts, UNIX utilities, and UNIX commands
  • Strong problem-solving skills and critical thinking ability
  • Strong collaboration and communication skills within and across teams


Preferred Qualifications

  • Experience in the healthcare domain is highly desirable.
  • Hands-on ML-centric programming experience with Python (preferred), Java, or R
  • Hands-on experience in developing AI solutions leveraging Python libraries such as PyTorch, TensorFlow, scikit-learn, and XGBoost
  • Expertise working with ML platforms and toolsets such as Vertex AI is preferred.
  • Exposure to implementing unsupervised models, Gen AI, and/or NLP-based solutions using LLMs.
  • Strong interpersonal and communication skills, including the ability to explain and discuss machine learning concepts with cross-functional teams.
  • Exposure to Agile Methodology
  • GCP Certifications: Associate Cloud Engineer/Professional Data Engineer.


Education

  • Bachelor's degree or equivalent work experience in Mathematics, Statistics, Computer Science, Business Analytics, Data Science, Engineering, or related discipline
  • Master’s degree in Computer Science/ML preferred

Pay Range

The typical pay range for this role is:

$72,100.00 - $144,200.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls.  The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors.  This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. 
 
In addition to your compensation, enjoy the rewards of an organization that puts our heart into caring for our colleagues and our communities. The Company offers a full range of medical, dental, and vision benefits. Eligible employees may enroll in the Company’s 401(k) retirement savings plan, and an Employee Stock Purchase Plan is also available for eligible employees. The Company provides a fully-paid term life insurance plan to eligible employees, and short-term and long-term disability benefits. CVS Health also offers numerous well-being programs, education assistance, free development courses, a CVS store discount, and discount programs with participating partners. As for time off, Company employees enjoy Paid Time Off (“PTO”) or vacation pay, as well as paid holidays throughout the calendar year. The number of paid holidays, sick time, and other time off are provided consistent with relevant state law and Company policies.
 
For more detailed information on available benefits, please visit jobs.CVSHealth.com/benefits

We anticipate the application window for this opening will close on: 07/29/2024

Tags: Agile APIs AWS Azure Business Analytics CI/CD Computer Science Data pipelines Data quality Engineering GCP Generative AI Google Cloud Java LLMs Machine Learning Mathematics Model training NLP Pipelines PySpark Python PyTorch R Scala Scikit-learn Spark SQL Statistics TensorFlow Vertex AI XGBoost

Perks/benefits: Career development Gear Health care Insurance Salary bonus

Region: North America
Country: United States
