Machine Learning Engineer - Data Quality

Bengaluru, Karnataka, India

Apple

We’re a diverse collective of thinkers and doers, continually reimagining what’s possible to help us all do what we love in new ways.

View all jobs at Apple

Apply now Apply later

Summary

Posted: Mar 11, 2025

Role Number:200594546

Join Apple’s Data Management Team and Shape the Future of Reliable Data in the Manufacturing Space! At Apple, we believe that great decisions start with great data. Our Data Quality team within Operations is looking for a Machine Learning Engineer with Software Engineering background to help ensure the integrity, accuracy, and reliability of mission-critical data. In this role, you will design and build scalable ML-driven solutions for data validation, anomaly detection, and quality monitoring—helping Apple’s global operations run with precision and confidence. If you're passionate about applying cutting-edge machine learning to solve real-world data challenges at scale, come join us in pushing the boundaries of what’s possible.

Description


At Apple, we rely on high-quality data to drive critical decisions across our global operations. We are looking for a Machine Learning Engineer with software engineering skills to develop and deploy ML-driven data validation solutions that ensure data integrity. In this role, you will build scalable anomaly detection systems, work on Gen AI projects, collaborate with data engineering teams to enhance data quality frameworks, and drive innovations in MLOps and data monitoring. - Develop ML-based data validation and monitoring solutions, focusing on anomaly detection and explainability. - Analyze large datasets to detect data drift, integrity issues, and emerging quality risks. - Apply the full ML lifecycle, from exploratory data analysis (EDA) and feature engineering to model selection, training, deployment, and monitoring. - Experiment with different methodologies to improve model accuracy and reliability. - Collaborate with data engineering teams to design frameworks for detecting missing, inconsistent, or duplicate data. - Investigate root causes of data quality issues and propose scalable, automated solutions. - Stay up to date with the latest advancements in data science, MLOps, and data engineering best practices.

Minimum Qualifications


  • 3+ industry experience in building ML solutions and collaborating with software teams.
  • Strong experience in machine learning for anomaly detection, data validation, or data quality improvement.
  • Proficiency in Python (Pandas, NumPy, Scikit-learn, PyTorch/TensorFlow, etc.).
  • Hands-on experience with SQL and databases (PostgreSQL, Snowflake, MySQL, etc.).
  • Strong knowledge of statistical methods (PCA, exponential smoothing, and etc.) for detecting anomalies, drift, and inconsistencies.
  • Experience with version control (Git) and software development best practices.


Preferred Qualifications


  • Experience with MLOps tools (MLflow, Kubeflow) for managing data quality models.
  • Exposure to big data frameworks (Spark, Kafka) for real-time data validation.
  • Familiarity with CI/CD for data pipelines and model deployments.
  • Strong problem-solving skills and ability to diagnose complex data issues.
  • Experience working with large-scale structured and unstructured data.
  • Familiarity with data engineering concepts, including ETL pipelines, batch/stream processing.



Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  1  0

Tags: Big Data CI/CD Data analysis Data management Data pipelines Data quality EDA Engineering ETL Feature engineering Generative AI Git Kafka Kubeflow Machine Learning MLFlow MLOps MySQL NumPy Pandas Pipelines PostgreSQL Python PyTorch Scikit-learn Snowflake Spark SQL Statistics TensorFlow Unstructured data

Region: Asia/Pacific
Country: India

More jobs like this