Machine Learning Engineer - Data Quality
Bengaluru, Karnataka, India
Apple
We’re a diverse collective of thinkers and doers, continually reimagining what’s possible to help us all do what we love in new ways.Summary
Posted: Mar 11, 2025Role Number:200594546
Join Apple’s Data Management Team and Shape the Future of Reliable Data in the Manufacturing Space! At Apple, we believe that great decisions start with great data. Our Data Quality team within Operations is looking for a Machine Learning Engineer with Software Engineering background to help ensure the integrity, accuracy, and reliability of mission-critical data. In this role, you will design and build scalable ML-driven solutions for data validation, anomaly detection, and quality monitoring—helping Apple’s global operations run with precision and confidence. If you're passionate about applying cutting-edge machine learning to solve real-world data challenges at scale, come join us in pushing the boundaries of what’s possible.
Description
At Apple, we rely on high-quality data to drive critical decisions across our global operations. We are looking for a Machine Learning Engineer with software engineering skills to develop and deploy ML-driven data validation solutions that ensure data integrity. In this role, you will build scalable anomaly detection systems, work on Gen AI projects, collaborate with data engineering teams to enhance data quality frameworks, and drive innovations in MLOps and data monitoring. - Develop ML-based data validation and monitoring solutions, focusing on anomaly detection and explainability. - Analyze large datasets to detect data drift, integrity issues, and emerging quality risks. - Apply the full ML lifecycle, from exploratory data analysis (EDA) and feature engineering to model selection, training, deployment, and monitoring. - Experiment with different methodologies to improve model accuracy and reliability. - Collaborate with data engineering teams to design frameworks for detecting missing, inconsistent, or duplicate data. - Investigate root causes of data quality issues and propose scalable, automated solutions. - Stay up to date with the latest advancements in data science, MLOps, and data engineering best practices.
Minimum Qualifications
- 3+ industry experience in building ML solutions and collaborating with software teams.
- Strong experience in machine learning for anomaly detection, data validation, or data quality improvement.
- Proficiency in Python (Pandas, NumPy, Scikit-learn, PyTorch/TensorFlow, etc.).
- Hands-on experience with SQL and databases (PostgreSQL, Snowflake, MySQL, etc.).
- Strong knowledge of statistical methods (PCA, exponential smoothing, and etc.) for detecting anomalies, drift, and inconsistencies.
- Experience with version control (Git) and software development best practices.
Preferred Qualifications
- Experience with MLOps tools (MLflow, Kubeflow) for managing data quality models.
- Exposure to big data frameworks (Spark, Kafka) for real-time data validation.
- Familiarity with CI/CD for data pipelines and model deployments.
- Strong problem-solving skills and ability to diagnose complex data issues.
- Experience working with large-scale structured and unstructured data.
- Familiarity with data engineering concepts, including ETL pipelines, batch/stream processing.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Big Data CI/CD Data analysis Data management Data pipelines Data quality EDA Engineering ETL Feature engineering Generative AI Git Kafka Kubeflow Machine Learning MLFlow MLOps MySQL NumPy Pandas Pipelines PostgreSQL Python PyTorch Scikit-learn Snowflake Spark SQL Statistics TensorFlow Unstructured data
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.