Senior ML Platform Engineer
Boston, MA
ā ļø We'll shut down after Aug 1st - try fooš¦ for all jobs in tech ā ļø
WHOOP
Optimize sleep, strain, and recovery with WHOOP, the most advanced fitness and health wearable. With personalized insights, improve performance, build healthier habits, and extend healthspan with continuous health monitoring.We are seeking a highly skilled Senior ML Platform Engineer to join our ML Platform team. This role is pivotal in scaling our ML infrastructure and enabling the efficient deployment and monitoring of machine learning models across cloud environments. As a senior contributor, you will act as a force multiplierābuilding robust abstractions, platforms, and tooling to supercharge our Data Science and AI teams. You will design and implement scalable systems that operationalize machine learning at WHOOP with speed, reliability, and precision. Your work will directly support our use of AI to enhance team efficiency, automate decision-making, and personalize member experiences.
RESPONSIBILITIES:
- Architect, build, and maintain scalable ML infrastructure in cloud environments (e.g., AWS), optimizing for speed, observability, cost, and reproducibility.
- Develop and evolve MLOps platforms and frameworks that standardize model deployment, versioning, drift detection, and lifecycle management at scale.
- Implement end-to-end CI/CD pipelines for ML models using orchestration tools (e.g., Prefect, Airflow, Argo Workflows), ensuring robust testing, reproducibility, and traceability.
- Partner closely with Data Science, Sensor Intelligence, Data Platform teams to streamline model development, deployment, and monitoring workflows.
- Build and maintain real-time and batch inference infrastructure, supporting diverse use cases from physiological analytics to personalized feedback loops for WHOOP members.
- Design and collaborate on automated observability tooling (e.g., for model latency, data drift, accuracy degradation), integrating metrics, logging, and alerting with existing platforms.
- Leverage AI-powered tools and automation to reduce operational overhead, enhance developer productivity, and accelerate model release cycles.
- Contribute to internal platform documentation, SDKs, and training materials, enabling self-service capabilities for model deployment and experimentation.
- Continuously evaluate emerging technologies and deployment strategies, influencing WHOOPās roadmap for AI-driven platform efficiency and scale.
QUALIFICATIONS:
- Bachelorās or Masterās Degree in Computer Science, Engineering, or a related field; or equivalent practical experience.
- 5+ years of experience in software engineering with a focus on ML infrastructure, cloud platforms, or MLOps.
- Strong programming skills in Python, with experience in building distributed systems and REST/gRPC APIs.
- Deep knowledge of cloud-native services and infrastructure-as-code (e.g., AWS CDK, Terraform, CloudFormation).
- Hands-on experience with model deployment platforms such as AWS SageMaker, Vertex AI, or Kubernetes-based serving stacks.
- Proficiency in ML lifecycle tools (MLflow, Weights & Biases, BentoML) and containerization strategies (Docker, Kubernetes).
- Understanding of data engineering and ingestion pipelines, with ability to interface with data lakes, feature stores, and streaming systems.
- Proven ability to work cross-functionally with Data Science, Data Platform, and Software Engineering teams, influencing decisions and driving alignment.
- Passion for AI and automation to solve real-world problems and improve operational workflows.
Interested in the role, but donāt meet every qualification? We encourage you to still apply! At WHOOP, we believe there is much more to a candidate than what is written on paper, and we value character as much as experience. As we continue to build a diverse and inclusive environment, we encourage anyone who is interested in this role to apply.
WHOOP is an Equal Opportunity Employer and participates inĀ E-verifyĀ to determine employment eligibility.Ā It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index š°
Tags: Airflow APIs AWS BentoML CI/CD CloudFormation Computer Science Distributed Systems Docker Engineering Kubernetes Machine Learning MLFlow ML infrastructure ML models MLOps Model deployment Pipelines Python SageMaker Streaming Terraform Testing Vertex AI Weights & Biases
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.