ML Ops Infrastructure Engineer
Tasks
- Architect model deployment pipelines from staging to production
- Build A/B testing infrastructure for model rollouts
- Build observability dashboards for model health
- Create build and test environments that mirror production
- Define model quality gates with research engineers
- Design CI/CD pipelines for ML model development, validation, and deployment
- Develop automated retraining pipelines triggered by data changes and performance degradation
- Establish model versioning, artifact management, and rollback
- Implement production monitoring for model performance, latency, drift, and regression
- Optimize model serving infrastructure for latency, throughput, and cost
Perks/Benefits
- 401k match
- Conference and talks participation
- Flexible schedule
- Home office stipend
- Learning & education stipend
- Mental health support
- Paid US company holidays
- Paid parental leave
- Unlimited PTO
- Wellness stipend
Skills/Tech-stack
A/B Testing | Automated Retraining | CI/CD | Canary Deployment | Data Versioning | Datadog | Docker | Drift Detection | Feature Stores | Feature Evaluation | Grafana | Infrastructure as Code | Kubernetes | ML Metadata Management | Model Deployment | Model Serving | Model Validation | Monitoring | NVIDIA Triton Inference Server | ONNX Runtime | Observability | Progressive Delivery | Prometheus | Pulumi | Python | Quality Assurance | Regression Alerts | TensorRT | Terraform
Education
N/A