ML Ops Infrastructure Engineer
Tasks
- Architect model deployment pipelines from staging to production
- Build A B testing infrastructure for model rollouts
- Build observability dashboards for model health
- Create build and test environments that mirror production
- Define model quality gates with research engineers
- Design CI CD pipelines for ML model development validation deployment
- Develop automated retraining pipelines for data changes and performance degradation
- Establish model versioning artifact management and rollback
- Implement production monitoring for model performance latency drift regression
- Optimize model serving infrastructure for latency throughput and cost
Perks/Benefits
- 401k match
- Conference and talks participation
- Flexible schedule
- Home office stipend
- Learning & education stipend
- Mental health support
- Paid US company holidays
- Paid parental leave
- Unlimited PTO
- Wellness stipend
Skills/Tech-stack
A/B | A/B Testing | Automated retraining | B testing | CI/CD | Canary Deployment | Data Versioning | Datadog | Docker | Drift Detection | Feature Stores | Feature evaluation | Grafana | Inference Server | Infrastructure as Code | Kubernetes | ML metadata | ML metadata management | Metadata Management | Model Deployment | Model Serving | Model Validation | Monitoring | NVIDIA Triton | NVIDIA Triton Inference | NVIDIA Triton Inference Server | ONNX Runtime | Observability | Progressive Delivery | Prometheus | Pulumi | Python | Quality Assurance | Regression Alerts | TensorRT | Terraform | Triton Inference Server | “as-code”
Education
N/A
Related jobs
-
Amazon QuickSight | Business Continuity | Business Intelligence | Data Backup | Data GovernanceAdditional paid time off | Early stage company equity potential | Health insurance | Indefinite term contract | Learning budgetSenior-level Full TimeTrabajo a distancia R11h ago
-
Staff Data Engineer BRL 325K-443KAWS Glue | AWS Glue Catalog | Amazon MWAA | Amazon S3 | Apache AirflowRemote workSenior-level Full TimeSão Paulo, SP, Brazil R11h ago
-
Data Engineer PHP 420K-480KAWS | Amazon Redshift | Apache Airflow | Apache Spark | BigQueryEmployee discount | Health insurance | Mentorship | Sports voucher | Training opportunitiesMid-level Full TimeAlabang, Philippines (Hybrid) R14h ago
-
Data Engineers EUR 30K-36KAmazon S3 | Apache Airflow | Apache Spark | CI/CD | Data QualityCompany training | Conference attendance | Flexible compensation benefits | Flexible working hours | Indefinite contractMid-level Full TimeMadrid, MD, Spain R15h ago
-
Sr. Data and AI Engineer USD 180K-200KAgile | Amazon Web Services | Azure | Big Data | Data ArchitecturePublic trust clearance support | Remote workSenior-level Full TimeWork from home, VA, United States R15h ago
-
Senior Data Engineer (Perm, Ireland, Hybrid ) EUR 46K-79KAmazon Redshift | Amazon Web Services | Apache Spark | Automation | CI/CDFlexible working hours | Income protection | Paid time off | Pension match | Private healthcareSenior-level Full TimePermanent R15h ago
-
Senior Data Engineer (Perm, Italy, Hybrid) EUR 51K-79KAgile | Amazon Redshift | Amazon Web Services | Apache Spark | AutomationDeath in service cover | Health insurance | Paid time off | Pension match | Remote working allowanceSenior-level Full TimePermanent R16h ago
-
Senior Data Engineer (Perm, UK, Hybrid) EUR 44K-79KAWS | Amazon Redshift | Apache Hadoop | Apache Spark | AutomationDeath in service coverage | Income protection | Marriage leave | Paid time off | Pension matchSenior-level Full TimePermanent R16h ago
-
Machine Learning Engineer (W/M/D) EUR 48K-60KAWS SageMaker | Artificial Intelligence | Azure ML | CI/CD | DVCAdditional paid days off | Annual learning budget | Family care policy | Flexible work schedule | Free therapy or coaching sessionsMid-level Full TimeParis, IDF, France R17h ago
-
API | AWS | Automated testing | Azure | BigQueryConferences | Flexible time off | Health insurance | Mentorship | Referral bonusesSenior-level Full TimeSpain R17h ago
-
AWS | Anthropic Claude | Azure | Bedrock | CI/CDAnnual leave | Employee referral program | HMO coverage | Work from homeSenior-level Full TimePasig Central Post Office, Philippines R17h ago
-
Mid Fullstack Software Engineer USD 75K-153KFastAPI | Git | Next.js | PostgreSQL | PythonInternational team collaboration | Remote workMid-level Full TimeRemote R18h ago
-
Sr. Applied AI Engineer EUR 58K-81KAgent Orchestration | Agentic loops | Asynchronous services | Benchmarking | CachingEmployee resource groups | Flexible work environment | Hybrid work model | Remote work optionSenior-level Full TimeLisbon, Portugal R18h ago
-
API | Artificial Intelligence | CI/CD | Concurrency | Design PatternsCareer growth | International projects | Language courses | Medical care | Multisport cardSenior-level Full TimeWarsaw R18h ago
-
Senior / Staff Engineer, Data Platform CAD 103K-140KAirflow | Apache Flink | Apache Kafka | Apache Spark | Batch ProcessingCompany-wide events | Game nights | Hybrid work | Team lunchSenior-level Full TimeMontreal R18h ago
-
App Development | CRM Integrations | Dashboard Development | Data Pipelines | Document processingCareer growth | Modern workplace | Remote workMid-level Full TimeRemote R18h ago
-
Data Engineer - H/F EUR 50K-58KAWS Glue | Ansible | Azure Data | Azure Data Factory | Azure DevOpsEmployee stock ownership plan | Health insurance | Maternity return support | Paid time off bonus | TeleworkMid-level Full TimeRennes, Brittany, France R19h ago
-
Data Engineer Spark / Scala - H/F EUR 50K-58KAWS Glue | Ansible | Azure Data | Azure Data Factory | Azure DevOpsEmployee share ownership | Health insurance coverage | Maternity leave return with reduced schedule without salary loss | Paid vacation bonus | Training programsMid-level Full TimeNantes, Pays de la Loire, France R19h ago
-
Data Analyst GBP 26K-26KBigQuery | CI/CD | DBT | Data Modeling | Data PipelinesCompany events | Dental insurance allowance | Flexible working | Four day week option | Gym membership allowanceMid-level Full TimeLondon, England, United Kingdom R21h ago
-
Senior Machine Learning Engineer GBP 74K-100KA/B | A/B Testing | Apache Spark | B testing | ClusteringDental insurance allowance | Flexible working | Gym membership allowance | Holiday and bank holidays | Learning and development budgetSenior-level Full TimeLondon, England, United Kingdom R21h ago
-
AI Engineer III (Remote) EUR 48K-62KCI/CD | Distributed Systems | Evaluation | Information Retrieval | JavaAgile Nomads Experience | Buddy onboarding | Coach support | Company welfare programs | Conference attendanceSenior-level Full TimeRende, Italy R23h ago
-
Evaluation Frameworks | Evaluation Pipelines | Function Calling | Java | LLM tool-useEquity packages | Flexible leave options | Inclusive parental leave | Virtual interviews | Wellbeing allowanceSenior-level Full TimeBrisbane, QLD, Australia R1d ago
-
Evaluation Pipelines | Function Calling | Governance | Java | LLM tool-useEquity packages | Flexible leave options | Flexible remote work within Australia | Inclusive parental leave | Wellbeing allowanceSenior-level Full TimeMelbourne, VIC, Australia R1d ago
-
Function Calling | Java | LLM tool-use | Langchain | LanggraphEquity packages | Flexible leave options | Inclusive parental leave | Virtual interviews | Wellbeing allowanceSenior-level Full TimeSydney, Australia R1d ago
-
Ingénieur / ingénieure Data – IA Générative EUR 60K-70KAWS | AWS Bedrock | Azure DevOps | CI/CD | Copilot StudioCareer development support | Diversity inclusion agreements | Employee share participation | Health insurance | Paid time offSenior-level Full TimeÉchirolles, Auvergne-Rhône-Alpes, France R1d ago