ML Ops Infrastructure Engineer
Tasks
- Architect model deployment pipelines from staging to production
- Build A/B testing infrastructure for model rollouts
- Build observability dashboards for model health
- Create build and test environments that mirror production
- Define model quality gates with research engineers
- Design CI/CD pipelines for ML model development, validation, and deployment
- Develop automated retraining pipelines for data changes and performance degradation
- Establish model versioning, artifact management, and rollback
- Implement production monitoring for model performance, latency, drift, and regression
- Optimize model serving infrastructure for latency, throughput, and cost
Perks/Benefits
- 401k match
- Conference and talks participation
- Flexible schedule
- Home office stipend
- Learning & education stipend
- Mental health support
- Paid US company holidays
- Paid parental leave
- Unlimited PTO
- Wellness stipend
Skills/Tech-stack
A/B Testing | Automated retraining | CI/CD | Canary Deployment | Data Versioning | Datadog | Docker | Drift Detection | Feature Stores | Feature evaluation | Grafana | Infrastructure as Code | Kubernetes | ML metadata management | Model Deployment | Model Serving | Model Validation | Monitoring | NVIDIA Triton Inference Server | ONNX Runtime | Observability | Progressive Delivery | Prometheus | Pulumi | Python | Quality Assurance | Regression Alerts | TensorRT | Terraform
Education
N/A