MLOps Engineer
Tasks
- Build autoscaling and capacity management
- Collaborate on new model release support
- Design model serving platforms
- Develop deployment workflows with canary and rollback
- Document operational procedures and tuning guidance
- Drive observability for latency and errors
- Implement caching and response reuse
- Implement request routing and multiplexing
- Implement security controls at the serving layer
- Integrate with API gateways and identity systems
- Operate incident response for reliability
- Optimize inference performance
- Tune GPU utilization and KV cache
Perks/Benefits
Skills/Tech-stack
APIs | Abuse detection | Autoscaling | C++ | Caching | Canary Releases | Capacity Planning | Content Filtering | Distributed Systems | GPU memory | GPU memory management | Go | Incident Response | KV cache | Kubernetes | LLM Inference | Memory Management | Metrics | Observability | Performance Engineering | Python | Quality of Service | Rate Limiting | Request Signing | Rust | Shadow testing | Structured Logging | TensorRT-LLM | Tracing | VLLM
Education
Roles
Related jobs
-
Research Engineer, SysML - FAIR USD 141K-208KAgent Orchestration | Bias Mitigation | C# | C++ | CUBLASMid-level Full TimeMenlo Park, CA | Remote, US R17h ago
-
AI Foundry | AWS Bedrock | Agent Frameworks | Agent Orchestration | AnthropicMid-level Full TimeHouston, TX, United States R23h ago
-
Embedded Semiconductor Engineer USD 120K-150KAutomation | BSP | C# | C++ | CloudCareer growth | Mentorship | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Senior AI Systems Engineer USD 130K-195KAlerting | Bash | CI/CD | CMMC | Configuration ManagementHybrid work option | Remote work optionSenior-level Full TimeRaleigh, North Carolina, United States; Albuquerque, … R1d ago
-
Senior AI Technologist USD 145K-208KAI Agents | AI Foundry | AWS Bedrock | Apache Airflow | Apache SparkCross-functional collaboration | Fully remote or hybrid or onsite optionSenior-level Full TimeRaleigh, North Carolina, United States; Albuquerque, … R1d ago
-
Data Engineer USD 120K-150KAzure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake Storage | CI/CDSenior-level Full TimeRemote, United States R1d ago
-
API | AWS Glue | Amazon Redshift | Amazon Web Services | Apache AirflowFully remote | Mentorship opportunities | On-call supportSenior-level Full TimeOrlando, FL, United States R1d ago
-
Senior Data Engineer – Hardware & Supply Chain USD 147K-203KAWS | Apache Iceberg | DBT | Data Governance | Data ModelingFlexible wellness time off | Paid Holidays | Paid sick leave | Paid time off | Parental leaveSenior-level Full TimeHybrid - San Francisco, California R1d ago
-
Principal Database Engineer USD 102K-203KASH | AWR | AWS IAM | AWS KMS | AWS RDSFlexible hybrid work model | Health and life insurance | Paid time off | Pension/retirement benefits | Personal/Family Care leavesSenior-level Full TimeTampa, FL, United States R1d ago
-
Computational Scientist USD 90K-105KC++ | Documentation | Fortran | HPC clusters | LinuxConference attendance | Open source contributions | Professional development opportunities | Student mentoringEntry-level Full TimeBlacksburg, Virginia, Hybrid R1d ago
-
Software Developer Sr USD 95K-169K.NET | ARM | Apache Spark | Automated testing | Azure401k plan | Charity programs | Dental insurance | Life insurance | Medical insuranceSenior-level Full TimeUnited States R1d ago
-
Software Developer Sr USD 95K-169K.NET | ARM | Apache Spark | Automated testing | Azure401k | Charity support | Dental insurance | Global Employee Stock Purchase Plan | Life insuranceSenior-level Full TimeUnited States R1d ago
-
Senior Machine Learning Engineer II USD 156K-250KAWS | Azure | Batch Processing | Container Orchestration | Data PipelinesDiscretionary paid time off | Emotional and mental wellness support | Employee resource groups | Fitness programs | Learning and development programsSenior-level Full TimeSeattle, Washington, United States R1d ago
-
AI orchestration | Alerting | Caching | Distributed Systems | DockerDental insurance | Medical insurance | Paid time off | Savings plan options | Vision insuranceSenior-level Full TimeSan Francisco, CA, United States R1d ago
-
A/B | A/B Testing | APIs | Anthropic | B testingDental insurance | Medical insurance | PTO | Remote | Savings plan optionsMid-level Full TimeSan Francisco, CA, United States R1d ago
-
Alerting | Batching | Caching | Distributed Systems | DockerDental insurance | Medical insurance | Paid time off | Remote work | Savings planMid-level Full TimeSan Francisco, CA, United States R1d ago
-
Senior Software Engineer USD 221K-253KAlgorithm Design | Audio technologies | C++ | Cause analysis | Code ReviewsBonus | Equity | Health benefits | Hybrid work scheduleSenior-level Full TimeMountain View, CA, USA R1d ago
-
Security Operations AI Engineer, Contract USD 151K-225KAI Governance | AI RMF | AI Security | Adversarial Attacks | Adversarial Machine LearningMid-level Full TimeRemote, United States R1d ago
-
Data Engineer USD 97K-122K.NET | API Development | API Gateway | AWS | AWS Glue401k match | Commuter benefits | Dental insurance | Dependent Care Savings Account | Education assistanceMid-level Full TimeRemote, United States R1d ago
-
AWS | Alerting | Azure | BigQuery | CI/CDEquity opportunities | Fully remote within Canada | Health, dental, vision coverage | Paid parental leave | Performance-Based IncentivesSenior-level Full TimeCanada R1d ago
-
API Design | Background Processing | CI/CD | Celery | DockerFully remote within Canada | High autonomy | High impact AI strategy involvementSenior-level Full TimeCanada R1d ago
-
ASAM OpenX | CARLA | Computer Vision | Data Curation | Diffusion ModelsFlexible location | Remote workSenior-level Full TimeUnited States Home Office, United States R2d ago
-
Principal Data Engineer USD 160K-170KAWS | Amazon Redshift | Apache Airflow | BigQuery | Cloud platform401k match | Dental insurance | Health insurance | Paid time off | Remote workSenior-level Full TimeRemote (United States) R2d ago
-
AI/ML Engineer - School USD 101K-163KAI Safety | AI safety evaluation | AWS Bedrock | AWS ECS | AWS LambdaMid-level Full TimeVirtual US IL, United States R2d ago
-
Code review | Data Governance | Data Ingestion | Data Mastering | Data SharingSenior-level Full TimePrimary location: Dublin, Franklin, OH R2d ago