Staff Machine Learning Systems Engineer (MLOps)
Tasks
- Build GitOps deployment pipelines
- Build LLM observability and tracing stack
- Create ephemeral preview environments
- Create monitoring analytics pipelines
- Define SLOs alerting and runbooks
- Design AI production infrastructure
- Ensure HIPAA compliant secure access controls
- Implement IAM and secrets management
- Implement LLM gateway routing and failover
- Improve CI CD pipelines and monorepo build system
- Manage Infrastructure as Code modules
- Mentor engineers on MLOps best practices
- Operate Kubernetes platform
- Scale inference and model serving
- Write technical design documents and lead reviews
Perks/Benefits
Skills/Tech-stack
AWS EKS | Alerting | Autoscaling | CI/CD | ClickHouse | Datadog | Docker | GitOps | Helm | IAM | Inference Serving | Kubernetes | Kustomize | LLM routing | Langfuse | OIDC | OTLP | Observability | OpenTelemetry | Python | SLO | Secrets management | Terraform
Education
N/A
Related jobs
-
Data & AI Platform Engineer USD 95K-155KAI Search | APIs | AWS | Airflow | ArcGIS401k matching | Dental insurance | Health insurance | Life insurance | Paid HolidaysSenior-level Full TimeRemote, United States R17h ago
-
Sr Data Engineer USD 100K-120KAPIs | AWS | AWS Glue | Airflow | Amazon RedshiftFully remote | Mentorship | On-call supportSenior-level Full TimeOrlando, FL, United States R17h ago
-
Senior Applied AI Engineer / Forward Deployed Engineer USD 150K-170KAI Foundry | AI Search | API Integration | Azure AI | Azure AI Foundry401k matching | Career growth | Dental insurance | Disability insurance | Fully remote workSenior-level Full TimeMinneapolis, MN, United States R1d ago
-
Edge AI Engineer USD 100K-150KC++ | Core ML | Device deployment | Embedded Systems | Federated LearningRemote workSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAccelerators | Computer Vision | Data Quality | Data labeling | Data quality monitoringRemote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KApache Beam | Apache Spark | CI/CD | Caching | Code review100 percent remote work | Career growth opportunities | H1B transfer support for qualified candidates | Long term multi year engagementMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapter | Attention Optimization | DPO | Distributed Training | Evaluation benchmarksMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering Architect USD 100K-150KAPIs | Agentic Workflows | Embeddings | Evaluation Frameworks | Fine TuningSenior-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 100K-150KBehavior Trees | C++ | Computer Vision | Concurrent programming | Control SystemsCareer growth potential | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Senior Data Engineer USD 72K-156KDAX | Data Governance | Data Quality | Databricks | Databricks Lakehouse401k company match | Associate discounts | Dental insurance | Health insurance | Life insuranceSenior-level Full TimeRemote, United States R1d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Compiler optimization | Continuous batching | Deep learningBenefits | Career growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Storage Engineer (NetApp / Pure / Ceph) USD 100K-150KAnsible | CRUSH maps | CSI | Capacity Planning | CephRemote workSenior-level Full TimeUnited States - Remote R1d ago
-
Storage Engineer (NetApp / Pure / Ceph) USD 100K-150KAnsible | Automation | CRUSH maps | CSI | Capacity PlanningRemote workSenior-level Full TimeUnited States - Remote R1d ago
-
Senior AI/ML Engineer USD 125K-188KAWS | AWS Architecture Patterns | AWS CDK | AWS Lambda | AWS architecture401k matching | Dental insurance | Health savings account | Medical insurance | Online trainingSenior-level Full TimeHerndon, Virginia, United States R1d ago
-
Sr Software Engineer, MLOps USD 150K-180KCI/CD | Cloud Monitoring | DVC | Dataset versioning | Deployment Automation24/7 medical hotline | 401k employer match | Employee discounts | Employee resource groups | Flexible paid time awaySenior-level Full TimeVIRTUAL, WA, US, 00000 R1d ago
-
Analytics Engineer USD 147K-225KApache Airflow | BigQuery | DBT | Databricks | Python401k | Comprehensive benefits | Equity | Flexible time offSenior-level Full TimeUS Remote, San Francisco, CA; New … R1d ago
-
Staff Data & Machine Learning Engineer USD 118K-136KDBT | Data Architecture | Data Governance | Data Quality | Data Streaming401k match | Dental insurance | Family planning resources | Flexible vacation | Fully remoteSenior-level Full TimeRemote - USA R1d ago
-
Senior AI Engineer, Real-World Data USD 125K-175KAI orchestration | AWS | AWS Fargate | AWS Lambda | Agile deliverySenior-level Full TimeUS Remote R1d ago
-
Staff Data Platform Engineer USD 210K-240KAuditing | Azure Event | Azure Event Hubs | Batch Processing | CI/CDHealth plan subsidies | Paid global offsites | Remote-first work culture | WFH office reimbursementSenior-level Full TimeRemote - US R1d ago
-
A/B | A/B Testing | B testing | C++ | Cloud Computing401k employer match | Family planning support | Flexible vacation | Gender-affirming care | Healthcare benefitsSenior-level Full TimeRemote - United States R1d ago
-
Senior Data Analytics Engineer USD 170K-225KAirbyte | Airflow | Amazon Redshift | BigQuery | CI/CD401k match | Childcare discounts | Equity incentive programs | Gym membership | Health insuranceSenior-level Full TimeAustin, TX - Hybrid R1d ago
-
Senior Backend Engineer- AI Agents (Remote) USD 180K-240KA/B | A/B Testing | API Design | AWS | AzureFlexible vacation policy | Health insurance coverage | Team offsites and in person collaboration | Work with distributed teamSenior-level Full TimeUnited States R1d ago
-
AI Forward Deployed Engineer | $120K-$150K + Hybrid + Equity | AI-Based Outage Intelligence Company A USD 120K-150KAPI Integration | Artificial Intelligence | Automation | Backend Development | InfrastructureDirect Impact on Product Growth | Equity | Hybrid work | Significant ownership | Unlimited PTOMid-level Full TimeKing of Prussia, PA, United States R1d ago
-
Data Engineer USD 115K-145KApache Airflow | CD pipelines | CI/CD | CI/CD pipelines | Data Observability401k match | Caregiver leave | Equity | Generous PTO | Health plansEntry-level Full TimeUnited States R1d ago
-
Sr. Staff Machine Learning Engineer, Agentic Ads USD 227K-469KBehavior Modeling | Boosted Trees | C++ | Data Pipelines | Data QualitySenior-level Full TimeSan Francisco, CA, US; Remote, US R1d ago