Senior Director, AI Operations (AI/LLM Production Systems)
USD 220K-300K (estimate) Executive-level Full Time
Tasks
- Build escalation paths and ownership
- Build operational risk controls
- Control agent behavior and model usage
- Define AI Operations practice
- Define SLAs and SLOs
- Drive performance reliability and cost optimization
- Establish operating model and governance
- Identify systemic issues and drive continuous improvement
- Implement observability and monitoring
- Lead operational reviews and reporting
- Manage incidents
- Monitor latency drift behavior failures cost
- Own production reliability
- Partner with engineering product and platform for readiness and alignment
- Set production readiness standards
Perks/Benefits
- N/A
Skills/Tech-stack
Artificial Intelligence | Cost Optimization | Datadog | Distributed Systems | Drift Detection | Governance | Grafana | Hallucination detection | ITIL | Incident Management | Language Models | Large Language Models | Latency monitoring | MLOps | Machine Learning | Monitoring | Observability | OpenTelemetry | Orchestration | Orchestration frameworks | Production Operations | Prometheus | RAG | Reliability Engineering | Responsible AI | Service Level | Service Level Agreements | Service Management | Service-Level Objectives | Site Reliability | Site Reliability Engineering | Telemetry
Education
Roles
AI | AI Operations Lead | Director | Director of AI | Director of AI Operations | Lead | Operations | Operations Lead
Related jobs
-
Senior-level Full TimeHerdon, VA, US11h ago
-
AI Research Scientist (Robot Learning) USD 175K-251KData Flywheel | Deep learning | Distributed Training | Fine Tuning | Generative ModelsFree gym | Sports subscription | Team activities | Team tripsMid-level Full TimeSan Francisco15h ago
-
AI Research Engineer (Robot Learning) USD 175K-250KData Flywheel | Deep learning | Distributed Training | Fine Tuning | Generative ModelsFree gym membership | Sports subscription | Stock options | Team meals | Team tripsMid-level Full TimeSan Francisco15h ago
-
Advanced AI Architect USD 136K-177KAI Foundry | AWS Bedrock | Argo CD | Artificial Intelligence | AuditabilitySenior-level Full TimeAEP Headquarters, United States1d ago
-
Director, Precision Medicine Data Science & AI USD 194K-361KAWS | Apache Spark | Azure | CPT | Claims data401-k match | Comprehensive benefits package | Hybrid work | Paid time off | Travel 10 percentExecutive-level Full TimeEast Hanover, United States1d ago
-
Azure Data | Azure Data Factory | Azure DevOps | CI/CD | Data Factory401k match | Disability insurance | Education benefit | Employee stock purchase plan | Life insuranceSenior-level Full TimePrudential Tower, 655 Broad Street, Newark, … R1d ago
-
AI Foundry | AI Search | Active Directory | Azure AI | Azure AI Foundry401k match | Company pension plan | Disability insurance | Education benefit | Employee stock purchase planExecutive-level Full TimePrudential Tower, 655 Broad Street, Newark, … R1d ago
-
Principal Digital Product Manager, Applied AI USD 159K-258KA/B | A/B Testing | AI Platform | AWS SageMaker | Azure Machine Learning401k savings plan | Adoption benefits | Career development | Employee assistance program | Employee discountsSenior-level Full TimeIrving, Texas, United States1d ago
-
ViiV Healthcare (GSK) Statistics Leader USD 169K-252KAI | Bayesian Methods | Biostatistics | Causal Inference | Clinical Study DesignSenior-level Full TimeDurham Blackwell Street, United States1d ago
-
Lead Applied AI Engineer USD 305KArtificial Intelligence | Data integration | Foundation Models | Guardrails | Language Processing401k plan | Disability benefits | Health benefits | Life insurance | Paid time offSenior-level Full Time111432-TX-Las Colinas Bldg A, Irving Campus, …1d ago
-
Principal MarTech SWE / AI Engineer USD 145K-235KAPIs | AWS | Agile Development | Asynchronous processing | AzureSenior-level Full TimeSanta Clara, CA1d ago
-
Director, Precision Medicine Data Science & AI USD 194K-361KAI Governance | AWS | Azure | CDS frameworks | CPTExecutive-level Full TimeEast Hanover, United States1d ago
-
Sr. Director, Analyst, CIO & AI Leader Group – Cybersecurity & Emerging Technologies, Enterprise Risk - Remote, US USD 172K-202KArtificial Intelligence | Blockchain | CCPA | CIS Controls | Cloud SecurityFlexible work environment | Mentoring and coaching | Professional development | Remote work | Travel up to 25 percentSenior-level Full TimeRemote - Texas, United States R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAccelerators | Computer Vision | Data Quality | Data labeling | Data quality monitoringRemote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAccelerator hardware | Computer Vision | Data Quality | Data quality monitoring | Deep learningRemote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAccelerator hardware | Computer Vision | Data labeling | Deep learning | Distributed TrainingMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAccelerators | Computer Vision | Data Modeling | Data Quality | Data ValidationCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAblation Studies | Accelerator hardware | Agentic Systems | Data Quality | Deep learningMid-level Full TimeUnited States - Remote R1d ago
-
Director, Advanced Analytics USD 110K-220KCloud platform | Cluster Analysis | Cross-functional | Cross-functional leadership | Data Mining401k | Adoption expense reimbursement | Company discounts | Company paid life insurance | Dental insuranceExecutive-level Full Time(USA) Ol Roy Building AR Bentonville …1d ago
-
AWS | Azure | C# | C++ | ExperimentationSenior-level Full TimeNew York, NY, United States1d ago
-
AI Governance | AWS | Azure | Experimentation | Foundation ModelHealth benefits | Incentive compensation | Inclusive workplaceSenior-level Full TimeNew York, NY, United States1d ago
-
Lead Machine Learning Engineer USD 179K-225KAgile | Dask | Data Pipelines | Deep learning | Distributed ComputingSenior-level Full TimeCambridge, MA, United States1d ago
-
Lead Machine Learning Engineer (Manager IC) USD 179K-225KAWS Bedrock | Agentic AI | Agile | Azure | DaskHealth benefits | Incentive compensationSenior-level Full TimeMcLean, VA, United States1d ago
-
AWS | Artificial Intelligence | Asynchronous programming | Context engineering | Distributed Systems401k matching | Dental insurance | Health insurance | Learning opportunities | Relocation assistanceSenior-level Full TimeSan Antonio, Texas, United States1d ago
-
Inference Intern USD 60K-142KC++ | Collective communication | Compilers | Consensus Protocols | Consistency modelsDaily meals | Direct mentorship | Housing support | Paid internshipEntry-level InternshipSan Jose1d ago