Senior Director, AI Operations (AI/LLM Production Systems)
USD 220K-300K (estimate) Executive-level Full Time
Tasks
- Build escalation paths and ownership
- Build operational risk controls
- Control agent behavior and model usage
- Define AI Operations practice
- Define SLAs and SLOs
- Drive performance reliability and cost optimization
- Establish operating model and governance
- Identify systemic issues and drive continuous improvement
- Implement observability and monitoring
- Lead operational reviews and reporting
- Manage incidents
- Monitor latency drift behavior failures cost
- Own production reliability
- Partner with engineering product and platform for readiness and alignment
- Set production readiness standards
Perks/Benefits
- N/A
Skills/Tech-stack
Artificial Intelligence | Cost Optimization | Datadog | Distributed Systems | Drift Detection | Governance | Grafana | Hallucination detection | ITIL | Incident Management | Language Models | Large Language Models | Latency monitoring | MLOps | Machine Learning | Monitoring | Observability | OpenTelemetry | Orchestration | Orchestration frameworks | Production Operations | Prometheus | RAG | Reliability Engineering | Responsible AI | Service Level | Service Level Agreements | Service Management | Service-Level Objectives | Site Reliability | Site Reliability Engineering | Telemetry
Education
Roles
AI | AI Operations Lead | Director | Director of AI | Director of AI Operations | Lead | Operations | Operations Lead
Related jobs
-
Sr Director, Head of AI USD 180K-230KAI Governance | AI Policy | AI ethics | AI policy development | Agentic AIExecutive-level Full TimeEl Segundo, CALIFORNIA, United States5h ago
-
A/B | A/B Testing | AWS | Adversarial Testing | Amazon SQSHybrid work | W2 employmentSenior-level Contract Full TimeIrvine, CA, United States R10h ago
-
Project Management Specialist - AI Products USD 84K-175KComputer Vision | Confluence | Figma | Generative AI | Human-Computer InteractionEntry-level Full Time Internship底特律11h ago
-
Dir, Data Governance & Observability USD 165K-200KAPI Design | Alerting | Automated testing | CI/CD | Data ContractsExecutive-level Full TimeNew York, NEW YORK, United States13h ago
-
GenAI Engineer USD 93K-163KAWS Bedrock | Agentic Workflows | C++ | CI/CD | CohereHealth and wellness benefits | Mentorship | Professional developmentEntry-level Full TimeArlington/Rosslyn, Virginia, United States16h ago
-
C++ | Data Compression | Data Ingestion | Data Processing | Data StorageSenior-level Full TimeSan Jose, California, United States17h ago
-
Agent Frameworks | Benchmarking | Evaluation metrics | GitHub | Information RetrievalDevelopment workshops | Mentorship | Social eventsEntry-level InternshipSan Jose, California, United States17h ago
-
Computer Vision | Data Pipelines | Language Models | Language Processing | Large Language ModelsSenior-level Full TimeBellevue, WA | Menlo Park, CA17h ago
-
AI Research Scientist, Reinforcement Learning USD 170K-251K3D data | C plus plus | Computer Vision | Computer Vision 3D | Control TheorySenior-level Full TimeNew York, NY17h ago
-
Technical Lead, AI/ML Storage USD 207K-300KAI/ML | AI/ML frameworks | Artificial Intelligence | Benchmarking | Cloud MLHealth insurance | Paid time off | Professional development | Retirement benefitsSenior-level Full TimeSeattle, WA, USA18h ago
-
AI Solutions Architect, Digital Technology Solutions USD 139K-223KAI Studio | AWS | AWS Bedrock | Agent Orchestration | AltairDental insurance | Health insurance | Paid Holidays | Paid parental leave | Paid time offSenior-level Full TimeCincinnati, OH, US, 4522123h ago
-
Director, Machine Learning USD 211K-385KAPIs | Data Integrity | Data Pipelines | Distributed Systems | Experimentation401k matching | Development programs | Employee stock purchase plan | Medical coverage | MentorshipExecutive-level Full TimeAustin, Texas, United States; Chicago; Palo …1d ago
-
API Gateway | AWS CloudFormation | AWS Lambda | AWS Step Functions | Alerting401k plan | Continuing education | Dental insurance | Employee assistance program | Flexible spending accountSenior-level Full TimeOakland, CA, United States1d ago
-
API Design | Anthropic | Artificial Intelligence | Distributed Systems | Enterprise Integration401k plan | Adoption reimbursement | Commuter benefits | Critical caregiving leave | Disability benefitsSenior-level Full Time142019-NC-300 South Brevard, Charlotte, United States1d ago
-
Mid-level Full TimeUSA, MD, Annapolis Junction (308 Sentinel …1d ago
-
AI Engineer - Application Development USD 75K-158KAWS | AWS Bedrock | AWS GovCloud | Agno | Amazon BedrockFlexible time off | Learning resourcesMid-level Full Time999 REMOTE, United States R1d ago
-
AI Solutions Architect USD 126K-225KAir gapped deployment | Air-gapped | Apache Kafka | Apache NiFi | Data PipelinesCareer development | Employee resource groups | Flexible work from home | Generous paid time off | Paid volunteer timeSenior-level Full TimeUS-Washington DC-Remote, United States R1d ago
-
Generative AI Engineer USD 117K-175KAI Platform | Automl | Bias Mitigation | BigQuery | C++401k matching | Healthcare | Paid time offEntry-level Full TimeUSA - Atlanta - One Atlantic …1d ago
-
AI Safety | Artificial Intelligence | Benchmarking | Bias Audit | Data RequirementsComprehensive Vision Coverage | Comprehensive dental coverage | Comprehensive medical coverage | Employee communities | Generous paid time offSenior-level Full TimeSeattle, WA, United States1d ago
-
Software Engineering Director USD 113K-145KAWS | Agile | Atlassian Bitbucket | Atlassian Confluence | Atlassian Jira401k matching | Healthcare packages | Online learning platform | Paid time offExecutive-level Full TimeUSA - Georgia - Alpharetta - …1d ago
-
AI/ML Intern - Radiation Oncology USD 58K-80KAlgorithm deployment | Algorithms | Angular | D3 | Data AnalysisEntry-level Internship Part TimeRochester, MN, United States1d ago
-
Machine Learning Engineer, Foundation Model USD 129K-247KAuto-regressive models | C++ | Data Pipelines | Deep learning | Diffusion ModelsSenior-level Full TimeSan Jose1d ago
-
Head of AI (Applied AI & Automation) USD 173K-220KAI Agents | AWS | Anthropic | BYO LLM | Cloud ComputingExecutive-level Full TimeRemote (United States) R1d ago
-
Clinical Director I / Senior Principal Data Scientist, Director I, Data and AI Convergence USD 177K-336KAWS | Airflow | Artificial Intelligence | Azure | Big DataExecutive-level Full TimeNorth Chicago, IL, United States1d ago
-
AWS | Azure | CI/CD | Cloud platform | Data PipelinesLong-term contractSenior-level Contract Full TimeDallas, TX, United States1d ago