Lead Principal Machine Learning Engineer
USD 169K-355K Senior-level Full Time
Tasks
- Build agentic AI systems
- Build vector retrieval systems
- Coordinate multi-agent workflows
- Create eval suites and regression testing
- Define OCI AI platform architecture
- Design inference and model serving
- Develop agent memory and context management
- Enforce policies and safety guardrails
- Establish AgentOps and LLMOps observability
- Implement tool calling services
- Integrate Model Context Protocol
- Integrate enterprise APIs and cloud services
- Lead multi team execution plans
- Mentor engineers through code reviews
- Orchestrate LLM agents
- Run incident analysis and follow up
- Set SLIs and SLOs
Perks/Benefits
Skills/Tech-stack
Access Control | Agent systems | AgentOps | Auditability | Autogen | Batching | Caching | Cloud infrastructure | CrewAI | Docker | Embeddings | Evaluation | Function Calling | GPU Inference | Human-in-the-loop | Incident Response | Inference Optimization | Kubernetes | LLM | LLMOps | Langchain | Langgraph | Llamaindex | Long Context | Model Serving | Monitoring | Multi-Agent | Multi-Agent Systems | Multi-tenant | Multi-tenant architecture | Observability | Oracle Cloud | Oracle Cloud Infrastructure | Prompt engineering | Python | Quantization | RAG | Regression testing | Retrieval-Augmented Generation | SLI | SLO | Security | Structured outputs | Tenant architecture | The Loop | Tool-Calling | Tracing | Vector Databases
Education
Regions
Countries
States
Related jobs
-
Software Engineer III - Senior Java Spark Developer USD 113K-188KAgile | Apache Spark | CI/CD | Concurrency | Distributed SystemsSenior-level Full TimeJersey City, New Jersey, United States3h ago
-
Applied AI Engineer - AI Solutions USD 172K-300KAgentic Workflows | Airflow | Apache Spark | Chroma | CrewAIAnnual travel up to 25% | Employee stock options | Hybrid work | Professional developmentMid-level Full TimeNew York City, NY (Hybrid); Redwood … R10h ago
-
AI Solutions Engineer, Talent Acquisition USD 129K-171KAPIs | Access Control | Agentic Workflows | Audit trails | AuthenticationMid-level Full TimeSeattle, Washington, United States13h ago
-
Network Engineer, Supercomputing USD 350K-475KCUDA | Congestion Control | Container Orchestration | Debugging | Deep learningDental benefits | Health benefits | Paid parental leave | Relocation support | Unlimited PTOSenior-level Full TimeSan Francisco13h ago
-
Principal Product Manager, Inference Engine USD 218K-273KAutoscaling | Batching | Capacity Efficiency | Capacity Planning | GPU EconomicsEmployee assistance program | Flexible time off | LinkedIn Learning | Training reimbursementSenior-level Full TimeSeattle16h ago
-
AI Engineer, Agentic Ad Creative (Multimodal) USD 120K-220KA/B | A/B Testing | Ad Policy Compliance | B testing | CUDA401k matching | Commuter benefits | FSA | HSA | Health, dental, and vision insuranceMid-level Full TimeMountain View, California, United States16h ago
-
Backend Engineer - Applied AI USD 175K-240KCRDT | Caching | Circuit Breakers | Database Query | Database Query OptimizationMid-level Full TimeNew York City16h ago
-
Senior Data Engineer USD 126K-142KAzure Cloud | Azure Cloud Platform | Azure Data | Azure Data Factory | Azure Data LakeSenior-level Full TimeUnited States17h ago
-
2026 Fall Health AI Scholar, Digital Health Algorithms USD 124K-150KAWS SageMaker | Amazon Redshift | Android | Artificial Intelligence | C++Entry-level Full Time665 Clyde Avenue, Mountain View, CA, …17h ago
-
Senior Software Engineer - Infrastructure Storage USD 266K-395KAPI Design | Block Storage | Ceph | Distributed Systems | Fibre Channel401k match | Commuter stipend | Flexible paid time off | Health, dental, vision coverage | Wellness stipendSenior-level Full TimeSan Francisco Office (Fremont St)17h ago
-
Senior-level Full TimeCosta Mesa, California, United States17h ago
-
Sr. Data Engineer (Remote) USD 163K-192KAccess Control | Amazon Web Services | Apache Iceberg | Apache Kafka | Apache Spark401k plan | Dental insurance | Disability insurance | Employee assistance program | FSA/HSASenior-level Full TimeRemote - United States R17h ago
-
Machine Learning Engineer - Simulation Framework USD 160K-234K3D Graphics | C++ | CUDA | Deep learning | Deterministic systemsEntry-level Full TimeFoster City, CA17h ago
-
Reliability Engineer, Supercomputing USD 350K-475KBMC | Container Orchestration | DCGM | Debugging | Firmware ManagementDental benefits | Health benefits | Paid parental leave | Relocation support | Unlimited PTOMid-level Full TimeSan Francisco17h ago
-
AWS | Agent Orchestration | CI/CD | Cloud platform | Databricks401k match | Counseling membership | Employer subsidized medical dental and vision | Flexible time away program | Life insuranceMid-level Full Time-REMOTE, USA- R17h ago
-
Software Engineer III - Big Data & AWS USD 175K-186KAPI Gateway | AWS Glue | AWS Lambda | AWS Step Functions | Amazon APISenior-level Full TimePlano, TX, United States18h ago
-
AI Solutions Engineer, Talent Acquisition USD 129K-171KAPI Integration | APIs | Access Control | Agentic Workflows | Audit trailsMid-level Full TimeBoston, Massachusetts, United States18h ago
-
AI Solutions Engineer, Talent Acquisition USD 129K-171KAPIs | Access Control | Agentic Workflows | Audit Logs | AuthenticationHealth insurance | Paid time off | Wellness programsMid-level Full TimeCosta Mesa, California, United States18h ago
-
Agentic Workflows | Data Pipelines | Deep learning | Deployment | Experiment designLong-term engagementMid-level Full TimeRedmond, WA18h ago
-
AI Engineer, Ecosystem USD 171K-240KAPI Integration | Access Management | Audit Logging | Authentication | AuthorizationHybrid work | Remote work up to 4 weeks per yearMid-level Full TimeSan Francisco, California, United States R18h ago
-
AI Engineer, Ecosystem USD 171K-240KAPI Integration | Access Control | Audit Logging | Authentication | AuthorizationFlexible schedule | Hybrid work | Remote work 4 weeks per yearMid-level Full TimeSeattle, Washington, United States R18h ago
-
Forward Deployed AI Solutions Engineer USD 95K-145KAPIs | Agentic Workflows | Audit Logging | Cloud Computing | Command Line401k benefits | Commuter benefits | Employee referral program | Fertility care benefits | Free testingMid-level Full TimeUS Remote R18h ago
-
Staff Analytics Engineer USD 159K-187KClaude | Claude Code | DBT | Data Contracts | Data Modeling401k company match | Accident insurance | Company funded HSA contributions | Critical illness insurance | Health, dental, vision coverageSenior-level Full TimeRemote (United States) R18h ago
-
AI Engineer, Product USD 171K-240KA/B | A/B Testing | API Design | B testing | Data ModelingHybrid work | Remote work up to four weeks per yearMid-level Full TimeSan Francisco, California, United States R18h ago
-
C plus plus | Concurrency | Device Management | Distributed Systems | Embedded LinuxEqual opportunity employer | Medical/Dental/Vision insurance | Paid time offSenior-level Full TimeSouth San Francisco, California, USA18h ago