Senior Engineer, Inference Control Plane
Tasks
- Contribute to architecture decisions for traffic management service orchestration and platform scalability
- Design and build scalable multi tenant AI inference services
- Develop and operate distributed systems for reliability availability and performance
- Improve observability capacity management automation and operational tooling
- Lead incident management and continuous improvement
- Participate in on call rotations and reduce operator pain
- Partner with platform GPU infrastructure and product engineering teams
Perks/Benefits
- Employee assistance program
- Employee stock purchase program
- Flexible time off
- LinkedIn Learning access
- Reimbursement for training and education
Skills/Tech-stack
API Gateway | Capacity Planning | Distributed Systems | Fault Tolerance | GPU Utilization | Go | Incident Management | Kubernetes | Microservices | Observability | Operational automation | Reliability Engineering | Service Mesh | Service orchestration | Time Per Output Token | Time To First Token | Traffic Routing
Education
N/A
Related jobs
-
Featured Feat. Applied AI Engineer - Bay Area USD 211K-263KArtificial Intelligence | C plus plus | C# | Embeddings | Feature Engineering401k | Comprehensive health and wellness benefits | Learning and development opportunities | Unlimited time offMid-level Full TimeHQ (San Francisco)27d ago
-
Data Engineer, Infrastructure FinOps USD 146K-194KAPI Design | BigQuery | CI/CD | CloudFormation | DBTMid-level Full TimeCosta Mesa, California, United States11h ago
-
Senior Software Engineer, Data Platform USD 166K-220KAWS | Amazon Athena | Apache Iceberg | Apache Spark | AzureSenior-level Full TimeCosta Mesa, California, United States11h ago
-
Ad Ranking | Agentic Workflows | Automl | Deep learning | Distributed Systems401k matching | Dental insurance | Employee assistance program | Health insurance | Paid time offSenior-level Full TimePalo Alto, California, USA12h ago
-
Senior Data Engineer USD 135K-168KAWS | Airflow | Amazon Redshift | Data Modeling | Data Warehousing401k | Flexible work arrangement | Health insurance | Mentorship program | Paid time offSenior-level Full TimePlano, TX13h ago
-
AI Full Stack Developer & Architect USD 140K-161KCloud Run | Kubernetes | MLOps | Machine Learning | Next.jsSenior-level Contract Full TimeSan Jose, CA, United States16h ago
-
Data & AI Platform Engineer USD 95K-155KAI Search | APIs | AWS | Airflow | ArcGIS401k matching | Dental insurance | Health insurance | Life insurance | Paid HolidaysSenior-level Full TimeRemote, United States R17h ago
-
Senior Data Engineer USD 140K-160KApache Airflow | CI/CD | Change Data Capture | Cluster Sizing | Data Capture401k | Dental insurance | Health insurance | Hybrid work | Paid leaveSenior-level Full TimeNorth Hollywood, CA, United States17h ago
-
Machine Learning Engineer USD 72K-131KCI/CD | Data Fusion | Data Pipelines | Data analytics | Deep learningHybrid work environment | MentorshipSenior-level Full TimeMelbourne, Florida, United States18h ago
-
Junior Machine Learning Engineer USD 64K-106KCI/CD | Data Fusion | Data analytics | Data sets | Deep learningComprehensive benefits package | Hybrid work environment | Supportive work environment | US security clearance supportedEntry-level Full TimeMelbourne, Florida, United States18h ago
-
Staff Machine Learning Systems Engineer (MLOps) USD 210K-250KAWS EKS | Alerting | Autoscaling | CI/CD | ClickHouseFlexible remote work | Healthcare industry domain experienceSenior-level Full TimeUS Remote R19h ago
-
Senior Machine Learning Platform Engineer USD 155K-215KAI Observability | AWS SageMaker | Alerting | Amazon ECS | Audio ProcessingOn-call rotationSenior-level Full TimeNew York, NY21h ago
-
Member of Technical Staff (AI Software Engineer, Agents) USD 220K-405KAI Evaluation | Agent architecture | Browser technologies | Chrome DevTools | Chrome DevTools ProtocolSenior-level Full TimeSan Francisco1d ago
-
Senior Applied AI Engineer / Forward Deployed Engineer USD 150K-170KAI Foundry | AI Search | API Integration | Azure AI | Azure AI Foundry401k matching | Career growth | Dental insurance | Disability insurance | Fully remote workSenior-level Full TimeMinneapolis, MN, United States R1d ago
-
Sr Technical Solutions Engineering USD 130K-178KAWS | Automated Patch Deployment | Azure | Bash | CloudFormation24x7 on-call support | Secure facility accessSenior-level Full TimeMcLean, Virginia1d ago
-
Staff Technical Solution Engineering USD 153K-210KAir-gapped | Air-gapped networks | Automation | Bash | Cloud infrastructure24x7 on call coverage flexibility | Benefits package | Secure facility onsite workSenior-level Full TimeMcLean, Virginia1d ago
-
Senior Reliability Engineer- Surgical Robotics USD 107K-160KAutomation Scripting | Cause analysis | Cause map | Data Analysis | FMEA401k plan with employer match | Health, dental, vision insurance | Onsite work | Paid Holidays | Paid time offSenior-level Full TimeUSA-CT North Haven, United States1d ago
-
AI Foundry | AWS | Amazon Bedrock | Anthropic Claude | Autogen401k plan | Bonus opportunities | Dental insurance | Life insurance | Long-term disabilityMid-level Full TimeDallas, 5205 N OConnor Las Colinas, …1d ago
-
Senior Lead AI Engineer (GenAI Platform Services) USD 229K-286KAI Governance | AWS | Azure | Experimentation | GoSenior-level Full TimeSan Jose, CA, United States1d ago
-
Sr. AI Engineer - Forward Deployed Engineer Senior Manager · Snowflake · Reinvention Centers USD 112K-338KAnthropic | Anthropic Claude | Autogen | CI/CD | Containerization401k plan | Hybrid work | Paid Holidays | Paid time offSenior-level Full TimeDallas, 5205 N OConnor Las Colinas, …1d ago
-
AI Engineer - Forward Deployed Engineer · Snowflake · Reinvention Centers Associate Manager USD 62K-220KAgent Orchestration | Apache Spark | CI/CD | Cortex AI | Cortex Analyst401k plan | Dental insurance | Life insurance | Long-Term Disability coverage | Medical insuranceMid-level Full TimeDallas, 5205 N OConnor Las Colinas, …1d ago
-
Senior Robotics Solutions Engineer USD 91K-147KCause analysis | Customer Service | Data Analysis | ECO process | Electro-mechanicalCaregiver leave | Domestic Travel up to 50% | Holiday pay | Parental leave | Sick timeSenior-level Full TimeUS328 CA Santa Clara - 5490 …1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAccelerators | Computer Vision | Data Quality | Data labeling | Data quality monitoringRemote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KApache Beam | Apache Spark | CI/CD | Caching | Code review100 percent remote work | Career growth opportunities | H1B transfer support for qualified candidates | Long term multi year engagementMid-level Full TimeUnited States - Remote R1d ago
-
Senior Software Engineer, DGXC Data Services USD 152K-287KAWS | Algorithms | Apache Iceberg | Apache Spark | C plus plusSenior-level Full TimeUS, CA, Santa Clara, United States1d ago