Staff Software Engineer, Model Serving
Tasks
- Build model container builds and deployment workflows
- Define technical roadmap and long term architecture
- Design and implement core systems and APIs
- Develop routing caching observability and runtime systems
- Establish code quality testing and operational readiness best practices
- Improve latency availability and cost effectiveness
- Influence cross organizational technical discussions
- Mentor engineers through design reviews and technical guidance
- Optimize performance throughput autoscaling and operational efficiency
- Translate customer needs into reliable performant systems
Perks/Benefits
- N/A
Skills/Tech-stack
APIs | Algorithms | Autoscaling | CPU | Caching | Data Structures | Deployment Workflows | Distributed Systems | GPU | Inference Systems | Low Latency | Observability | Reliability | Routing | Scalability | Scheduling | System design
Education
N/A
Roles
Regions
Countries
States
Related jobs
-
Partner Engineering GenAI - US USD 136K-203KAPIs | C++ | Claude | Cloud Computing | Data integrationSenior-level Full TimeMenlo Park, CA | Seattle, WA …5h ago
-
AI Specialist - Product and Applied Research USD 180K-200KC++ | Computer Vision | Crawling | Data Mining | Data RegressionMid-level Full TimeMenlo Park, CA | New York, …5h ago
-
Software Engineer, Databases (Technical Leadership) USD 161K-297KAI Tooling | Automation | Consensus Protocols | Data Integrity | Database InternalsSenior-level Full TimeBellevue, WA | Menlo Park, CA5h ago
-
Senior-level Full TimeMenlo Park, CA5h ago
-
Senior Software Engineer, AI/ML, Platforms and Devices USD 174K-252KC++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingBonus | Equity | Health insurance | Learning and development | Paid time offSenior-level Full TimeMountain View, CA, USA; United States5h ago
-
Staff Software Engineer, AI/ML GenAI, Google Cloud USD 207K-300KComputer Vision | Data Preparation | Data Processing | Debugging | Distributed ComputingSenior-level Full TimeSunnyvale, CA, USA; San Francisco, CA, …5h ago
-
C++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingSenior-level Full TimeKirkland, WA, USA; New York, NY, …6h ago
-
Full Stack AI Software Engineer USD 216K-283KAPI contracts | AWS | Azure | Classification | Data PipelinesAdoption leave | Commuter benefits | Dental insurance | Disability insurance | Equity ESPPExecutive-level Full TimeSan Mateo, CA, United States10h ago
-
Machine Learning Engineer, AI Agent Platform USD 110K-230KAPI Development | Benchmarking | Distributed Systems | Evaluation | LLM orchestrationHealth insurance | Paid time off | Parental leaveSenior-level Full TimeBay Area15h ago
-
Entry-Level AI / ML Software Engineer USD 60K-74KAgile | Algorithms | Code review | Data Structures | Deep learningEntry-level Full TimeHopkins, MN, United States17h ago
-
MLOps Engineer II USD 105K-189KAWS | AWS CDK | AWS CloudFormation | AWS Lambda | AWS Step FunctionsSenior-level Full TimeInnovation Point, United States17h ago
-
Senior-level Full TimeCarlsbad, California, United States; Scottsdale, Arizona, …1d ago
-
Senior AI/ML Engineer USD 155K-210KCI/CD | Cloud Architecture | Distributed Systems | Docker | EmbeddingsEquity participation | Paid Company Holidays | Professional development opportunities | Unlimited PTO | Weekly in-office cateringSenior-level Full TimeLeawood, KS, US1d ago
-
Senior Software Engineer - Agentic AI USD 145K-195KAWS | Azure | Cloud Computing | Distributed Systems | Docker401k matching | Health insurance | Hybrid work arrangement | Paid time off | Relocation assistanceSenior-level Full TimeBoston, Massachusetts, United States - Remote R1d ago
-
Staff Software Engineer - Big Data (Gen AI) USD 170K-170KAWS | AWS EMR | Apache Hudi | Apache Spark | Artificial IntelligenceHealth benefits | Paid time off | Remote workSenior-level Full TimeUnited States1d ago
-
AWS | Anomaly Detection | Azure | Big Data | CI/CDSenior-level Full TimeSan Jose, California, United States1d ago
-
Software Engineer, Perception and Prediction Evaluation USD 155K-213KAWS | Amazon Batch | Amazon ECS | Amazon S3 | Apache AirflowCatered meals | Daily Drinks | Equity awards | Flexible hours | Health and wellness benefitsMid-level Full TimeRemote US & Canada R1d ago
-
Senior Machine Learning Operations Engineer USD 140K-208KCloud Native | Distributed Systems | Docker | Elasticsearch | Event Driven401k matching | Company-Paid Holidays | Counseling sessions | Dental insurance | Disability coverageSenior-level Full TimeChicago, IL1d ago
-
Principal Embedded System Automation Engineer USD 120K-180KBash | Bitbake | CI/CD | CMake | Continuous DeliveryDental insurance | Disability insurance | FSA | HSA | Health insuranceSenior-level Full TimeAustin, TX1d ago
-
Mission Engineer USD 135K-216KArtificial Intelligence | Cloud infrastructure | Data Structures | Data integration | Front-endSenior-level Full TimeWashington, DC1d ago
-
Staff AI engineer USD 130K-185KAI Evaluation | AWS | Agent Orchestration | Caching | Data PipelinesFlexible working hours | Hybrid work culture | Unlimited time offSenior-level Full TimeSan Francisco1d ago
-
LLM Engineer (GenAI, NYC) USD 150K-230KAgent systems | Data Analysis | Exploratory Data Analysis | Langchain | Language ModelsMid-level Full TimeNYC Hybrid R1d ago
-
Sr. Back-End Software Engineer - Machine Learning USD 180K-250KAPI Development | C plus plus | Computer Vision | Distributed Systems | Language Processing401k matching | Commuter benefits | Employee Medical Premium Coverage | Employee referral program | Flexible spending accountsSenior-level Full TimeSanta Clara, CA1d ago
-
Space Operations Engineer (Embedded Software) USD 100K-140KAPI Integration | ARM | C# | C++ | Command and controlMid-level Full TimeSan Francisco, CA1d ago
-
Engineering Intern – Gen AI for FP&A Platform USD 80K-100KAPI Integration | Agentic AI | Cloud Computing | Data Structures | Data structures algorithmsHybrid work option | Mentorship | Remote work optionEntry-level InternshipUnited States1d ago