Staff Software Engineer, Model Serving
Tasks
- Build model container builds and deployment workflows
- Define technical roadmap and long term architecture
- Design and implement core systems and APIs
- Develop routing caching observability and runtime systems
- Establish code quality testing and operational readiness best practices
- Improve latency availability and cost effectiveness
- Influence cross organizational technical discussions
- Mentor engineers through design reviews and technical guidance
- Optimize performance throughput autoscaling and operational efficiency
- Translate customer needs into reliable performant systems
Perks/Benefits
- N/A
Skills/Tech-stack
APIs | Algorithms | Autoscaling | CPU | Caching | Data Structures | Deployment Workflows | Distributed Systems | GPU | Inference Systems | Low Latency | Observability | Reliability | Routing | Scalability | Scheduling | System design
Education
N/A
Roles
Regions
Countries
States
Related jobs
-
Algorithms | Android | Cloud platform | Google Cloud | Google Cloud PlatformAsynchronous culture | Flexible management | Portfolio and LinkedIn links encouraged | Remote work optionSenior-level Full TimeAnchorage, AK, USA1d ago
-
Principal AI/ML Researcher / Engineer Reasoning, Planning, and Decision-making systems USD 296K-370KAgent systems | Artificial Intelligence | Belief State Tracking | Caching | Causal modelingSenior-level Full TimeUnited States R1d ago
-
AWS | Alphafold | Bioinformatics | Cloud Computing | Data PipelinesSenior-level Full TimeNew York, NY1d ago
-
API Design | AWS | Asynchronous processing | Cloud Security | Data ModelingSenior-level Full TimeNew York, NY1d ago
-
Senior Gen AI Engineer USD 200K-250KAmazon Neptune | Autogen | CI/CD | CrewAI | CypherCollaborative culture | Flexible working hours | Hybrid work | Remote | Startup speed workSenior-level Full TimeUnited States1d ago
-
AWS | Asynchronous programming | Context engineering | Distributed Systems | Embeddings401k | Health, dental, vision coverage | Learning stipend | Onsite work | Relocation assistanceSenior-level Full TimeIllinois, Illinois, United States1d ago
-
Data Scientist (Generative AI) USD 125K-160KAPIs | AWS Bedrock | AWS Kendra | AWS SageMaker | Adversarial NetworksEntry-level Full TimeMcLean, VA, United States1d ago
-
Mid-level Full TimeWashington, DC1d ago
-
Media Software Engineer, Speech (All Levels) USD 120K-180KAndroid | Artificial Intelligence | Audio Processing | C# | C++401k retirement savings plan | Company holidays | Complimentary lunch and snacks | Fertility support | Medical, dental, and vision insuranceEntry-level Full TimeSunnyvale R1d ago
-
Senior Software Engineer - Tools Development (Robotics) USD 126K-169KAlgorithm Design | C++ | Distributed Systems | Linux | PythonSenior-level Full TimeDallas, TX1d ago
-
Principal Data Engineer (streaming) USD 118K-134KAWS | Alerting | Apache Flink | Apache Hudi | Apache KafkaAllyship and inclusion communities | Caregiver leave | Continuous development support program | Employee assistance program | Employee recognitionSenior-level Full TimeRemote, USA R1d ago
-
Staff AI engineer USD 200K-250KAI Evaluation | AWS | Agent Orchestration | Artificial Intelligence | CachingFlexible working hours | Hybrid work | Unlimited time offSenior-level Full TimeSan Francisco2d ago
-
Senior-level Full TimeChicago, Illinois, United States2d ago
-
Storage Engineer (NetApp / Pure / Ceph) USD 100K-150KAnsible | Backups | CRUSH maps | CSI drivers | Capacity PlanningCareer growth | H1B transfers supported | Remote workSenior-level Full TimeUnited States - Remote R2d ago
-
Senior-level Full TimeScottsdale, Arizona, United States2d ago
-
Staff AI Engineer (Life Sciences) USD 155K-240KAPIs | AWS | Agent Orchestration | Agent systems | AzureSenior-level Full TimePalo Alto2d ago
-
Lead Data Engineer USD 100K-200KAWS | Airflow | Apache Spark | Batch Processing | CI/CD401k company contribution | Coaching and Counseling Sessions | Employee health and protective benefits | Employee resource groups | Flexible time offSenior-level Full TimeNew York, NY2d ago
-
Principal Machine Learning Engineer USD 180K-368KAWS Lambda | Amazon SQS | Amazon SageMaker | Automated testing | Backend EngineeringHybrid workSenior-level Full TimeRemote, USA R2d ago
-
Benchmarking | Code review | Data Pipelines | Distributed Systems | EvaluationSenior-level Full TimeMenlo Park, CA2d ago
-
APIs | Agent systems | Cloud platform | CrewAI | Data PipelinesSenior-level Full TimeSan Francisco, CA, USA; Atlanta, GA, …2d ago
-
C++ | Code review | Compute Technologies | Data Analysis | Data StructuresSenior-level Full TimeSunnyvale, CA, USA2d ago
-
Senior Software Engineer, AI/ML, AI and Infrastructure USD 174K-252KC++ | Data Processing | Data Storage | Data Structures | Data structures algorithmsSenior-level Full TimeMountain View, CA, USA; Kirkland, WA, …2d ago
-
Software Engineer III, AI/ML, Google Workspace USD 147K-211KC++ | Data Processing | Debugging | Distributed Computing | Information RetrievalSenior-level Full TimeKirkland, WA, USA2d ago
-
AI Solution Engineer USD 109K-155KAPIs | AWS | Azure | Embeddings | GCP401k match | Basic life insurance | Dental insurance | Disability coverage | Medical insuranceMid-level Full TimePiscataway, NJ, US2d ago
-
Software Engineer, RL Training Infra USD 295K-445KAgent systems | Async systems | Debugging | Distributed Systems | Hardware ReliabilityMid-level Full TimeSan Francisco3d ago