Find jobs in AI/ML, Data Science and Big Data
7 results
for Inference Serving
(Skill/Tech stack)
-
APIs | Agent architecture | Embeddings | Inference Serving | LLMDirect technical influence | Early stage equity upside | Fast Moving Engineering Culture | High technical autonomy | Remote workMid-level Full TimeSan Francisco, CA; Onsite R2d ago
-
Staff Backend Engineer, ML Inference Systems USD 192K-305KCI/CD | Docker | GCP | Go | GrafanaCommute subsidy | Comprehensive health insurance | Disability insurance | Employee resource groups | Employee stock ownershipSenior-level Full TimeMountain View, CA, USA12d ago
-
Staff Backend Engineer, ML Inference Systems USD 192K-271KCI/CD | Distributed Systems | Docker | GCP | GolangCommute subsidy | Comprehensive health life and disability insurance | Employee resource groups | Employee stock ownership | Generous vacation and personal daysSenior-level Full TimeNew York, NY, USA12d ago
-
Senior Product Manager - GPU Optimization Department (GPUOD) JPY 8600K-10600KChargeback | Competitive Analysis | Go-to-market | Inference Serving | KPI DevelopmentSenior-level Full TimeRakuten Crimson House, Japan21d ago
-
Engineering Manager, Agentic Systems - Moveworks USD 113K-192KC++ | Deep learning | DeepSpeed | Distributed Training | GPU infrastructureMid-level Full TimeMountain View, CALIFORNIA, United States29d ago
-
Senior Software Engineer, ML Infrastructure USD 250K-330KCost Optimization | Distributed Training | Experimentation | Fault Tolerance | GPU clustersDental benefits | Flexible vacation | Free meals and snacks | Medical benefits | Vision benefitsSenior-level Full TimeSan Francisco1mo ago
-
Engineering Manager, Agentic Systems USD 162K-284KC++ | Deep learning | DeepSpeed | Distributed Training | GPU OptimizationMid-level Full TimeMountain View, CALIFORNIA, United States1mo ago