Senior Engineering Manager, AI Inference Platform, Distributed Cloud
Tasks
- Define the technical vision and strategy for the LLM serving stack
- Design, implement, and optimize LLM serving architectures
- Lead, mentor, and grow engineering teams
- Optimize GPU accelerator performance and minimize latency
- Oversee performance analysis, profiling, and benchmarking
- Partner with research, SRE, product, and core library teams
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | C++ | Continuous batching | Deep learning | Disaggregated serving | Distributed Computing | GPU | JAX | LLM serving | Latency optimization | Machine Learning | Performance Profiling | PyTorch | Python | Resource Optimization | Scalability | System Optimization | TensorFlow | XLA
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Roles
AI | Engineering | Engineering Manager | Engineering Manager, AI | Manager