Senior Product Manager, AI Inference - Dynamo
US, CA, Santa Clara, United States
USD 208K-327K Senior-level Full Time
Tasks
- Author PRDs
- Author software application design documents
- Collaborate on hardware/software co-design
- Define routing logic to minimize redundant prefill
- Design KV cache offloading strategy
- Develop agentic inference capabilities
- Drive product strategy for Dynamo modular components
- Integrate with SGLang
- Integrate with TensorRT LLM
- Integrate with vLLM
- Optimize time to first token
- Support multi turn stateful AI applications
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic AI | Artificial Intelligence | Cache Management | Data-driven | Data-driven project management | Disaggregated serving | Distributed Systems | GPU Computing | KV cache | LLM Inference | MLOps | Machine Learning | Offloading | Prefill Decode | Product Management | Project Management | Responsible AI | Routing | Software Requirements | Systems Design | Time To First Token
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Agile | Dependency management | Product Management | Program Management | Risk ManagementSenior-level Full TimeSan Francisco, California, USA9h ago
-
Senior Manager, Customer Value and Analytics USD 160K-200KBigQuery | CMS guidelines | Cross-Functional Collaboration | Cross-functional | Data EngineeringSenior-level Full TimeRedwood City, CA12h ago
-
CDAO - Enterprise - Generative AI Program Manager USD 139K-191KAcquisition | Artificial Intelligence | Autonomy | Data Science | Generative AIMid-level Full TimeArlington, VA12h ago
-
AI Assurance | Adversarial AI | Artificial Intelligence | Budget Management | CybersecurityMid-level Full TimeArlington, VA12h ago
-
Acquisition regulations | Artificial Intelligence | Budget Management | Contract Negotiation | Data ScienceMid-level Full TimeArlington, VA13h ago
-
Product Manager - AI Inference & Model Serving USD 160K-275KAI Inference | Autoscaling | Cache Management | Cold Start | Cold Start OptimizationConference attendance | Professional development and training | Stock options | Workstation providedMid-level Full TimeAustin, TX, United States13h ago
-
Engineering Manager, Model Inference USD 220K-270KAPIs | Attention Mechanism | Batching | Distributed Systems | Docker401k matching | Commuter benefits | Flexible PTO | Flexible spending accounts | Generous time offMid-level Full TimeSF Office13h ago
-
Technical Product Manager, AI Storage USD 168K-240KBacklog Management | BeeGFS | Block Storage | CSI | DAOSConference attendance | Customized workstation | Professional development and trainingMid-level Full TimeAustin, TX, United States13h ago
-
Manager, Data Quality Engineering USD 180K-247KADLS | Active Directory | Agile | Apache Spark | Azure401k matching | Adoption Assistance | Childcare tuition discounts | Company Mental Health Support | Fertility benefitsSenior-level Full TimeAnn Arbor, MI, United States14h ago
-
Sr. Manager, People Analytics USD 205K-308KAPI Integration | Automation | Dashboarding | Data Governance | Data ModelingCompany-sponsored team events | Flexible time off | Wellness resourcesSenior-level Full TimeSanta Clara, California15h ago
-
Senior Solution Owner | AI & Data Solutions USD 160K-195KAPIs | Agentic AI | Agile | Agile Ceremonies | Amazon Web ServicesSenior-level Full TimeNew York19h ago
-
Agile project management | Authentication | Behavioral biometrics | Case management | Cause analysisHybrid work | Incentive compensation plan | Profit sharingSenior-level Full TimeGlen Allen, Virginia, United States R22h ago
-
Cross-Functional Collaboration | Cross-functional | Functional collaboration | GenAI | OKR ManagementMid-level Full TimeSunnyvale, CA, USA22h ago
-
AI | AI Agents | Agent systems | Cloud Computing | Context engineeringSenior-level Full TimeSan Francisco, CA, USA; New York, …22h ago
-
Technical Program Manager II, AI/ML, Google Ads USD 138K-198KCross-Functional Collaboration | Cross-functional | Data analytics | Functional collaboration | Gemini ModelsMid-level Full TimeNew York, NY, USA22h ago
-
ML Infrastructure Engineer USD 151K-230KAirflow | Amazon SageMaker | Apache Spark | Argo Workflows | C++Entry-level Full TimeOakland, CA1d ago
-
AWS | Agentic Frameworks | Context engineering | Data analytics | Design analyticsBackup childcare | Financial coaching | Health care | Mental health support | On-site health and wellness centersExecutive-level Full TimeNY, United States1d ago
-
Senior Manager, Data Analytics - Interoperability USD 75K-165KAnalytics reporting | BigQuery | Cloud platform | Data Modeling | Data analyticsDental insurance | Medical insurance | Paid time off | Remote work | Retirement savings optionsSenior-level Full TimeWork At Home-Florida, United States1d ago
-
Senior Manager, Data Analytics - Interoperability USD 67K-149KBigQuery | Clinical data | Clinical data exchange | Cloud platform | Data ModelingDental insurance | Medical insurance | Paid time off | Remote work | Retirement savingsSenior-level Full TimeWork At Home-Georgia, United States1d ago
-
Manager of Data Science USD 104K-170KAPIs | Agentic Workflows | Artificial Intelligence | Automation | CloudSenior-level Full TimeHudson, WI, United States1d ago
-
Manager, Decision Science Products USD 171K-230KA/B | A/B Testing | Agile | B testing | Backlog ManagementBonus | Financial benefits | Long-term incentive units | Medical benefitsMid-level Full TimeUSA - CA - 820 S …1d ago
-
Advanced Analytics | Analytics | Artificial Intelligence | BigQuery | Cloud platformDental insurance | Medical insurance | Paid time off | Retirement savings | Vision insuranceSenior-level Full TimeNew York-161 Ave of the Americas, …1d ago
-
Automation | Big Data | Cause analysis | Data Visualization | ExcelSenior-level Full TimeBellevue, Washington, USA1d ago
-
Observational Research Manager (Pharmacovigilance Epidemiology and Causal Inference / Obesity) USD 109K-147KAdministrative Claims | Causal Inference | Causal Roadmap | Clinical Trials Design | Clinical trialsCareer development | Flexible spending accounts | Hybrid work | Life and disability insurance | Medical/Dental/Vision insuranceMid-level Full TimeUS - California - Thousand Oaks … R1d ago
-
Global Manager, Survey Science, Analytics & Programming USD 154K-192KC plus plus | CICD | Causal Inference | Cloud | Data Engineering401k matching | Disability coverage | Life insurance | Medical, dental, and vision plans | Vacation and sick timeSenior-level Full TimeLakewood, CO, US1d ago