Senior Product Manager, AI Inference - Dynamo
US, CA, Santa Clara, United States
USD 208K-327K Senior-level Full Time
Tasks
- Author PRDs
- Author software application design documents
- Collaborate on hardware/software co-design
- Define routing logic to minimize redundant prefill
- Design KV cache offloading strategy
- Develop agentic inference capabilities
- Drive product strategy for Dynamo modular components
- Integrate with SGLang
- Integrate with TensorRT LLM
- Integrate with vLLM
- Optimize time to first token
- Support multi turn stateful AI applications
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic AI | Artificial Intelligence | Cache Management | Data-driven | Data-driven project management | Disaggregated serving | Distributed Systems | GPU Computing | KV cache | LLM Inference | MLOps | Machine Learning | Offloading | Prefill Decode | Product Management | Project Management | Responsible AI | Routing | Software Requirements | Systems Design | Time To First Token
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Product Manager II - AI & Data Security USD 165K-215KAI system security | Application Architecture | Cloud Computing | Customer discovery | Data RiskCareer development opportunities | Cross departmental buddy program | Employee onboarding program | Hybrid work environment | Inclusive workplace cultureMid-level Full TimeNew York, New York, USA11h ago
-
Senior Project Manager, Machine Learning Operations USD 120K-180KAnnotation Workflows | Cause analysis | Data Pipelines | Data Quality | Data labelingCompetitive benefits package | Equity | Performance bonusSenior-level Full TimeMountain View, California (HQ)12h ago
-
Finance Systems, Head of AI & Innovation USD 315K-365KAnomaly Detection | Artificial Intelligence | Audit management | Backlog Management | BigQueryFlexible working hours | Generous vacation | Hybrid work policy | Optional equity donation matching | Parental leaveExecutive-level Full TimeSan Francisco, CA14h ago
-
Product Analyst - Generative AI Platform USD 110K-171KAPI | Agentic Systems | Agile | Cloud Computing | Data Processing401k | Dental insurance | Life insurance | Medical insurance | Paid time offEntry-level Full TimeAustin, TX, United States16h ago
-
AI Solution Strategist USD 176K-265KArtificial Intelligence | Conversational AI | Customer Experience | Language Processing | Machine LearningMid-level Full TimeUSA - Remote R17h ago
-
API | AWS | Agile | Artificial Intelligence | Azure401k matching | Dental insurance | Flexible work schedule | Health insurance | Paid HolidaysSenior-level Full TimeTexas R18h ago
-
APIs | Agile | Amazon Web Services | Artificial Intelligence | Cloud infrastructure401k matching | Dental insurance | Flexible work schedule | Health insurance | Paid HolidaysSenior-level Full TimeNew York R18h ago
-
APIs | AWS | Agile | Azure | Cloud Computing401k match | Dental insurance | Flexible work schedules | Health insurance | Paid HolidaysSenior-level Full TimeNorth Carolina R18h ago
-
Sr. Tech Lead, GTM Applied AI & Analytics USD 150K-243KAirflow | Data Warehousing | Databricks | Fine Tuning | LLM APIsSenior-level Full TimeSan Francisco, CA, United States19h ago
-
Product Lead, Applied AI, Data and Analytics Platform USD 192K-278KAccess Control | Analytics | Artificial Intelligence | BigQuery | Data LineageSenior-level Full TimeMountain View, CA, USA20h ago
-
Senior Technical Program Manager, AI/ML, Finance USD 192K-278KArtificial Intelligence | Business Intelligence | Data Architecture | Data Governance | Data WarehousingSenior-level Full TimeSunnyvale, CA, USA20h ago
-
Technical Program Manager III, Machine Learning, Core USD 163K-237KArtificial Intelligence | Automation | Deep learning | Machine Learning | Speech RecognitionSenior-level Full TimeSunnyvale, CA, USA20h ago
-
Accelerated computing | Artificial Intelligence | Competitive Analysis | Content development | Cross-Functional CollaborationCollaborative work environment | Comprehensive benefits package | Equity opportunities | Flexible work arrangements | Professional development opportunitiesSenior-level Full TimePennsylvania R1d ago
-
AI Inference | Accelerated computing | Competitive Analysis | Cross-Functional Collaboration | Cross-functionalCollaborative work environment | Comprehensive benefits package | Equity opportunities | Flexible work arrangements | Professional development opportunitiesSenior-level Full TimeMassachusetts R1d ago
-
AI Inference | Accelerated computing | Competitive Analysis | Data center | Data center architectureCollaborative work environment | Comprehensive benefits package | Equity opportunities | Flexible work arrangements | Professional developmentSenior-level Full TimeMaryland R1d ago
-
AI Inference | Accelerated computing | Competitive Analysis | Content development | Cross-Functional CollaborationCollaborative work environment | Comprehensive benefits package | Equity opportunities | Flexible work arrangements | Professional development opportunitiesSenior-level Full TimeCalifornia R1d ago
-
Manager, IT AI Engineering - Agent Engineering USD 96K-131KA/B | A/B Testing | API Design | Access Control | Agent architecture401k program | Car discounts | Cruise discounts | Employee assistance program | Flexible spending accountsMid-level Full TimeFort Worth, TX, US1d ago
-
AWS CloudFormation | Amazon Web Services | Anomaly Detection | Attribution Modeling | Budget Quota ManagementSenior-level Full TimeMountain View, California1d ago
-
Sr Manager, People Analytics Insight Partner USD 182K-235KAWS | Advanced Statistics | Analytics | Business Analytics | Data ManagementDental insurance | Health insurance | Life insurance | Paid time off | Vision insuranceSenior-level Full TimeUS - CA - Foster City, …1d ago
-
Senior Manager - Data Science USD 152K-204KData Science | Machine Learning | Resource Management | Stakeholder management | Strategic PlanningSenior-level Full TimeAtlanta Support Center, United States1d ago
-
Gen AI - Tech Product Manager USD 140K-160KA/B | A/B Testing | AI Evaluation | AI Studio | Azure AI401k plan | Dental insurance | Disability insurance | Flexible time off | Health insuranceMid-level Full TimeNew York, United States1d ago
-
Lead AI Engineer USD 150K-230KAI Services | Agile | Angular | Argo CD | CI/CD401k matching | Career growth | Healthcare benefits | Online learning platform | Paid time offSenior-level Full TimeUSA - Georgia - Alpharetta - …1d ago
-
Sr. Business Manager - SBB Banker Analytics USD 182K-229KAnalytics | Credit Risk | Credit risk modeling | Data Analysis | Economic forecastingSenior-level Full TimeMcLean, VA, United States1d ago
-
Sales Leader (Analytics & Consulting Services) USD 168K-200KAccount Planning | Artificial Intelligence | Consultative selling | Contract Negotiation | CybersecurityEntrepreneurial environment | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
AWS | Apache Spark | Business Intelligence | Cause analysis | Data ModelingHybrid work | Limited travel | People management experience | Project management supportMid-level Full TimeRiverwoods, IL, United States1d ago