Product Manager - AI Inference & Model Serving
USD 165K-275K (estimate) Mid-level Full Time
Tasks
- Collaborate on system design trade offs for serving topology
- Define lifecycle for inference services
- Define performance measurable outcomes and metrics
- Drive go to market execution for inference products
- Lead technical discovery and translate findings into requirements
- Own product strategy and roadmap for inference and model serving
Perks/Benefits
Skills/Tech-stack
AI Inference | Artificial Intelligence | Autoscaling | Cache Management | Continuous batching | Disaggregated serving | Distributed Systems | GPU scheduling | Inference Server | KV cache | KV-cache management | Latency optimization | Machine Learning | Model Serving | Multi model serving | Multi-model | Network Optimization | Observability | Performance Engineering | Reliability Engineering | Routing | SGLang | Serverless | Storage Optimization | TensorRT-LLM | Throughput Optimization | Triton Inference | Triton Inference Server | VLLM | Workload placement
Education
N/A
Roles
Related jobs
-
Technical Program Manager, FAIR (AI Research) USD 183K-271KBias Mitigation | Compute Infrastructure | Data Engineering | Data Management | Data QualitySenior-level Full TimeNew York, NY6h ago
-
Bridge | Compliance Management | Contract Management | Environmental Health and Safety | Environmental healthMid-level Full TimeBridgeport, AL, USA6h ago
-
Senior Product Data Scientist Manager, Android USD 240K-334KClassification | Data Visualization | Machine Learning | Personalization | PythonSenior-level Full TimeSan Jose, CA, USA; Kirkland, WA, …6h ago
-
Program Manager, AI/ML, Trust and Safety USD 136K-197KArtificial Intelligence | Audit | Capacity Planning | Compliance | Cross-Functional CollaborationMid-level Full TimeKirkland, WA, USA; Austin, TX, USA6h ago
-
Technical Program Manager, Engineering & Delivery USD 70K-300KAutomation | CI/CD | Cycle time | Delivery Predictability | Deployment frequencyMid-level Full TimeIrvine, CA15h ago
-
Lead GTM Enablement & Scale Architect, Lakebase USD 174K-299KAI content | AI content generation | Agent architecture | Apache Spark | Cloud infrastructureSenior-level Full TimeUnited States19h ago
-
C++ | Data Mining | Data Processing | Data Processing Pipelines | Experiment designBonus | Equity incentive | Health benefits | Paid time offSenior-level Full TimeMountain View, California, United States; San …20h ago
-
Engineering Manager I, Threat Detection USD 192K-240KArtificial Intelligence | Automation | CI/CD | Detection engineering | Incident ResponseBest in class onboarding | Continuous career development | Cross departmental buddy program | Employee stock purchase plan | Hybrid work environmentMid-level Full TimeNew York, New York, USA20h ago
-
Data Platform Product Manager USD 99K-192KAPI Integration | Alerting | Big Data | Cloud Computing | Cloud platformAdoption surrogacy expense reimbursement | Back-Up childcare | Community service time off | Employee resource groups | Fertility treatmentsMid-level Full TimeDearborn, MI, United States20h ago
-
Data Insights – Tech Manager – IC4 USD 138K-225KAirflow | Amazon SageMaker | Business Intelligence | Customer Health | DBTHybrid work option | Inclusion and fun culture | Mentoring and growth opportunitiesMid-level Full TimeSan Francisco, CA, United States21h ago
-
Product Manager, Databricks Experimentation Platform USD 133K-204KAI Platform | Compliance | Data Governance | Databricks | Feature EngineeringBackup childcare | Financial coaching | Health and wellness centers | Health care coverage | Mental health supportMid-level Full TimeWilmington, DE, United States1d ago
-
Delivery Lead USD 140K-165KAI Governance | Data Architecture | Data Engineering | Data Pipelines | Generative AISenior-level Full TimeUnited States1d ago
-
Data Modeling | Data analytics | Forecasting | Fraud Prevention | Machine LearningExecutive-level Full TimeNew York, NY, United States1d ago
-
Staff Product Manager | Cloud Data Platform USD 135K-170KCloud infrastructure | Databases | Distributed Systems | Enterprise SaaS | ObservabilitySenior-level Full TimeUnited States1d ago
-
Director of Engineering, Lakehouse Platform USD 230K-278KAI Assisted Development | API contracts | AWS | Anomaly Detection | Apache Flink401k | Employer Paid Benefits | Flexible working arrangements | LTD/STD | Life insuranceExecutive-level Full TimeBoston, Massachusetts, United States1d ago
-
Program Manager, AI/ML, Finance Data and Analytics USD 148K-215KArtificial Intelligence | Business Intelligence | Data Architecture | Data Governance | Data ModelingSenior-level Full TimeChicago, IL, USA2d ago
-
AI workloads | Agentic Applications | Cloud Computing | Data Pipelines | Edge ComputingMid-level Full TimeSunnyvale, CA, USA2d ago
-
Sr. Manager, AI Lead - Semantic Layer - Remote USD 168K-224KAPI Integration | Analytics | Artificial Intelligence | Data Governance | Data ModelingRemote workSenior-level Full TimeCalifornia - Home Teleworkers, United States R2d ago
-
Senior Manager, Global Marketing & C&CL Consumption & Purchase Advanced Analytics Intelligence Services USD 130K-147KAdvanced Analytics | Azure Machine Learning | Consumer analytics | Data Analysis | Data ScienceGlobal exposure | High leadership visibility | Hybrid workSenior-level Full TimeUS - GA - Atlanta, United …2d ago
-
ML Infrastructure Engineer USD 100K-150KAmazon SageMaker | Apache Airflow | Argo Workflows | C++ | Cloud platformEntry-level Full TimeOakland, CA3d ago
-
Director of AI & Data Analytics USD 240K-302KAmazon Web Services | Apache Airflow | Compliance | DBT | Data Governance401k match | Generous PTO | Health and wellness benefitsExecutive-level Full TimeDallas, TX, US4d ago
-
Director, Business Lead, FAIR USD 119K-272KAgent Orchestration | Artificial Intelligence | Bias Mitigation | Cross-functional | Cross-functional leadershipSenior-level Full TimeMenlo Park, CA4d ago
-
Responsible AI Program Manager, Google Public Sector USD 165K-239KAI Governance | AI Safety | Compliance | Executive Communication | GovernanceSenior-level Full TimeReston, VA, USA; Washington D.C., DC, …4d ago
-
AppScript | Artificial Intelligence | BI tools | Business Intelligence | ConcurrencySenior-level Full TimeSan Bruno, CA, USA4d ago
-
Capacity Planning | Compute Infrastructure | Cross-functional | Cross-functional project management | Data AnalysisSenior-level Full TimeSunnyvale, CA, USA; New York, NY, …4d ago