aijobs.net

Product Manager - AI Inference & Model Serving

Austin, TX, United States

USD 160K-275K (estimate) Mid-level Full Time

Apply Save
Found 11h ago
Tasks
Perks/Benefits
Skills/Tech-stack

AI Inference | Autoscaling | Cache Management | Cold Start | Cold Start Optimization | Continuous batching | Dedicated Endpoints | Disaggregated serving | DynamoDB | GPU scheduling | Inference Server | KV cache | KV-cache management | Model Serving | Multi model serving | Multi-model | Network Optimization | Observability | Performance Engineering | Prefill Decode | Prefill Decode Optimization | Reliability Engineering | Routing | SGLang | Serverless | Storage Optimization | TensorRT-LLM | Triton Inference | Triton Inference Server | VLLM | Workload placement

Education

N/A

Roles

Manager | Product Manager | Technical Product Manager

Regions

North America

Countries

United States

States

Texas, US

Cities

Austin, Texas, US

Apply Save
Language: en Views: 1 Clicks: 0 Saves: 0

Related jobs