aijobs.net

Product Manager - AI Inference & Model Serving

Austin, TX, United States

USD 165K-275K (estimate) | Mid-level | Full Time

Skills/Tech-stack

AI Inference | Artificial Intelligence | Autoscaling | Cache Management | Continuous Batching | Disaggregated Serving | Distributed Systems | GPU Scheduling | Inference Server | KV Cache | KV-Cache Management | Latency Optimization | Machine Learning | Model Serving | Multi-Model Serving | Multi-Model | Network Optimization | Observability | Performance Engineering | Reliability Engineering | Routing | SGLang | Serverless | Storage Optimization | TensorRT-LLM | Throughput Optimization | Triton Inference | Triton Inference Server | vLLM | Workload Placement

Education

N/A

Roles

Manager | Product Manager | Technical Product Manager

Regions

North America

Countries

United States

States

Texas, US

Cities

Austin, Texas, US

