aijobs.net

Engineering Manager, Inference Benchmarking — AI Perf

US, CA, Santa Clara, United States

USD 224K-356K Senior-level Full Time

Apply Save
Found 2d ago
Tasks
Perks/Benefits
Skills/Tech-stack

DCGM | Distributed Systems | GPU Telemetry | GPU observability | Helm | ITL | KV cache | Kubernetes | Kubernetes Operators | LLM Inference | Load generation | Microservices | Open Source | Open source communities | Prefill Decode | Prometheus | PyNVML | SGLang | Speculative decoding | Statistical Analysis | Systems engineering | TTFT | TensorRT-LLM | VLLM | ZMQ

Education

Bachelor of Engineering | Bachelor of Science

Roles

Engineering | Engineering Manager | Manager | Technical Lead | Technical Lead Manager

Regions

North America

Countries

United States

States

California, US

Cities

Santa Clara, California, US

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs