aijobs.net

Senior Performance Analyst, Inference

Sunnyvale, CA

USD 175K-260K (estimate) Senior-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Attention Mechanism | CUDA | Flash Attention | GPU kernel optimization | KV cache | Kernel optimization | Quantization | SGLang | TensorRT | TensorRT-LLM | Transformer | Triton | VLLM

Education

N/A

Roles

Analyst | Machine Learning Performance Analyst | Performance Analyst

Regions

North America

Countries

United States

States

California, US

Cities

Sunnyvale, California, US

Apply Save
Language: en | Views: 0 | Clicks: 0 | Saves: 0

Related jobs