aijobs.net

AI Inference Performance Engineer - New College Grad 2026

US, CA, Santa Clara, United States

USD 124K-241K | Senior-level | Full Time

Found 6d ago
Skills/Tech-stack

Attention | Batching | C++ | CUDA | CUDA kernels | CUTLASS | Deep learning | Deep learning inference | Distributed inference | Graph lowering | JAX | KV cache | Kubernetes | MPI | Memory management | NCCL | Operator fusion | Profiling | PyTorch | Python | Quantization | Roofline analysis | Scheduling | Speculative decoding | torch.compile | Triton

Education

Bachelor of Science | Master of Science | PhD

Roles

AI | AI Inference Performance Engineer | Engineer | Inference Performance Engineer | Performance Engineer

Regions

North America

Countries

United States

States

California, US

Cities

Santa Clara, California, US

