aijobs.net

Staff Software Engineer, Inference

Sunnyvale, CA / Bellevue, WA

USD 188K-275K Senior-level Full Time

Apply Save
Found 5d ago
Tasks
Perks/Benefits
Skills/Tech-stack

BF16 | C++ | CUDA | Distributed Systems | FP8 | GPU interconnects | Go | Inference Server | Kubernetes | Latency optimization | Mixed Precision | NCCL | NUMA | Networking | Performance optimization | Python | RDMA | Ray Serve | Streaming inference | TensorRT-LLM | Throughput Optimization | Torchserve | Triton Inference | Triton Inference Server | VLLM

Education

N/A

Roles

Engineer | Software Engineer | Staff Software Engineer

Regions

North America

Countries

United States

States

California, US | Washington, US

Cities

Sunnyvale, California, US | Bellevue, Washington, US

Apply Save
Language: en Views: 1 Clicks: 0 Saves: 0

Related jobs