aijobs.net

Senior Software Engineer II, Inference

Sunnyvale, CA / Bellevue, WA

USD 165K-242K Senior-level Full Time

Apply Save
Found 8d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Autoscaling | BF16 | C++ | CI/CD | CUDA | Caching | Capacity Planning | Distributed Systems | FP8 | GPU Computing | Go | Grafana | Incident Management | Inference Batching | KV cache | Kubernetes | Latency optimization | Microbatching | Mixed Precision | NCCL | NUMA | Networking | OpenTelemetry | Performance Engineering | Prometheus | Python | RDMA | Ray Serve | Reliability Engineering | SLI | SLO | Speculative decoding | Streaming token delivery | TensorRT-LLM | Throughput Optimization | Torchserve | Triton | VLLM

Education

N/A

Roles

Engineer | Senior Software Engineer | Software Engineer

Regions

North America

Countries

United States

States

California, US | Washington, US

Cities

Sunnyvale, California, US | Bellevue, Washington, US

Apply Save
Language: en | Views: 0 | Clicks: 0 | Saves: 0

Related jobs