aijobs.net

Software Engineer, Machine Learning Infrastructure - Generative AI

San Francisco, CA; Sunnyvale, CA; Seattle, WA

USD 137K-299K Mid-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

APIs | AWQ | AWS | Autoscaling | Backend Development | Batch inference | DPO | Data Pipelines | Debugging | Distributed Systems | FP8 | Fine Tuning | GCP | GPTQ | GPU Utilization | GPU autoscaling | INT8 | Incident Response | KV cache | Kubernetes | Latency optimization | LoRA | Machine Learning | Machine Learning Infrastructure | Model Serving | Monitoring | Observability | Python | Quantization | RAG | SFT | SGLang | TensorRT-LLM | Throughput Optimization | Tracing | VLLM | Vector Databases

Education

Bachelor of Science | Master of Science | PhD

Roles

Engineer | Learning Engineer | Machine Learning Engineer | Software Engineer

Regions

North America

Countries

United States

States

California, US | Washington, US

Cities

San Francisco, California, US | Seattle, Washington, US | Sunnyvale, California, US

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs