aijobs.net

AI Infra Engineer - Large Model Inference Systems (Multimodal/LLM/VLM)

San Jose, California, United States

USD 198K-387K (estimate) Mid-level Full Time

Apply Save
Found 5d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Attention Mechanisms | Batching | CUDA | DP | Distributed Systems | EP | Latency optimization | Load Balancing | Mixture of Experts | Multimodal fusion | TP | Throughput Optimization | Triton

Education

N/A

Roles

AI | AI Infrastructure Engineer | Engineer | Infrastructure Engineer

Regions

North America

Countries

United States

States

California, US

Cities

San Jose, California, US

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs