aijobs.net

Senior AI Infra Engineer - Large Model Inference Systems (Multimodal/LLM/VLM)

San Jose, California, United States

USD 198K-368K (estimate) Senior-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Attention Mechanisms | Batching | CUDA | Data parallelism | Distributed Systems | Language Models | Large Language Models | Latency optimization | Load Balancing | Machine Learning | Mixture of Experts | Model Parallelism | Multimodal AI | Pipeline parallelism | Tensor Parallelism | Throughput Optimization | Triton

Education

N/A

Roles

AI | AI Infrastructure Engineer | Engineer | Infrastructure Engineer | Senior AI Infrastructure Engineer

Regions

North America

Countries

United States

States

California, US

Cities

San Jose, California, US

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs