aijobs.net

LLM Pre-training & Distributed Engineer (AI Infrastructure)

San Francisco Bay Area, USA

USD 136K-259K (estimate) | Entry-level | Full Time

Found 21h ago
Skills/Tech-stack

C++ | CUDA | Data parallelism | DeepSpeed | Infiniband | Kubernetes | Megatron-LM | Pipeline parallelism | PyTorch | Python | RDMA | Slurm | Tensor Parallelism

Education

N/A

Roles

Distributed Systems Engineer | Engineer | Learning Engineer | Machine Learning Engineer | Systems Engineer

Regions

North America

Countries

United States
