aijobs.net

Senior ML Engineer — Distributed LLM Training Infrastructure

Remote R

A USD 180K-250K (estimate) Senior-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

AWS | Azure | C++ | CI/CD | CUDA | Checkpointing | DDP | DeepSpeed | Docker | FSDP | Fairscale | GCP | Gradient Accumulation | Gradient compression | Kubernetes | MPI | Megatron-LM | Mixed Precision | Model Parallelism | NCCL | PyTorch | Python | Quantization | RPC | Sparsification | TorchTitan

Education

Bachelor of Science | Master of Science

Roles

Engineer | Learning Engineer | Machine Learning Engineer | Senior Machine Learning Engineer

Apply Save
Language: en | Views: 0 | Clicks: 0 | Saves: 0

Related jobs