aijobs.net

Senior Software Engineer, RL Post-Training Frameworks

US, CA, Santa Clara, United States

USD 184K-356K Senior-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Actor Based Programming | C# | C++ | Consistency models | DPO | DeepSpeed | Distributed Systems | FSDP | FSDP2 | Failure recovery | GRPO | High Performance | High-Performance Computing | High-performance inference | Infiniband | Kubernetes | LLM post training | MOE | Megatron-LM | Mixed Precision | NCCL | NVLink | PPO | Performance Computing | Pipeline parallelism | Post-training | PyTorch | Python | Quantization aware training | RLHF | Ray | Reinforcement Learning | Reinforcement Learning for LLM Post Training | Reward Modeling | Service boundaries | Task-based programming | Tensor Parallelism | TensorRT-LLM | VLLM

Education

Master of Science | PhD

Roles

Engineer | Senior Software Engineer | Software Engineer

Regions

North America

Countries

United States

States

California, US

Cities

Santa Clara, California, US

Apply Save
Language: en | Views: 0 | Clicks: 0 | Saves: 0

Related jobs