LLM Reinforcement Learning Framework Engineer
Tasks
- Debug training pipelines
- Develop reinforcement learning algorithms for LLM post-training
- Ensure robustness, scalability, and reliability in production
- Integrate reinforcement learning components into the LLM training and serving stack
- Run experiments and evaluations
Perks/Benefits
- N/A
Skills/Tech-stack
Asynchronous programming | asyncio | Deep learning | DeepSpeed | Distributed Training | GPU Architecture | Language Models | Large Language Models | Megatron-LM | NeMo | Optimization | Probability | PyTorch | Python | Ray | Reinforcement Learning | Statistics | TensorRT-LLM | torch.distributed | vLLM
Education
- N/A