aijobs.net

大模型 Infra 研发实习生(Agentic RL 方向)

深圳

CNY 25K-37K (estimate) Entry-level Internship

Apply Save
Found 3h ago
Tasks
Perks/Benefits
Skills/Tech-stack

Alerting | Asynchronous programming | Concurrency | Data pipeline | Distributed Systems | Distributed tracing | Docker | Git | Gymnasium | Kubernetes | Language Models | Large Language Models | Linux | Make | Monitoring | Mujoco | Observability | OpenAI Gym | PPO | PPO GRPO | Python | RLAIF | RLHF | Ray | Reinforcement Learning | Retrieval | SGLang | Shell | Slurm | Storage | Task Environment Abstraction | VLLM | Version control

Education

N/A

Roles

Engineer | Learning Engineer | Machine Learning Engineer | Software Engineer

Regions

Asia/Pacific

Countries

China

States

Hainan, CN

Cities

Wenchang, Hainan, CN

Apply Save
Language: zh Views: 3 Clicks: 2 Saves: 0

Related jobs