Agent 全栈研发工程师(前/后端)-MiMo
Tasks
- Analyze failed cases and improve training data or evaluation
- Build code evaluation benchmarks and tests
- Collaborate with training algorithm and product teams
- Create agent task environments for RL training
- Design frontend backend coding tasks for model
- Design reward signals and sandbox execution
- Develop APIs and integrate database interactions
- Implement backend services and web applications
Perks/Benefits
- N/A
Skills/Tech-stack
API Design | Benchmark design | CI/CD | Cypress | Database | Django | Docker | End to End | End-to-End Testing | FastAPI | Flask | Integration Testing | JavaScript | Jest | Langchain | Next.js | Node.js | Playwright | Pytest | Python | REST API | RLAIF | RLHF | React | Regression testing | Reinforcement Learning | Reinforcement Learning Training | Reward Design | Routing | Sandbox execution | Shell | State management | Testing | Testing Library | TypeScript | Unit Testing | Visual regression | Visual regression testing | Vitest | Vue
Education
N/A
Related jobs
-
Entry-level Full Time北京 R4h ago
-
高级算法工程师(Nlp方向) CNY 240K-480KAgent Development | Agent development tools | Agent memory | CUDA | ChromaSenior-level Full Time北京4h ago
-
Entry-level Full Time北京 R4h ago
-
Mid-level Full Time北京 R4h ago
-
具身世界模型训练INFRA工程师 - XiaomiRobotics CNY 180K-360KDeep learning | DeepSpeed | Distributed Training | Fault Tolerance | KubernetesMid-level Full Time北京4h ago
-
具身智能算法工程师-模型 CNY 500K-500KActor-critic | Deep learning | Distributed Training | Implicit Q Learning | Inference accelerationMid-level Full Time北京 R4h ago
-
AI基础设施研发工程师(Sandbox / 容器化)-MiMo CNY 180K-420KAppArmor | Argo Workflows | CPU resource scheduling | Cgroup | ContainerdMid-level Full Time北京 R4h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitEntry-level Internship深圳5h ago
-
Ai应用工程师(提效方向 0-1) CNY 50K-50KAI Programming | AI Programming Tools | API Integration | JavaScript | Language ProcessingEngineering resource support | Hands-on product development | Model and compute support | Real world usageEntry-level Internship深圳5h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data Retrieval | Data StorageEntry-level Internship深圳5h ago
-
Entry-level Full Time北京5h ago
-
Entry-level Internship北京5h ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KAgentic RL | DAPO | Distributed Training | Function Calling | GRPOEntry-level Full Time上海、北京5h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data pipeline | Distributed SystemsEntry-level Internship深圳5h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitFlexible work schedule | Internship opportunity | MentorshipEntry-level Internship深圳5h ago
-
Entry-level Full Time北京、上海6h ago
-
AGI 服务端资深工程师-Talkie&星野 CNY 180K-300KData Engineering | Dify | Distributed Systems | Go | Inference OptimizationMid-level Full Time北京、上海6h ago
-
Mid-level Full TimeBeijing, China22h ago
-
Mid-level Full TimeChina Shanghai23h ago
-
Asynchronous programming | Dashboards | Data Observability | Data Validation | DatabasesMid-level Full TimeChina, Shanghai23h ago
-
Senior-level Full TimeWuxi, Jiangsu, China1d ago
-
Entry-level Internship上海2d ago
-
Entry-level Full Time上海2d ago
-
数据算法工程师(实习生) CNY 25K-37KC++ | Data Generation | Data Modeling | Data Transformation | Data cleaningInternshipEntry-level Internship上海2d ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | Data Annotation | Fine Tuning | Human Feedback | LLM AgentEntry-level Internship深圳3d ago