大模型算法研究员-MiMo
Tasks
- Analyze model capability mechanisms
- Build algorithm evaluation methods
- Design deep neural network architectures
- Generate and preprocess training datasets
- Implement RLHF and RLAIF preference alignment
- Implement supervised fine tuning
- Improve data quality with instruction tuning
- Optimize active learning and curriculum learning
- Optimize multimodal models
- Train and optimize large language models
Perks/Benefits
- N/A
Skills/Tech-stack
Active Learning | C++ | Curriculum learning | Deep learning | Fine Tuning | Language Models | Language Processing | Large Language Models | Multimodal Learning | Natural Language | Natural Language Processing | PyTorch | Python | RLAIF | RLHF | Reinforcement Learning | Supervised Fine Tuning | TensorFlow
Education
N/A
Related jobs
-
机器人VLA算法研究员 - XiaomiRobotics CNY 500K-500KDeep learning | Diffusion Models | Language Models | Machine Learning | Mixture of ExpertsEntry-level Full Time北京13d ago
-
Asset pricing | Backtesting | Convexity | Credit Analysis | Data AnalysisExecutive-level Full TimeShanghai, China15d ago
-
Behavior Cloning | Deep learning | Imitation Learning | Machine Learning | PyTorchMid-level Full TimeChina18d ago
-
Algorithmic trading | Autocorrelation | Autoregression | Backtesting | C++Financial wellness tools | Free meals | Gym reimbursement | Hybrid work | Paid time offNone Full TimeShanghai30d ago