Large Model Training and Tuning Engineer
Tasks
- Analyze training stability and convergence
- Build multimodal model training systems
- Collaborate with research teams on model and training optimization
- Design training quality evaluation and anomaly monitoring
- Develop parameter-efficient fine-tuning strategies
- Implement distributed training acceleration
- Incorporate mixed-precision and compilation optimizations
- Maintain pretraining, fine-tuning, evaluation, and deployment pipelines
- Optimize training pipeline scheduling
Perks/Benefits
- N/A
Skills/Tech-stack
Adapter | CI/CD | CUDA | DDP | DeepSpeed | FSDP | LoRA | Mixed Precision | PEFT | PyTorch | Transformer
Education
Bachelor of Engineering | Bachelor of Science | Master of Science