大模型训练调优工程师
Tasks
- Apply efficient finetuning with LoRA
- Build multimodal model training pipeline
- Collaborate on model architecture optimization
- Configure distributed training with DDP
- Design training quality evaluation and anomaly monitoring
- Develop PEFT and adapter training
- Implement multimodal data loading
- Maintain pretrained finetuning evaluation deployment workflow
- Monitor training stability convergence
- Optimize training performance with mixed precision and compilation
- Optimize training pipeline scheduling
Perks/Benefits
- N/A
Skills/Tech-stack
Adapter | Anomaly Detection | CI/CD | DDP | DeepSpeed | Distributed Training | FSDP | GPU resource scheduling | LoRA | Mixed Precision | Multimodal Learning | PEFT | PyTorch | Resource scheduling | Training Optimization | Transformer
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Regions
Countries
States
Related jobs
-
Entry-level Internship上海、深圳 R8h ago
-
Mid-level Full Time北京 R12h ago
-
Deep Learning Compiler CI/Infrastructure Engineer CNY 160K-240KAI Agents | Agent workflows | Artifact management | Automated triage | AutomationGenerous benefits packageSenior-level Full TimeChina, Shanghai R2d ago
-
API Design | AWS | Agent Loop | Agent Orchestration | Async workflowsSenior-level Full TimeShenzhen, Guangdong Province, China - Remote R2d ago
-
MiMo-大模型训练框架开发工程师 CNY 240K-480KC++ | CI/CD | DeepSpeed | Distributed Training | GPU Memory OptimizationEntry-level Full Time北京 R7d ago
-
Mid-level Full Time北京 R7d ago
-
具身智能算法工程师-模型 CNY 500K-500KDeep learning | Distributed Training | IQL | Inference Optimization | Isaac LabMid-level Full Time北京 R7d ago
-
Senior Software Engineer (RAG Backend Developer) CNY 120K-180KA/B | A/B Testing | ABAC | Audit Logging | B testingSenior-level Full TimeGuangzhou, Guangdong, China R9d ago
-
Senior Consultant Specialist (AI Architect/Tech Lead) CNY 144K-192KAPI Design | AWS | Alibaba Cloud | Automation | CI/CDSenior-level Full TimeGuangzhou, Guangdong, China R23d ago
-
Ai 院--多模态团队--多模态理解算法研究员-强化学习方向 CNY 240K-480KDPO | Data Preprocessing | Data cleaning | DeepSpeed | Distributed TrainingSenior-level Full Time北京 R29d ago
-
AI基础设施研发工程师(Sandbox / 容器化)-MiMo CNY 180K-420KAppArmor | Argo Workflows | CI/CD | CPU resource scheduling | CgroupMid-level Full Time北京 R1mo ago
-
Entry-level Full Time北京、上海 R1mo ago
-
AI ML Engineer CNY 280K-360KAWS | Azure | C++ | Cloud Computing | Computer VisionPerformance bonuses | Professional development opportunities | Remote workMid-level Full TimeShenzhen, Guangdong Province, China R1mo ago
-
AI工程师-Agent Memory & RAG 方向(成都) CNY 240K-480KBERT | Chroma | Cross-Encoder | Embedding Models | FaissSenior-level Full Time成都 R1mo ago
-
AI工程师-Agent Memory & RAG 方向(武汉) CNY 240K-480KAlgorithms | BERT | Chroma | Cross-Encoder | Data StructuresSenior-level Full Time武汉 R1mo ago
-
AI工程师-Agent Memory & RAG 方向(北京) CNY 240K-480KBERT | Chroma | Cross-Encoder | Embedding Models | FaissSenior-level Full Time北京 R1mo ago
-
AWQ | AWS | Accelerate | Azure | BatchingMid-level Full TimeShenzhen, Guangdong, China R1mo ago
-
Generative AI - ML System Engineering CNY 360K-600KC++ | CUDA | Compilation | Data pipeline | Diffusion ModelsFully remote option | On-site work flexibilitySenior-level Full TimeShanghai R1mo ago
-
Mid-level Full Time上海、深圳 R1mo ago