大模型训练调优实习生
Tasks
- Analyze training stability and convergence
- Apply PEFT and adapter based finetuning
- Build multimodal model training pipeline
- Collaborate on model architecture system bottleneck analysis
- Coordinate multimodal data loading
- Develop efficient finetuning strategies like LoRA
- Implement DDP distributed training acceleration
- Improve training performance with mixed precision
- Maintain pretraining finetuning evaluation deployment workflow
- Monitor training quality and anomalies
- Optimize resource utilization efficiency
- Optimize training pipeline multi task scheduling
- Schedule multi GPU training resources
- Use heterogeneous acceleration and compiler optimization
Perks/Benefits
- N/A
Skills/Tech-stack
Adapter | CI/CD | DDP | DeepSpeed | FSDP | LoRA | Mixed Precision | Multi-GPU | PEFT | PyTorch | Transformer
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Roles
Engineer | Intern | Learning Engineer | Machine Learning Engineer | Research Intern
Regions
Countries
States
Related jobs
-
Entry-level Internship北京10h ago
-
Entry-level Internship北京10h ago
-
Miclaw-AI agent开发实习生 CNY 25K-37KAI Agent | Algorithms | Chain of thought reasoning | Chain-of-Thought | Data StructuresEntry-level Internship南京10h ago
-
硬件Ai学术追踪实习生 CNY 25K-37KAdaptive Noise Cancellation | Antenna design | Beamforming | Bluetooth | CSTEntry-level Internship北京10h ago
-
Miclaw-AI agent开发实习生 CNY 25K-37KAI Agent | Algorithm Design | Automated testing | Data Structures | Function CallingEntry-level Internship深圳10h ago
-
高级Ai系统开发工程师(大模型与Rag方向) CNY 240K-480KAgent workflow | Distributed Systems | Dynamic batching | Elasticsearch | GPU OptimizationSenior-level Full Time武汉11h ago
-
Senior-level Full Time北京12h ago
-
高级Ai运维工程师 CNY 240K-480KCompute resource management | Docker | Elasticsearch | Grafana | Incident ResponseSenior-level Full Time北京12h ago
-
AI Engineer CNY 216K-264KAPI Integration | Chroma | Document processing | ERP integration | EmbeddingsMid-level Full TimeShenzhen1d ago
-
Applied Scientist Intern, 2026 Shenzhen CNY 25K-37KComputer Vision | Deep learning | Diffusion Models | Language Models | Language ProcessingEntry-level Full Time InternshipShenzhen, CHN1d ago
-
Entry-level Full Time北京、深圳、上海1d ago
-
Data Engineering | Machine Learning | Model Deployment | PyTorch | PythonSenior-level Full TimeShanghai, China1d ago
-
Machine Learning Engineer - International SG CNY 28K-50KCloud Platforms | Containerization | Data Pipelines | Data Processing | Fine TuningMentorshipEntry-level Full TimeBeijing, China2d ago
-
Senior Software Engineer, 3D/4D Reconstruction CNY 417K-540K3D Computer Vision | Autonomous Driving | Computer Vision | Deep learning | Dense ReconstructionSenior-level Full TimeChina, Beijing2d ago
-
Senior-level Full TimeLOC3254: No.3239 Shenjiang Road, Shanghai, Pudong …2d ago
-
Senior-level Full TimeLOC3254: No.3239 Shenjiang Road, Shanghai, Pudong …2d ago
-
Mid-level Full Time北京2d ago
-
Senior-level Full Time北京2d ago
-
Senior-level Full Time北京2d ago
-
软件工程师 - 模型训练基础建设 CNY 180K-360KCI/CD | Containerization | Data Preprocessing | Deep learning | DeepSpeedEntry-level Full Time广州2d ago
-
大模型算法工程师--c端方向 CNY 240K-480KChain-of-Thought | Deep search | Information Retrieval | LLM Inference | LLM TrainingMid-level Full Time北京3d ago
-
大模型算法实习生 CNY 36K-37KDeep learning | DeepSpeed | Distributed Training | GPU Training | JavaLarge scale text data access | Stable internship opportunity | Supportive team environment | Technical mentorshipEntry-level Internship北京、上海5d ago
-
大模型算法-校招 CNY 500K-500KDeep learning | DeepSpeed | Distributed Training | GPU Training | Information ExtractionLarge-scale datasets | NLP application projects | Relaxed team atmosphere | Technical mentorshipEntry-level Full Time上海、北京5d ago
-
Senior-level Full TimeChina6d ago
-
Research Intern (AI Agent) CNY 25K-37KAgent systems | Embodied AI | Language Models | Large Language Models | Memory-augmented systemsEntry-level Full Time Internship深圳6d ago