Large Model Platform & Infra Engineer
Tasks
- Build MLOps toolchain for model versioning
- Design distributed training architecture
- Develop evaluation platform for LLMs and VLMs
- Implement distributed reinforcement learning training environment
- Integrate CI/CD for evaluation reports
- Optimize model inference performance
Perks/Benefits
- N/A
Skills/Tech-stack
Airflow | C++ | CI/CD | CUDA | DeepSpeed | FSDP | Go | Kubernetes | MLOps | Megatron-LM | NCCL | PyTorch | Python | Quantization | RDMA | Ray | TensorFlow | TensorRT | vLLM
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD