机器学习平台研发工程师/专家
Tasks
- Build and optimize distributed training systems
- Design and develop machine learning platform
- Develop resource scheduling and elastic scaling on Kubernetes
- Improve training stability with fault detection and automated failover
Perks/Benefits
- N/A
Skills/Tech-stack
Debugging | Distributed Training | Docker | Elastic scaling | Fault Tolerance | GPU | Go | Kubernetes | Kubernetes ecosystem | PyTorch | Python | System design
Education
Regions
Countries
States
Related jobs
-
机器人 Vln 大模型导航-实习生 CNY 25K-37KArtificial Intelligence | C++ | CUDA | Computer Vision | Data PipelinesOnsite workEntry-level Internship北京7h ago
-
Entry-level Internship南京7h ago
-
Entry-level Internship南京7h ago
-
Entry-level Internship南京7h ago
-
nlp算法工程师-2027届 CNY 25K-37KDeep learning | DeepSpeed | Information Retrieval | Intent Recognition | Language ProcessingInternshipEntry-level Internship武汉7h ago
-
Entry-level Full Time上海8h ago
-
Entry-level Internship深圳、上海9h ago
-
Entry-level Internship深圳9h ago
-
Entry-level Internship北京9h ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | LLM Agent | Machine Learning | PyTorch | RLHFConference participation | Internship experience | Research mentorshipEntry-level Internship深圳9h ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GRPOEntry-level Internship深圳9h ago
-
具身智能算法实习生(Vla预训练方向) CNY 25K-37KCLIP | Deep learning | LLaVA | Language Models | Large Language ModelsEntry-level Internship深圳9h ago
-
AI Agent 开发实习生(通用智能仿真方向) CNY 25K-37KAPI | API Integration | Agent architecture | Agent systems | Asynchronous programmingEntry-level Internship广州9h ago
-
Apache Airflow | Apache Spark | Automated testing | Data Lakes | Data WarehousesCommute subsidy | Disability insurance | Employee assistance program | Employee resource groups | Employee stock ownershipSenior-level Full TimeShanghai, China20h ago
-
Embedded Base Software Testing Engineer- Intern CNY 74K-100KC# | CAN | Excel | Hardware-in-the-loop | I2CEntry-level Full Time InternshipWuhan, Hubei, China20h ago
-
Senior Software Engineer (RAG Backend Developer) CNY 120K-180KA/B | A/B Testing | ABAC | Audit Logging | B testingSenior-level Full TimeGuangzhou, Guangdong, China R22h ago
-
Embedded Base Software Testing Engineer- Intern CNY 74K-100KC# | CAN | Excel | Hardware-in-the-loop | I2CEntry-level Full Time InternshipWuhan, Hubei, China1d ago
-
Magnetic Recording Algorithm Development Engineer CNY 150K-240KAlgorithm Development | Automated Test | Automated Test Equipment | C# | C++Senior-level Full TimeShenzhen, Guangdong Province, China1d ago
-
Assistant Manager, Data Platform Delivery CNY 300K-406KARMA | Amazon SageMaker | Association rule | Association rule learning | AzureMid-level Full TimeChina - Guangzhou1d ago
-
Mid-level Full TimeShanghai, Shanghai, China1d ago
-
Senior-level Full TimeShenyang - PIC, China1d ago
-
Mid-level Full Time深圳2d ago
-
Mid-level Full Time深圳2d ago
-
Senior-level Full TimeShanghai, CN, 2012033d ago
-
Data Engineer CNY 360K-600KAPIs | Airflow | Alerting | Anonymization | CI/CDFlexible working models | Health and wellbeing benefits | Professional learning and developmentSenior-level Full TimeShanghai, CN, 2012033d ago