Senior Machine Learning Engineer, ML Infrastructure - Online
Tasks
- Build model deployment workflows
- Design and operate online inference infrastructure
- Implement autoscaling and self-healing
- Implement canary testing and A/B experimentation
- Improve observability of online ML systems
- Lead architectural improvements for reliability, scalability, and cost efficiency
- Optimize inference performance
- Optimize model packaging and deployment automation
- Serve production machine learning models with low latency
- Split traffic and perform rollback
- Validate and monitor model performance in production
Perks/Benefits
- Commute subsidy
- Competitive retirement pension plans
- Employee resource groups
- Employee stock ownership
- Generous vacation
- Global employee assistance program
- Mental health and wellbeing programs
- Training and development
- Volunteering and donation matching
Skills/Tech-stack
A/B Testing | Autoscaling | Canary testing | Cost monitoring | Distributed Systems | Dynamic batching | Error Rate Monitoring | GPU Acceleration | GPU kernel optimization | Google Kubernetes Engine | Inference Optimization | Kubernetes | Latency optimization | Machine Learning | Model Deployment | Model compilation | Monitoring | NVIDIA Triton Inference Server | Observability | PyTorch | Python | Quantization | Ray Serve | Request Scheduling | Rollback | Runtime tuning | TensorFlow Serving | Throughput Optimization | TorchServe | Traffic splitting
Education
N/A