aijobs.net

Staff Machine Learning Engineer, ML Infrastructure - Online

Shanghai, China

CNY 360K-600K (estimate) Senior-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

A/B | A/B Experimentation | Autoscaling | Caching | Canary testing | Deployment Automation | Distributed Systems | Dynamic batching | Error Rate Monitoring | Error rate | GKE | GPU Acceleration | GPU Kernel | GPU kernel optimization | Google Kubernetes | Google Kubernetes Engine | Inference Server | Kernel optimization | Kubernetes | Kubernetes Engine | Latency optimization | Model Validation | Model compilation | Monitoring | NVIDIA Triton | NVIDIA Triton Inference | NVIDIA Triton Inference Server | Observability | PyTorch | Python | Quantization | Rate monitoring | Ray | Ray Serve | Rollback | Runtime tuning | TensorFlow Serving | Throughput Optimization | Torchserve | Traffic splitting | Triton Inference | Triton Inference Server

Education

N/A

Roles

Engineer | Infrastructure Engineer | Learning Engineer | ML Engineer | Machine Learning Engineer | Machine Learning Infrastructure Engineer

Regions

Asia/Pacific

Countries

China

States

Shanghai, CN

Cities

Shanghai, Shanghai, CN

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs