Generative AI - ML System Engineering
Tasks
- Build and scale high-throughput 3D data pipelines
- Build data-loading frameworks and libraries
- Build an end-to-end machine learning framework for 3D
- Debug and monitor hardware platform performance
- Design training pipelines for pretraining and fine-tuning
- Develop inference pipelines for diffusion models
- Implement custom operators with CUDA
- Implement custom operators with Triton
- Optimize distributed model training across GPUs
- Optimize models using compilation
- Optimize models using fusion
- Optimize models using quantization
Skills/Tech-stack
C++ | CUDA | Compilation | Data pipeline | Diffusion Models | Distributed Training | Fusion | GPU Performance | GPU Performance Optimization | JAX | Machine Learning | Model Parallelism | Performance optimization | PyTorch | Python | Quantization | Tensor programming | Triton
Education
N/A