Senior Manager, Deep Learning Performance Architecture
Tasks
- Analyze deep learning networks
- Characterize deep learning workloads
- Collaborate with software framework teams and hardware architecture teams
- Communicate team vision to senior management
- Develop engineering policies and procedures
- Drive hardware-software co-design
- Drive next generation deep learning hardware software architecture
- Establish team objectives and schedules
- Manage deep learning performance architecture team
- Optimize deep learning software stack
- Tune and analyze performance
Perks/Benefits
Skills/Tech-stack
CPU architecture | CUDA | Calculus | Co-design | Deep learning | GPU Architecture | Hardware-Software Co-design | Hardware/software | LLVM | Linear Algebra | MLIR | OpenCL | Performance Tuning | Software Co-design | Software Design | TVM | XLA
Education
Related jobs
-
Senior Software Engineer - Robot Compute Platform CNY 240K-480KC# | C++ | CAN bus | CUDA | Deterministic systemsSenior-level Full TimeShanghai, China1d ago
-
Senior Gen AI Software Solutions Engineer CNY 240K-360KAutogen | C++ | Deep learning | Edge AI | EmbeddingsOn-site work modelSenior-level Full TimeCHN - Minhang, China1d ago
-
优才-多模态交互算法工程师-X-Lab CNY 240K-480KAttention | Benchmarking | Computer Vision | Deep learning | Hard Negative MiningSenior-level Full Time上海、深圳1d ago
-
Mid-level Full Time北京 R1d ago
-
Robotic Embodied AI Engineer CNY 300K-355KAction Transformers | Action models | Autonomous Navigation | Computer Vision | Deep learningMid-level Full TimeBeijing, Beijing, China1d ago
-
Mid-level Full Time杭州1d ago
-
Deep Learning Performance Architect CNY 360K-600KAgile | C# | C++ | CPU Performance Optimization | CPU performanceSenior-level Full TimeChina, Shanghai2d ago
-
System Software Engineer - Autonomous Vehicles CNY 360K-600KC# | C++ | CPU | CUDA | Computer VisionSenior-level Full TimeChina, Shanghai2d ago
-
Mid-level Full TimeChina, Shanghai2d ago
-
Senior AI Training Performance Engineer CNY 144K-240KC++ | CUDA | Computer Architecture | Deep learning | GPU ArchitectureSenior-level Full TimeChina, Shanghai2d ago
-
Mid-level Full TimeChina, Shanghai2d ago
-
Senior Developer Relations Manager CNY 360K-600KAI Native Database | AI for Science | AI-native | APIs | Agentic AISenior-level Full TimeChina, Beijing2d ago
-
MiMo-大模型训练框架开发工程师 CNY 240K-480KC++ | CI/CD | DeepSpeed | Distributed Training | GPU Memory OptimizationEntry-level Full Time北京 R2d ago
-
Senior-level Full Time北京2d ago
-
机器人VLA算法研究员 - XiaomiRobotics CNY 500K-500KAction Generation | Computer Vision | Data pipeline | Deep learning | Diffusion ModelsEntry-level Full Time北京2d ago
-
Mid-level Full Time北京 R2d ago
-
具身世界模型训练INFRA工程师 - XiaomiRobotics CNY 180K-360KAPI Development | Deep learning | Distributed Training | Fault Tolerance | Machine LearningMid-level Full Time北京2d ago
-
具身智能算法工程师-模型 CNY 500K-500KDeep learning | Distributed Training | IQL | Inference Optimization | Isaac LabMid-level Full Time北京 R2d ago
-
Senior-level Full Time上海、北京3d ago
-
机器人 Vln 大模型导航-实习生 CNY 25K-37KArtificial Intelligence | C++ | CUDA | Computer Vision | Data PipelinesOnsite workEntry-level Internship北京3d ago
-
Entry-level Internship南京3d ago
-
Entry-level Internship南京3d ago
-
nlp算法工程师-2027届 CNY 25K-37KDeep learning | DeepSpeed | Information Retrieval | Intent Recognition | Language ProcessingInternshipEntry-level Internship武汉3d ago
-
Entry-level Internship深圳、上海3d ago
-
Senior-level Full TimeShenyang - PIC, China4d ago