Deep Learning Performance Software Engineer
Tasks
- Design and implement optimized deep learning kernels
- Develop compilers and domain specific languages for deep learning workloads
- Improve compiler architecture for next-generation chips
- Perform performance analysis on AI workloads and integrate with AI frameworks
Perks/Benefits
- N/A
Skills/Tech-stack
C# | C++ | Deep learning | Domain-specific language | LLVM | LLVM IR | MLIR | TVM | XLA
Education
Related jobs
-
DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine LearningMid-level Full Time上海5h ago
-
Mid-level Full Time上海5h ago
-
Mid-level Full Time上海5h ago
-
Senior-level Full Time上海5h ago
-
Entry-level Full TimeSuzhou, Jiangsu, China15h ago
-
Senior System Software Engineer, Robotics CNY 144K-240KARM architecture | C# | C++ | CUDA | DeterminismSenior-level Full TimeChina, Shanghai23h ago
-
C plus plus | C# | Camera Calibration | Camera Synchronization | Camera systemsMid-level Full TimeShenzhen, Guangdong, China23h ago
-
Machine Learning Engineer CNY 216K-300KAndroid | C# | C++ | Embedded Systems | Inference OptimizationMid-level Full TimeShanghai, Shanghai, China23h ago
-
C plus plus | CUDA | Code generation | Compiler design | Domain-specific languageSenior-level Full TimeChina, Shanghai1d ago
-
Mid-level Full Time深圳1d ago
-
Ai算法工程师 CNY 180K-300KConvolutional Neural Networks | Data Mining | Data Warehouse | Data cleaning | Data labelingMid-level Full Time东莞1d ago
-
Algorithm Engineer CNY 360K-600K3D Geometry | BEV fusion | Bayesian estimation | C++ | CMakeFlexible working environment | Global development opportunities | Team-oriented environmentSenior-level Full TimeShanghai, SH, CN, 2018141d ago
-
Mid-level Full Time北京 R3d ago
-
Entry-level Full Time北京 R3d ago
-
Senior-level Full Time北京3d ago
-
机器人VLA算法研究员 - XiaomiRobotics CNY 500K-500KAction Generation | Data Engineering | Deep learning | Diffusion Models | Machine LearningEntry-level Full Time北京3d ago
-
Mid-level Full Time北京 R3d ago
-
Mid-level Full Time北京 R3d ago
-
Mid-level Full Time北京3d ago
-
Entry-level Internship深圳4d ago
-
AI Agent 开发实习生(通用智能仿真方向) CNY 25K-37KAI Agent | API Integration | Asynchronous programming | Autogen | C++Flexible learning | Internship | MentorshipEntry-level Internship广州4d ago
-
3D Gaussian Splatting | 3D Geometry | 3D Object Detection | Algorithms | Autoregressive GenerationMid-level Full Time北京、上海、苏州4d ago
-
Entry-level Internship广州5d ago
-
Robotaxi VLA 大模型算法实习生 CNY 25K-37KAutonomous Driving | C++ | Data labeling | Fine Tuning | Functional SafetyEntry-level Internship广州5d ago
-
Deep Learning Performance Architect, CUTLASS DSL Testing CNY 360K-600KAutomated testing | Code Coverage | GPU Computing | MLIR | PythonSenior-level Full TimeChina, Shanghai5d ago