Deep Learning Compiler Engineer - CUDA
Tasks
- Design DSL for tile aware GPU programming model
- Implement core compiler for tile aware GPU programming model
- Integrate compiler with AI ML frameworks
- Integrate solutions into DSL and compiler stack
- Investigate next-generation GPU architectures
- Optimize compiler architecture for performance
- Perform performance analysis for AI LLM workloads
Perks/Benefits
Skills/Tech-stack
AI/ML | AI/ML Integration | C# | C++ | Compiler design | Computer Architecture | DSL | Distributed communication | GPU Architecture | Kernel programming | LLVM | ML integration | MLIR | Multi-GPU | Parallel Computing | Performance Analysis | TVM | Triton
Education
Related jobs
-
Machine Learning Engineer CNY 300K-380KArtifact tracking | Data Lineage | Data Pipelines | Distributed Systems | DockerFitness Events | Free meals | Hybrid working | Paid time off | Volunteer opportunitiesMid-level Full TimeShanghai, China8h ago
-
机器人 Vln 大模型导航-实习生 CNY 25K-37KArtificial Intelligence | C++ | CUDA | Computer Vision | Data PipelinesOnsite workEntry-level Internship北京16h ago
-
Entry-level Internship南京16h ago
-
Entry-level Internship南京16h ago
-
Entry-level Internship南京16h ago
-
AI Agent 开发实习生(通用智能仿真方向) CNY 25K-37KAPI | API Integration | Agent architecture | Agent systems | Asynchronous programmingEntry-level Internship广州17h ago
-
嵌入式软件工程师_DCDC应用层软件 Embedded Software Eng.(天津) CNY 180K-300KAgile Development | Automotive Software | C# | ISO 26262 | MATLABMid-level Full TimeTianjin, CN, 01d ago
-
Embedded Base Software Testing Engineer- Intern CNY 74K-100KC# | CAN | Excel | Hardware-in-the-loop | I2CEntry-level Full Time InternshipWuhan, Hubei, China1d ago
-
Embedded Base Software Testing Engineer- Intern CNY 74K-100KC# | CAN | Excel | Hardware-in-the-loop | I2CEntry-level Full Time InternshipWuhan, Hubei, China1d ago
-
Magnetic Recording Algorithm Development Engineer CNY 150K-240KAlgorithm Development | Automated Test | Automated Test Equipment | C# | C++Senior-level Full TimeShenzhen, Guangdong Province, China1d ago
-
Mid-level Full TimeShanghai, Shanghai, China1d ago
-
Senior-level Full TimeShenyang - PIC, China1d ago
-
Senior-level Full TimeShenyang - PIC, China1d ago
-
Mid-level Full Time深圳2d ago
-
Mid-level Full Time深圳2d ago
-
Ai研发工程师(云服务与大模型部署) CNY 180K-300KC++ | CI/CD | Cloud Computing | Distributed Systems | Edge ComputingMid-level Full Time深圳3d ago
-
Entry-level Full Time深圳3d ago
-
Algorithms | Android | C# | C++ | Computer ArchitectureSenior-level Full TimeShanghai, China4d ago
-
Entry-level Internship深圳5d ago
-
Mid-level Full Time上海5d ago
-
Software Development Engineer - C# CNY 180K-300K.NET | 21 CFR Part 11 | ASP.Net Core | Agile | Async/AwaitMid-level Full TimeShanghai, China6d ago
-
Sr. Software Development Engineer C# CNY 120K-180K.NET | 21 CFR Part 11 | ASP.Net Core | Agile | Agile ScrumSenior-level Full TimeShanghai, China6d ago
-
Action models | C++ | Data Generation | Dataset curation | Deep learningSenior-level Full TimeChina, Shanghai6d ago
-
Staff Machine Learning Engineer CNY 300K-500KAndroid | C++ | Concurrency | Edge Computing | Embedded SystemsSenior-level Full TimeShenzhen, Guangdong, China6d ago
-
Machine Learning Engineer CNY 300K-420KAI Inference | AI acceleration | Android | C++ | Debugging ToolsEntry-level Full TimeXian, Shaanxi, China6d ago