Software Engineer, AI and DL Kernel Libraries
Tasks
- Analyze and tune workload performance
- Build deep learning library abstractions
- Collaborate across deep learning compiler and GPU teams
- Contribute to open-source inference projects
- Design scalable inference runtime infrastructure
- Develop AI inference software
- Generate code for GPU workloads
- Implement just in time compilation
- Optimize GPU kernels for AI workloads
Perks/Benefits
- N/A
Skills/Tech-stack
API Design | Apache TVM | C++ | CUDA | CUDA C | Code generation | Deep learning | GPU Programming | JAX | Just-in-Time | Just-in-time compilation | Kernel development | Linear Algebra | MLIR | ONNX | Performance Tuning | Profiling | PyTorch | Python | Software Architecture | TensorFlow | TensorRT | Triton
Education
Related jobs
-
Embedded Software Eng. CNY 180K-300KARM | ASPICE | Automotive Software | Automotive Software Development | C#Mid-level Full TimeWuhu, CN9h ago
-
Senior Software Engineer Control Software Embedded CNY 120K-180KAgile Development | Algorithms | Bug Fixing | CAN | Code Version ControlSenior-level Full TimeSuzhou - Industrial Park, China18h ago
-
AI/LLM Application Engineer CNY 280K-330KAPI | Access Control | Audit Logging | Authentication | AuthorizationMid-level Full TimeShenyang - PIC, China18h ago
-
AI/LLM Application Engineer CNY 280K-330KAccess Control | Audit Logging | Backend Development | Citation Generation | Document chunkingMid-level Full TimeShenyang - PIC, China18h ago
-
AI/LLM Application Architect CNY 360K-600KAI architecture | API Integration | AWS Bedrock | Application development | Audit LoggingSenior-level Full TimeShenyang - PIC, China18h ago
-
Senior Software Engineer - Robot Compute Platform CNY 240K-480KC# | C++ | CAN bus | CUDA | Deterministic systemsSenior-level Full TimeShanghai, China1d ago
-
Motion Control Engineer - Actuator Control Algorithms CNY 360K-600KAnti Windup | BLDC | Cogging Compensation | Commutation | Control loopSenior-level Full TimeShanghai, China1d ago
-
Embodied AI Intern CNY 45K-50KC++ | Computer Vision | Deep learning | Gazebo | Isaac SimHands on industry scale data annotation experience | Onsite work three days per week | Structured mentoringEntry-level Internship Part TimeShanghai, China1d ago
-
CI/CD | Docker | ETL | FastAPI | FlaskEntry-level InternshipShanghai, YANGPU, China1d ago
-
Senior Gen AI Software Solutions Engineer CNY 240K-360KAutogen | C++ | Deep learning | Edge AI | EmbeddingsOn-site work modelSenior-level Full TimeCHN - Minhang, China1d ago
-
优才-多模态交互算法工程师-X-Lab CNY 240K-480KAttention | Benchmarking | Computer Vision | Deep learning | Hard Negative MiningSenior-level Full Time上海、深圳1d ago
-
Mid-level Full Time深圳 R1d ago
-
Mid-level Full Time北京 R1d ago
-
大模型算法研究员-MiMo CNY 500K-500KActive Learning | C++ | Curriculum learning | Data Generation | Data ProcessingEntry-level Full Time北京1d ago
-
Robotic Embodied AI Engineer CNY 300K-355KAction Transformers | Action models | Autonomous Navigation | Computer Vision | Deep learningMid-level Full TimeBeijing, Beijing, China1d ago
-
Gaming AI Engineer CNY 304K-380KAlgorithms | Automatic Speech Recognition | C# | C++ | Computer ArchitectureMid-level Full TimeShenzhen, Guangdong, China1d ago
-
ALSA | Android Audio | Android Audio HAL | Audio HAL | Audio feature developmentSenior-level Full TimeChengdu, Sichuan, China1d ago
-
Forward Deployed AI Engineer CNY 72K-96KAWS | Agile | Amazon Redshift | BigQuery | Cloud platformTravel up to 50 percentEntry-level Full Time Internship北京2d ago
-
Mid-level Full Time北京 R2d ago
-
Mid-level Full Time Temporary北京2d ago
-
Mid-level Full Time北京 R2d ago
-
Mid-level Full Time杭州2d ago
-
Regional Data & AI Engineer, Operations, Asia Pacific CNY 300K-380KArtificial neural networks | Clustering | Data Architecture | Data Governance | Data ModelingMid-level Full TimeShanghai, CN2d ago
-
[Pricing Data Engineering ] Staff Data Engineer I CNY 120K-180KAWS | Algorithms | Amazon EMR | Apache Airflow | Apache SparkSenior-level Full TimeShanghai, China2d ago
-
Magnetic Recording Algorithm Development Engineer CNY 144K-240KAlgorithm Development | Automated Test | Automated Test Equipment | C# | C++Senior-level Full TimeShenzhen, Guangdong Province, China2d ago