Developer Technology Engineer - AI
Tasks
- Build and optimize parallel algorithms and data structures on GPUs
- Collaborate with architecture and research teams to improve developer efficiency
- Develop and contribute to GPU and large language model frameworks and open source projects
- Improve distributed training and inference communication libraries
- Optimize GPU kernels and operators
- Optimize collective communication and data transfer strategies
- Optimize training and inference for large language models
- Tune instructions and optimize compilers
Perks/Benefits
- N/A
Skills/Tech-stack
C# | C++ | CUBLAS | CUDA | CUDNN | Cutlass | Direct memory access | Distributed Systems | FlashAttention | FlashInfer | Fortran | Infiniband | Linear Algebra | Megatron | Memory access | NVIDIA NCCL | NVSHMEM | Numerical Methods | Parallel Programming | Python | Remote Direct Memory Access | RoCE | TensorRT | TensorRT-LLM
Education
Roles
Related jobs
-
Applied AI Engineer - Silicon Co-Design Group CNY 300K-480KAgent Framework | Autogen | C# | C++ | CrewAISenior-level Full TimeChina, Shanghai20h ago
-
Data Analysis Engineer CNY 25K-37KArtificial Intelligence | Dashboard Development | Data Analysis | Data Visualization | Data cleaningEntry-level Full TimeGuangzhou, China20h ago
-
Mid-level Full Time北京 R2d ago
-
大模型算法研究员-MiMo CNY 500K-500KActive Learning | C plus plus | Curriculum learning | Deep learning | Fine TuningMid-level Full Time北京2d ago
-
Miclaw-端云协同调度专家 (Hybrid AI Architect) CNY 240K-480KCloud Computing | Consistency protocols | Data Compression | Distributed Systems | Edge ComputingHybrid workSenior-level Full Time北京 R2d ago
-
Senior-level Full Time北京3d ago
-
None Full Time深圳、北京、上海3d ago
-
Entry-level Internship深圳、北京、上海3d ago
-
None Full Time深圳、北京、上海3d ago
-
Entry-level Internship深圳、北京、上海3d ago
-
Entry-level Full Time深圳、北京、上海3d ago
-
Entry-level Full Time深圳、北京、上海3d ago
-
Principal Engineer, Cloud Storage Architect CNY 74K-100KAWS S3 | Azure Blob | Azure Blob Storage | Blob Storage | Cloud ArchitectureEntry-level Full TimeShanghai, Shanghai, China3d ago
-
C++ | Channel coding | Communication Systems | DSP Toolbox | MATLABEntry-level Internship Part TimeShanghai (JingAn), China3d ago
-
Mid-level Full Time北京 R4d ago
-
Algorithms | Cloud API | Data Structures | Embeddings | Language ModelsEnglish-speaking environmentMid-level Full TimeChina4d ago
-
API Development | Algorithms | Cloud API | Cloud API development | Convex OptimizationMid-level Full TimeShenzhen4d ago
-
Data Platform Engineering Principal, VP CNY 360K-540KAI Assisted Development | AI-Assisted Development Tools | AWS | Application development | AzureEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysSenior-level Full TimeHangzhou, China4d ago
-
Staff AI Engineer CNY 300K-500KAWS | Agent Frameworks | Agent systems | Benchmarking | C++Autonomy to shape AI roadmap | Direct Access To Founding Team | End-to-end ownership | High impact production scaleSenior-level Full TimeShenzhen, Guangdong Province, China4d ago
-
Entry-level Full TimeDongguan (R&D), China4d ago
-
Data Platform Engineering Principal, VP CNY 360K-540KAmazon Web Services | Application development | CI/CD | Capacity Planning | Catalog ServicesFlexible work/life support | Inclusive development opportunities | Paid volunteer daysSenior-level Full TimeHangzhou, China4d ago
-
Senior-level Full TimeSHANGHAI, China4d ago
-
Internship in Innovation & Sustainability - Computer or Data Science, Mathematics or Engineering CNY 28K-50KAI Assisted Review | API Integration | Assisted Review | Automation | Data benchmarkingEntry-level Full Time InternshipSHANGHAI, China4d ago
-
AI Engineer (Biometrics and Data Science) CNY 272K-370KA/B | A/B Testing | Auto Testing | B testing | CI/CDMid-level Full TimeShanghai Jing'An Office, China4d ago
-
Senior-level Full TimeBeijing Yizhuang, China4d ago