Senior Deep Learning Solution Architect
Tasks
- Analyze machine learning system bottlenecks
- Build acceleration libraries or frameworks
- Build feature and operator implementations
- Develop KV cache offloading frameworks
- Develop open source inference frameworks
- Drive R and D on distributed training performance
- Optimize LLM inference efficiency
- Optimize inference performance
- Support multi level cache offloading
Perks/Benefits
- N/A
Skills/Tech-stack
Accelerated computing | Computer Systems | Data Structures | Deep learning | Distributed Training | Heterogeneous computing | Inference Optimization | KV Cache Offloading | KV cache | LLM Inference | LLM Training | Parallel Computing | Performance Modeling | Performance optimization
Education
Related jobs
-
大模型算法实习生 CNY 36K-37KDeep learning | DeepSpeed | Distributed Training | Java | Machine LearningCollaborative team | Large NLP dataset access | Long term internship support | Technical mentorship | Technical resourcesEntry-level Internship北京、上海14h ago
-
AI Feedback | Deep learning | Direct Preference Optimization | Fine Tuning | Human FeedbackMid-level Full Time上海3d ago
-
Senior-level Full Time上海、武汉、北京3d ago
-
Senior-level Full Time上海3d ago
-
Mid-level Full Time上海3d ago
-
Senior-level Full Time上海3d ago
-
Embodied AI Intern CNY 45K-50KC++ | Computer Vision | Deep learning | Gazebo | Isaac SimHands on industry scale data annotation experience | Onsite work three days per week | Structured mentoringEntry-level Internship Part TimeShanghai, China5d ago
-
Mid-level Full Time Temporary北京5d ago
-
Mid-level Full Time北京 R5d ago
-
Mid-level Full Time杭州5d ago
-
Mid-level Full TimeChina, Shanghai6d ago
-
Senior AI Training Performance Engineer CNY 144K-240KC++ | CUDA | Computer Architecture | Deep learning | GPU ArchitectureSenior-level Full TimeChina, Shanghai6d ago
-
Mid-level Full TimeChina, Shanghai6d ago
-
Senior-level Full Time上海、北京7d ago
-
Entry-level Internship南京7d ago
-
Entry-level Internship南京7d ago
-
Entry-level Internship深圳7d ago
-
具身智能算法实习生(Vla预训练方向) CNY 25K-37KCLIP | Deep learning | LLaVA | Language Models | Large Language ModelsEntry-level Internship深圳7d ago
-
Embodied AI Research Intern CNY 25K-37KAction Model | Agentic AI | Auto-labeling | CLIP | Computer VisionEntry-level Full Time Internship深圳、上海7d ago
-
大模型算法工程师(Memory/RAG/意图识别) CNY 180K-360KComputer Vision | Data Processing | Data cleaning | Data labeling | Dataset developmentEntry-level Full Time深圳10d ago
-
AI应用研发工程师(架构设计/RAG/Agent) CNY 180K-300KCI/CD | Context Management | Data pipeline | Deep learning | DockerMid-level Full Time深圳10d ago
-
Ai算法工程师 CNY 144K-240KBig Data | Big data processing | Data Processing | Deep learning | Feature EngineeringEntry-level Full Time深圳12d ago
-
【校招实习】Ai算法工程师 CNY 25K-37KComputer Vision | Data Analysis | Deep learning | Feature Engineering | HadoopInternship opportunityEntry-level Internship深圳12d ago
-
Entry-level Internship深圳12d ago
-
Manager, AI / Data Scientist CNY 300K-380KClustering | Convolutional Neural Network | Data Analysis | Data Preparation | Data QualityMid-level Full TimeAIA AI Tech ED (Shanghai) Hongkou, …13d ago