Senior Deep Learning Solution Architect
Tasks
- Analyze machine learning system bottlenecks
- Build acceleration libraries or frameworks
- Build feature and operator implementations
- Develop KV cache offloading frameworks
- Develop open source inference frameworks
- Drive R&D on distributed training performance
- Optimize LLM inference efficiency
- Optimize inference performance
- Support multi-level cache offloading
Perks/Benefits
- N/A
Skills/Tech-stack
Accelerated computing | Computer Systems | Data Structures | Deep learning | Distributed Training | Heterogeneous computing | Inference Optimization | KV Cache Offloading | KV cache | LLM Inference | LLM Training | Parallel Computing | Performance Modeling | Performance optimization
Education