Senior Deep Learning Solution Architect
Tasks
- Analyze machine learning system bottlenecks
- Build example code and acceleration libraries
- Collaborate with community on framework development
- Develop KV cache offloading frameworks
- Develop open source inference frameworks
- Drive distributed training performance research
- Improve inference efficiency across storage tiers
- Optimize model inference performance
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | Caching | Computer Architecture | Data Structures | Data transfer | Data transfer optimization | Deep learning | Distributed Training | Heterogeneous computing | High Performance | High-Performance Computing | KV cache | Language Models | Large Language Models | Machine Learning | Model Inference | Networking | Parallel Computing | Performance Computing | Performance Modeling | Performance optimization | Python | Transfer optimization
Education
Related jobs
-
Agentic Inference | CUDA | Distributed Training | Docker | GPU ComputingSenior-level Full TimeChina, Beijing18h ago
-
Agentic AI | Artificial Intelligence | GPU Computing | Generative AI | Human FeedbackSenior-level Full TimeChina, Shenzhen18h ago
-
Mid-level Full TimeChina, Shanghai18h ago
-
Executive-level Full TimeBeiJing, China1d ago
-
Entry-level Full TimeCHN - Minhang, China1d ago
-
Entry-level Full Time上海1d ago
-
Senior AI Software Solutions engineer CNY 240K-480KApache Flink | Apache Spark | C++ | CUDA | Convolutional Neural NetworkOn-site workSenior-level Full TimeCHN - Minhang, China2d ago
-
Solution Architect - Top AI Labs CNY 435K-500KAIGC | C++ | CUDA | Cloud Computing | Computer VisionSenior-level Full TimeChina, Beijing2d ago
-
Sr. Specialist BD, AI/ML, WWSO CNY 435K-540KAmazon SageMaker | Business Operations | Cloud Computing | Deal negotiation | Deep learningCareer growth | Mentorship | Work-life balanceSenior-level Full TimeShanghai, CHN2d ago
-
Mid-level Full Time深圳2d ago
-
Mid-level Full Time上海2d ago
-
Mid-level Full Time北京2d ago
-
驾舱一体专家/高级专家/总监 CNY 240K-480KComputer Vision | Data Generation | Data Preprocessing | Data cleaning | Deep learningSenior-level Full Time北京、上海、深圳、广州2d ago
-
Agent systems | LLM | Machine Learning | Multi-Agent | Multi-Agent SystemsMid-level Full Time深圳、上海2d ago
-
Entry-level Full Time InternshipBeijing2d ago
-
Mid-level Full Time北京2d ago
-
Miclaw-端云协同调度专家 (Hybrid AI Architect) CNY 240K-480K5G | Anthropic Claude | Anthropic Claude API | Claude API | Cloud ComputingHybrid workSenior-level Full Time北京2d ago
-
大模型算法专家 CNY 240K-480KAgentic RL | Language Models | Large Language Models | Linux | Multimodal GenerationSenior-level Full Time北京2d ago
-
大模型应用算法实习生 CNY 25K-37KAgentic Systems | C++ | Customer Service | Customer Service Automation | Deep learningOne-on-one mentorship | Technical workshopsEntry-level Internship上海4d ago
-
AI software engineer intern CNY 38K-50KComputer Vision | Deep learning | Fine Tuning | Inference Optimization | Language ModelsOn-site workEntry-level Full Time InternshipCHN - Minhang, China4d ago
-
Internship: Data Engineer CNY 38K-50KAPI Integration | Cloud platform | Data Pipelines | Data Preprocessing | Data RetrievalEntry-level InternshipMin Hang Qu, Shang Hai Shi, …5d ago
-
Staff Applied AI Scientist CNY 200K-500KBenchmarking | Cost Optimization | DPO | Deep learning | DistillationCross-functional collaboration | Direct impact with real customer data | Remote-friendly workSenior-level Full TimeShenzhen, Guangdong Province, China5d ago
-
IT Manager, AI Innovation & Enablement CNY 272K-370KAI orchestration | API Design | API Integration | Agile | Artificial IntelligenceMid-level Full TimeCN - Pudong, China5d ago
-
Entry-level Internship深圳6d ago
-
AI Tech Lead SGD 96K-132KArtificial Intelligence | Cloud Computing | Data Pipelines | Deployment strategy | LLM ApplicationsOn site customer work | TravelSenior-level Full Time Internship北京、新加坡、中国香港8d ago