Solutions Architect - Top AI Labs
Tasks
- Accelerate LLM inference and training
- Analyze machine learning system bottlenecks
- Build and optimize KV cache offloading frameworks
- Develop acceleration libraries and frameworks
- Develop open source inference frameworks
- Drive distributed training performance research
- Perform performance analysis and optimization
Perks/Benefits
- N/A
Skills/Tech-stack
Artificial Intelligence | C++ | Computer Systems | Data Structures | Distributed Computing | Distributed Training | Heterogeneous Computing | High Performance | High-Performance Computing | KV Cache Offloading | KV Cache | LLM Inference | LLM Training | Language Models | Large Language Models | Machine Learning | Open Source | Parallel Computing | Performance Computing | Performance Modeling | Performance Optimization | Python
Roles
AI | AI Solutions | AI Solutions Architect | Architect | Solutions Architect