Deep Learning Solution Architect
Tasks
- Architect end to end LLM solutions
- Collaborate with customers on AI solution requirements
- Collaborate with engineering teams on technical feedback
- Design RAG workflows
- Design agentic inference pipelines
- Integrate LLM pipelines into customer systems
- Lead LLM training and fine tuning
- Optimize LLM inference throughput latency and memory
- Orchestrate agentic inference workflows
- Provide pre sales technical support for workshops and demos
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic Inference | CUDA | Distributed Training | Docker | GPU Computing | GPU parallelism | Hugging Face | Hugging Face Transformers | KV cache | Kubernetes | Language Models | Large Language Models | Memory Optimization | Multi GPU Parallelism | Multi-GPU | PyTorch | Python | Quantization | RAG
Education
Related jobs
-
Senior-level Full TimeChina, Shanghai1h ago
-
Ai多模态研究实习生(有留用机会) CNY 25K-37KClustering | DBSCAN | Data Visualization | Embeddings | FaissMentorship | Real world production data exposure | Return OfferEntry-level Internship广州、北京5h ago
-
Mid-level Full Time深圳6h ago
-
Mid-level Full Time上海6h ago
-
Ai多模态研究实习生(有留用机会) CNY 25K-37KAttention | Clustering | DBSCAN | Data Visualization | EmbeddingsConversion to full time offer | Mentorship | On-the-job trainingEntry-level Internship广州、北京6h ago
-
Entry-level Full Time广州7h ago
-
Ai多模态研究实习生(有留用机会) CNY 25K-37KAttention Mechanisms | Clustering | DBSCAN | Data Analysis | Data ProcessingCareer growth | Full-time conversion opportunity | Mentorship | Real world production dataEntry-level Internship广州、北京7h ago
-
Senior-level Full Time上海23h ago
-
Senior-level Full Time上海23h ago
-
Mid-level Full TimeAIA ED (Shanghai) Hongkou, China1d ago
-
Embedded Test Engineer CNY 180K-360KARM | Automated testing | Bus analyzer | C plus plus | C#Creativity culture | Sustainability focused work | Team collaborationMid-level Full TimeChengdu - China1d ago
-
C++ | CUDA | Embodied AI | GPU Computing | LinuxCompetitive salary | Comprehensive benefits packageMid-level Full TimeChina, Shanghai1d ago
-
Senior-level Full TimeChina, Beijing1d ago
-
Senior-level Full TimeChina, Shanghai1d ago
-
Ai算法暑期实习生(Llm/强化学习) CNY 36K-37KDPO | Deep learning | Language Models | Large Language Models | Machine LearningFull-time internship | On-site internshipEntry-level Internship北京1d ago
-
Mid-level Full Time深圳、上海1d ago
-
实习-AI模型使用(Safety服务方向) CNY 25K-37KAdversarial Attacks | CI/CD | CNN | Data poisoning | Deep learningEntry-level Internship上海1d ago
-
实习-Ai研究员-大语言模型/视觉语言模型算法与后训练(博士优先) CNY 25K-37KAI Feedback | Direct Preference Optimization | Efficient Fine Tuning | Fine Tuning | FlaxEntry-level Internship上海1d ago
-
Ai算法暑期实习生(Llm/强化学习) CNY 36K-37KDPO | Deep learning | Language Models | Large Language Models | Machine LearningFull-time internship | Onsite internshipEntry-level Internship北京1d ago
-
Executive-level Full TimeHangzhou, China2d ago
-
AI Governance | APIs | AWS | Adversarial Testing | Automated EvaluationExecutive-level Full TimeHangzhou, China2d ago
-
Executive-level Full TimeHangzhou, China2d ago
-
Efficient AI Solutions Engineering Intern CNY 28K-50KC++ | Deep learning | Language Models | Large Language Models | LinuxOn-site workEntry-level Full Time InternshipCHN - Beijing, China2d ago
-
Algorithm Developer IV CNY 400K-540KActive Learning | Bayesian optimization | CI/CD | CUDA | Code ReviewsCareer development support | Health and wellbeing programs | Relocation assistance | Travel opportunities | Work-life supportSenior-level Full TimeHangzhou,CHN, China2d ago
-
Executive-level Full TimeHangzhou, China2d ago