Deep Learning Solution Architect
Tasks
- Architect end-to-end LLM solutions
- Collaborate with customers on AI solution requirements
- Collaborate with engineering teams on technical feedback
- Design RAG workflows
- Design agentic inference pipelines
- Integrate LLM pipelines into customer systems
- Lead LLM training and fine-tuning
- Optimize LLM inference throughput, latency, and memory
- Orchestrate agentic inference workflows
- Provide pre-sales technical support for workshops and demos
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic Inference | CUDA | Distributed Training | Docker | GPU Computing | GPU parallelism | Hugging Face | Hugging Face Transformers | KV cache | Kubernetes | Language Models | Large Language Models | Memory Optimization | Multi GPU Parallelism | Multi-GPU | PyTorch | Python | Quantization | RAG