Senior AI Systems Development Engineer (LLM and RAG Focus)
Tasks
- Build distributed inference services
- Build model training, deployment, and monitoring pipelines
- Deploy and optimize LLMs and VLMs
- Design LLM and RAG system architecture
- Develop agent workflow orchestration
- Develop high-concurrency AI microservices
- Implement quantization and dynamic batching
- Implement tool calling via the Model Context Protocol (MCP)
- Integrate AI services with business systems
- Integrate vector databases
- Lead technical projects and coordinate teams
- Optimize RAG retrieval and caching
Perks/Benefits
- N/A
Skills/Tech-stack
Agent workflow | Caching | Distributed Systems | Dynamic batching | Elasticsearch | GPU Inference | Go | High concurrency | Java | MCP | Message Queue | Microservices | Milvus | Model Deployment | MongoDB | Multi-Modal | Multi-modal Retrieval | MySQL | Python | Quantization | RAG | Redis | SGLang | Spring Boot | Spring Cloud | TensorRT-LLM | Transformer | vLLM | Vector Database
Education