高级Ai系统开发工程师(大模型与Rag方向)
Tasks
- Build distributed inference services with caching
- Deploy and optimize LLM and VLM models
- Design RAG system architecture
- Develop knowledge base management and multi modal retrieval optimization
- Develop scalable AI microservices
- Guide mid level engineers
- Implement quantization compression and dynamic batching
- Integrate AI services with business systems
- Integrate vector databases into RAG
- Lead technical projects and cross-team collaboration
- Manage model training deployment and monitoring pipeline
- Optimize GPU CPU heterogeneous inference
- Orchestrate agent workflows and tool calling
Perks/Benefits
- N/A
Skills/Tech-stack
Agent workflow | Distributed Systems | Dynamic batching | Elasticsearch | GPU Optimization | Go | High concurrency | Java | LLM | MCP | Message queuing | Milvus | Model Quantization | MongoDB | Multi-Modal | Multi-modal Retrieval | MySQL | Python | RAG | Redis | SGLang | SQL | Spring | Spring Boot | Spring Cloud | Spring MVC | TensorRT-LLM | Transformer | VLLM | VLM | Vector Database
Education
Roles
AI | AI Systems Engineer | Backend | Backend Engineer | Engineer | Systems Engineer
Related jobs
-
运动健康算法工程师实习生 CNY 25K-37KAgent systems | Langchain | Langgraph | Language Models | Large Language ModelsEntry-level Internship北京7h ago
-
Entry-level Internship北京7h ago
-
Entry-level Internship北京7h ago
-
Miclaw-AI agent开发实习生 CNY 25K-37KAI Agent | Algorithms | Chain of thought reasoning | Chain-of-Thought | Data StructuresEntry-level Internship南京7h ago
-
硬件Ai学术追踪实习生 CNY 25K-37KAdaptive Noise Cancellation | Antenna design | Beamforming | Bluetooth | CSTEntry-level Internship北京7h ago
-
Miclaw-大模型训练推理方向实习生 CNY 25K-37KAttention Mechanism | C++ | CUDA | Compiler optimization | FlashAttentionEntry-level Internship北京7h ago
-
Miclaw-AI agent开发实习生 CNY 25K-37KAI Agent | Algorithm Design | Automated testing | Data Structures | Function CallingEntry-level Internship深圳7h ago
-
Entry-level Internship北京7h ago
-
Entry-level Full Time北京8h ago
-
Java开发工程师(大数据方向) CNY 180K-420KAB Testing | Apache Flink | Apache Spark | Data Mining | Distributed SystemsMid-level Full Time武汉8h ago
-
Senior-level Full Time北京9h ago
-
高级Ai运维工程师 CNY 240K-480KCompute resource management | Docker | Elasticsearch | Grafana | Incident ResponseSenior-level Full Time北京9h ago
-
Entry-level Internship上海、深圳9h ago
-
Llm应用研发实习生 CNY 37K-37KAlgorithms | C# | Data Structures | Go | JavaFull-time conversion opportunity | Mentorship | Open source community experienceEntry-level Internship北京10h ago
-
AI Test Engineer CNY 240K-480KAgent-based | Agent-based systems | Artificial Intelligence | Automated testing | Boundary testingHybrid work environment | Medical insurance | Remote work | Work-life balanceSenior-level Full TimeXi'an, Shaanxi, China1d ago
-
AI Test Engineer CNY 240K-480KAgent systems | Automated testing | Cause analysis | Deep learning | Defect TrackingInclusive environment | Medical insurance | Remote work hybrid | Safe working environment | Work-life balanceSenior-level Full TimeXi'an, Shaanxi, China1d ago
-
AI Test Engineer CNY 240K-480KAgent systems | Automated testing | Azure | CPU | Cause analysisComprehensive medical coverage | Hybrid work setup | Insurance coverage | Remote work | Work-life balanceSenior-level Full TimeXi'an, Shaanxi, China1d ago
-
C++ | Chunking | Code review | Context engineering | Continuous integrationEntry-level Full TimeBEIJING 04, China1d ago
-
Lead Software Engineer - AI/LLM for Virtuoso CNY 300K-480KC++ | Chunking | Code review | Context engineering | Continuous integrationSenior-level Full TimeBEIJING 04, China1d ago
-
Mid-level Full Time上海1d ago
-
Entry-level Full Time北京、深圳、上海1d ago
-
Data Engineering | Machine Learning | Model Deployment | PyTorch | PythonSenior-level Full TimeShanghai, China1d ago
-
Machine Learning Engineer - International SG CNY 28K-50KCloud Platforms | Containerization | Data Pipelines | Data Processing | Fine TuningMentorshipEntry-level Full TimeBeijing, China1d ago
-
Associate Principal Engineer(AI Architect) CNY 216K-264KAI Pipelines | Agentic AI | Autogen | BigQuery | Cloud RunMid-level Full TimeShanghai, China1d ago
-
Senior Software Engineer, 3D/4D Reconstruction CNY 417K-540K3D Computer Vision | Autonomous Driving | Computer Vision | Deep learning | Dense ReconstructionSenior-level Full TimeChina, Beijing2d ago