大模型应用架构师
Tasks
- Build model gateway and adapter layer
- Build reusable agent development framework SDK and components
- Collaborate cross functionally to deliver product from requirements to launch
- Deploy high availability observable and canary services
- Design AI Agent application architecture
- Develop MCP skill tool framework
- Develop core agent modules
- Engineer inference planning decision capabilities for production services
- Implement concurrency control and rate limiting
- Implement error tolerance cost governance
- Implement knowledge base services
- Implement task planning and execution engine
- Implement timeout fallback and degradation
- Integrate inference engines with model gateway and scheduling
- Monitor first token and end to end latency
- Optimize agent pipeline performance and stability
- Track LLM MLLM and agent application trends
Perks/Benefits
- N/A
Skills/Tech-stack
Agent Frameworks | C++ | Caching | Canary Deployment | Concurrency Control | Coze | Dify | Execution Engine | Fallback mechanisms | Function Calling | Go | High Availability | Inference engine | Java | Kafka | LLM Inference | Langchain | Langgraph | Language Models | Large Language Models | Llamaindex | MCP | Model Scheduling | Model gateway | MongoDB | MySQL | Observability | Python | Rate Limiting | Redis | Retrieval-Augmented Generation | SGLang | Streaming | Task planning | TensorRT | TensorRT-LLM | Timeout handling | Token Management | VLLM | Vector Databases | Vector Search
Education
N/A
Related jobs
-
系统工程师(视觉感知方向) CNY 192K-300K3D Reconstruction | AI Vision | C plus plus | Camera Calibration | Computer VisionSenior-level Full Time深圳1d ago
-
高级运维工程师(Dba方向) CNY 240K-480KAccess Control | Alerting | Automation | Backup and Recovery | Capacity PlanningSenior-level Full Time深圳1d ago
-
2026届秋招-大数据开发工程师 CNY 144K-240KJava | Linux | Python | SQL | ScriptingMentorship | Regular technical sharing | Technical trainingNone Full Time上海1d ago
-
系统工程师(视觉感知方向) CNY 192K-300K3D Reconstruction | C++ | Camera Calibration | Computer Vision | Data ValidationSenior-level Full Time深圳1d ago
-
Entry-level Internship南京1d ago
-
Entry-level Internship南京1d ago
-
AI Agent开发工程师-汽车专项-实习 CNY 25K-37KAPI Design | Authentication | Autogen | Concurrency | Context ManagementEntry-level Internship上海1d ago
-
Dba工程师 CNY 50K-50KAlertmanager | Buffer Pool | Docker | GTID | GrafanaAnnual bonus | Flexible working hours | No blame incident postmortems | Stock incentivesEntry-level Full Time InternshipBeijing1d ago
-
Entry-level Full Time InternshipBeijing1d ago
-
Entry-level Full Time InternshipBeijing1d ago
-
IT Dept. AI Engineer_Tech. Foundation(上海) CNY 192K-240KAPI Integration | Access Control | Alibaba Cloud | Automation | Cloud servicesMid-level Full TimeShanghai, CN, 2018001d ago
-
Consultant Specialist CNY 300K-420KApache Airflow | Apache Beam | Artificial Intelligence | BigQuery | Cause analysisMid-level Full TimeGuangzhou, Guangdong, China1d ago
-
Senior-level Full TimeCN-OCG International Center, Cheng Du, China1d ago
-
Principal Specialist, AI Application CNY 360K-600KAgile | Angular | Cloud Platforms | Enterprise Integration | Generative AISenior-level Full TimeCN-OCG International Center, Cheng Du, China1d ago
-
Mid-level Full TimeShenzhen, Guangdong, China1d ago
-
Mid-level Full TimeShenzhen, Guangdong, China1d ago
-
Principal Specialist, AI Application CNY 360K-600KAgile | Angular | Cloud Computing | Enterprise Integration | Generative AISenior-level Full TimeCN-OCG International Center, Cheng Du, China1d ago
-
Senior-level Full TimeCN-OCG International Center, Cheng Du, China1d ago
-
具身智能-强化学习(灵巧操作方向) 实习生 CNY 25K-37KActor-critic | Diffusion Models | Distributed Training | Embodied intelligence | Flow matchingEntry-level Full Time Internship深圳1d ago
-
DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine LearningMid-level Full Time上海2d ago
-
算法工程师-大模型数据方向 CNY 240K-360KAutomated Evaluation | Clustering | Corpus Synthesis | Data Augmentation | Data GovernanceSenior-level Full Time上海2d ago
-
数据开发工程师(Ai知识方向) CNY 180K-300KContent governance | Data Governance | Data Quality | Data Quality Metrics | ETLMid-level Full Time上海2d ago
-
Mid-level Full Time上海2d ago
-
Senior-level Full Time上海2d ago
-
Senior-level Full Time上海2d ago