LLM Application Architect
Tasks
- Build model gateway and adapter layer
- Build a reusable agent development framework, SDK, and components
- Collaborate cross-functionally to deliver products from requirements to launch
- Deploy highly available, observable services with canary releases
- Design AI Agent application architecture
- Develop the MCP, skill, and tool framework
- Develop core agent modules
- Engineer inference, planning, and decision capabilities for production services
- Implement concurrency control and rate limiting
- Implement error tolerance and cost governance
- Implement knowledge base services
- Implement task planning and execution engine
- Implement timeout handling, fallback, and degradation
- Integrate inference engines with model gateway and scheduling
- Monitor first-token and end-to-end latency
- Optimize agent pipeline performance and stability
- Track LLM, MLLM, and agent application trends
Perks/Benefits
- N/A
Skills/Tech-stack
Agent Frameworks | C++ | Caching | Canary Deployment | Concurrency Control | Coze | Dify | Execution Engine | Fallback mechanisms | Function Calling | Go | High Availability | Inference engine | Java | Kafka | LLM Inference | LangChain | LangGraph | Language Models | Large Language Models | LlamaIndex | MCP | Model Scheduling | Model gateway | MongoDB | MySQL | Observability | Python | Rate Limiting | Redis | Retrieval-Augmented Generation | SGLang | Streaming | Task planning | TensorRT | TensorRT-LLM | Timeout handling | Token Management | vLLM | Vector Databases | Vector Search
Education
N/A