AI工程师-Agent Infra & LLMOps 方向(武汉)
Tasks
- Build agent evaluation pipeline for ab testing
- Build agent tool server
- Containerize and maintain vector database middleware
- Create code execution sandbox with Docker or MicroVM
- Deploy fine tuned llm inference service
- Enforce cpu mem limits and network controls
- Generate and manage OpenAPI JSON Schema
- Implement serverless faas for agent tools
- Manage runtime dependencies for python and sql
- Optimize inference engine for low ttft
Perks/Benefits
- N/A
Skills/Tech-stack
AutoGPT | Docker | Firecracker | GRPC | GVisor | Go | HTTP2 | Inference Server | JSON Schema | Knative | Kubernetes | Langchain | Langsmith | MicroVM | Microservices | Milvus | Ollama | OpenAPI | OpenFaaS | Python | Qdrant | Serverless | TGI | TensorRT-LLM | Triton Inference | Triton Inference Server | VLLM
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Roles
AI | AI Engineer | DevOps | DevOps Engineer | Engineer | LLMOps Engineer
Related jobs
-
具身智能算法实习生 (Manipulation) CNY 25K-37KCLIP | Computer Vision | Deep learning | Diffusion Models | Fine TuningEntry-level Internship深圳13h ago
-
Entry-level Internship深圳13h ago
-
Ai 应用工程师实习生 CNY 36K-37KAgent Framework | Langchain | Language Models | Large Language Models | Prompt engineeringEntry-level Internship北京13h ago
-
Research Intern- Deep Learning USD 84K-120K3D Object Detection | 3D Object Tracking | C++ | CUDA | Computational GeometryFully onsiteEntry-level Internship费利蒙13h ago
-
Staff/Senior Staff Software Engineer - Perception USD 245K-350KAI | C++ | Computer Vision | Deep learning | Optimization401k | Dental insurance | Family leave | Free food & snacks | Health care planSenior-level Full Time费利蒙13h ago
-
(Senior) Software Engineer, Deep Learning USD 140K-280KAlgorithms | C++ | CUDA | Code optimization | Computational GeometryFamily leave | Free food and snacks | Health care plan | Life insurance | Long-term disabilitySenior-level Full Time费利蒙13h ago
-
Mid-level Full Time武汉、北京、上海14h ago
-
数据管理工程师 CNY 240K-420KCassandra | Database Architecture | Distributed database | Distributed database architecture | DorisMid-level Full Time杭州14h ago
-
Ai平台系统研发实习生 CNY 50K-50KAPI Integration | Authentication and Authorization | DNS | Distributed Storage | DockerEntry-level Internship深圳14h ago
-
AI工程师-Agent Memory & RAG 方向(成都) CNY 240K-480KBERT | Chroma | Cross-Encoder | Embedding Models | FaissSenior-level Full Time成都 R14h ago
-
AI工程师-Agent Infra & LLMOps 方向(成都) CNY 180K-360KAccess Control | AutoGPT | CPU isolation | Docker | FirecrackerNone Full Time成都14h ago
-
Entry-level Full Time北京14h ago
-
AI工程师-Agent Memory & RAG 方向(武汉) CNY 240K-480KAlgorithms | BERT | Chroma | Cross-Encoder | Data StructuresSenior-level Full Time武汉 R14h ago
-
AI工程师-Agent Memory & RAG 方向(北京) CNY 240K-480KBERT | Chroma | Cross-Encoder | Embedding Models | FaissSenior-level Full Time北京 R14h ago
-
IT Intern - LLM Application Engineer CNY 25K-37KAPI Optimization | Agent systems | Autogen | Claude Code | CursorEntry-level Full Time InternshipBeijing, Beijing, China1d ago
-
Senior DGX Cloud AI Infrastructure Software Engineer CNY 160K-240KAI Inference | AI Training | APIs | C# | C++Senior-level Full TimeChina, Shanghai1d ago
-
AI Engineer-Intern CNY 25K-37KData Visualization | LLM Deployment | Langchain | Language Processing | Natural LanguageTrainingEntry-level Full Time InternshipSuzhou - Industrial Park, China1d ago
-
Entry-level Full TimeShanghai, Shanghai, China1d ago
-
Ai算法工程师-汽车专项-实习 CNY 25K-37KAutoml | C# | C++ | Computer Vision | Data ProcessingInternship | Mentorship | Real-world projectsEntry-level Internship南京1d ago
-
Machine Learning Engineer (Training Optimization) CNY 144K-240KCUDA | DeepSpeed | Diffusion Models | Distributed Training | FSDPEntry-level Full TimeBeijing, Beijing, China1d ago
-
Bash | Data Processing | Docker | GCP | LinuxAsynchronous culture | Entrepreneurial team | Friendly work environment | Hands-off managementMid-level Full TimeShenzhen, China1d ago
-
Mid-level Full Time北京1d ago
-
Intern, Agentic AI Researcher (007358) CNY 50K-50KAgentic AI | Artificial Intelligence | Claude | GitHub Copilot | Language ProcessingEntry-level InternshipNANJING,CN,2100002d ago
-
Benchmarking | C++ | CUDA | Deep learning | Distributed SystemsSenior-level Full TimeChina, Shanghai2d ago
-
None Full Time深圳2d ago