AI Software Engineer Intern
CHN - Minhang, China
CNY 38K-50K (estimate) Entry-level Full Time Internship
Tasks
- Build end to end inference runtime and scheduling
- Develop MoE inference routing and load balancing
- Develop and optimize GPU kernels
- Fuse operators into efficient GPU kernels
- Implement flash attention
- Implement optimized LLM inference techniques
- Implement quantization for model optimization
- Implement speculative decoding
- Optimize KV cache memory and management
- Optimize MoE expert parallelism and sharding
- Optimize NCCL communication
- Profile and tune GPU memory access patterns
- Scale distributed inference across GPUs and nodes
- Study LLM inference research
Perks/Benefits
Skills/Tech-stack
CUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batching | FP8 | FasterTransformer | Flash Attention | GPU Architecture | GPU Kernels | GPU Programming | INT8 | KV cache | Low-bit quantization | MOE | Memory Optimization | Mixture of Experts | NCCL | Operator fusion | Paged Attention | Parallel Computing | Performance Profiling | PyTorch | Python | Quantization | Speculative decoding | TensorRT-LLM | Transformers | Triton | VLLM
Education
Related jobs
-
None Full Time深圳23h ago
-
Senior-level Full TimeShanghai, China1d ago
-
Sr. Consultant - Data Scientist CNY 360K-540KAgile | Computer Vision | Containerization | Data Governance | Data ScienceEmployee assistance program | Mindfulness programs | On demand digital course library | Personalized wellbeing programs | Volunteer matching programSenior-level Full TimeChina Shanghai (Hongmei)1d ago
-
AI Software Engineering Intern CNY 60K-60KAI Agents | Agent systems | Data Pipelines | Deep learning | Fine TuningCareer development opportunities | On-site work environmentEntry-level Full Time InternshipCHN - Beijing, China1d ago
-
Mid-level Full Time北京 R1d ago
-
大模型算法研究员-MiMo CNY 500K-500KActive Learning | C++ | Curriculum learning | Data Generation | Deep learningEntry-level Full Time北京1d ago
-
Miclaw-端云协同调度专家 (Hybrid AI Architect) CNY 240K-480K5G | Cloud API | Consistency protocols | Data Compression | Data PrivacyHybrid workSenior-level Full Time北京 R1d ago
-
AI基础设施研发工程师(Sandbox / 容器化)-MiMo CNY 180K-420KAppArmor | Argo Workflows | CPU resource scheduling | Cgroup | ContainerdMid-level Full Time北京 R1d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitEntry-level Internship深圳2d ago
-
Ai应用工程师(提效方向 0-1) CNY 50K-50KAI Programming | AI Programming Tools | API Integration | JavaScript | Language ProcessingEngineering resource support | Hands-on product development | Model and compute support | Real world usageEntry-level Internship深圳2d ago
-
Entry-level Full Time北京、上海2d ago
-
AGI 服务端资深工程师-Talkie&星野 CNY 180K-300KData Engineering | Dify | Distributed Systems | Go | Inference OptimizationMid-level Full Time北京、上海2d ago
-
Feature Engineering | Machine Learning | Python | Quantitative FinanceAccess to proprietary platforms | Collaborative work environment | Decision-making participation | Mentorship | Potential full-time conversionEntry-level InternshipBeijing, Beijing, China2d ago
-
Mid-level Full TimeBeijing, China2d ago
-
Services Operations Analytics Intern CNY 50K-50KData Visualization | Data analytics | Excel | Forecasting | Outage ManagementCross-departmental collaboration | Hands-on project experience | Networking opportunitiesEntry-level Full Time InternshipShanghai, China2d ago
-
Services Operations Analytics Intern CNY 50K-50KData Analysis | Data Visualization | Excel | Forecasting | Pivot TablesCross-departmental collaboration | Hands-on project experience | Network with professionalsEntry-level Full Time InternshipShanghai, China2d ago
-
Senior-level Full TimeWuxi, Jiangsu, China3d ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | Data Annotation | Fine Tuning | Human Feedback | LLM AgentEntry-level Internship深圳4d ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | Data Modeling | LLM Agent | Language Models | Large Language ModelsInternship experience | Mentorship | Research collaborationEntry-level Internship深圳4d ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KEmbodied AI | Fine Tuning | Human Feedback | LLM Agents | Language ModelsEntry-level Internship深圳4d ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | Fine Tuning | Human Feedback | LLM Agent | Language ModelsEntry-level Internship深圳5d ago
-
AI Agent Engineer(Embededd Software Tooling)_ETAS CNY 240K-480KAgent architecture | C++ | Deep learning | Edge AI | Embedded SoftwareSenior-level Full TimeShanghai, Shanghai, China5d ago
-
JMP_ AI Operation Excellence Expert(VM) CNY 240K-480KAI Agents | API | Cloud Native | Data Governance | Digital TwinSenior-level Full TimeSuzhou, Jiangsu, China R5d ago
-
AI Application Development Engineer CNY 180K-300KAgent systems | Artificial Intelligence | Computer Vision | Deep learning | Image ProcessingEntry-level Full TimeShenzhen, Guangdong Province, China5d ago
-
APIs | AWS | Agentic Workflows | Azure | Cloud platformSenior-level Full TimeChina, Shanghai5d ago