Miclaw - Large Model Training and Inference Intern
Tasks
- Collaborate on model-chip co-design for inference systems
- Evaluate and compare model performance on edge scenarios
- Optimize inference with quantization, KV cache, and attention improvements
- Reproduce large language model inference optimization techniques
- Research efficient model architectures for limited compute and memory
- Transfer research results into production engineering implementations
Perks/Benefits
- N/A
Skills/Tech-stack
Attention | C++ | CUDA | Compiler optimization | High-Performance Computing | KV cache | Model Compression | Operator optimization | Parallel Computing | Python | Quantization
Education