AI Frameworks Software Engineer – Model Compression Algorithm
Tasks
- Develop model compression product
- Implement compression for text to image video models
- Implement quantization for large language models
- Optimize inference and finetuning acceleration
- Optimize model compression for Intel AI platform
- Research compression techniques
- Research quantization techniques
- Track efficient model deployment research directions
Perks/Benefits
Skills/Tech-stack
C++ | CPU architecture | Deep learning | Fine Tuning | GPU Computing | Inference Optimization | Language Models | Large Language Models | Model Compression | Neural Networks | Pruning | Python | Quantization
Education
Roles
Related jobs
-
Entry-level Full Time北京 R7h ago
-
多模态大模型算法工程师(偏Llm) CNY 500K-500KComputer Vision | Deep learning | Fine Tuning | Language Models | Large Language ModelsMid-level Full Time北京7h ago
-
Entry-level Full Time北京 R8h ago
-
Senior-level Full Time北京9h ago
-
Entry-level Full Time北京 R9h ago
-
机器人VLA算法研究员 - XiaomiRobotics CNY 500K-500KDeep learning | Diffusion Models | Language Models | Machine Learning | Mixture of ExpertsEntry-level Full Time北京9h ago
-
Mid-level Full Time北京 R9h ago
-
具身世界模型推理INFRA工程师 - XiaomiRobotics CNY 240K-480KCFG Parallelism | Diffusion Models | Expert parallelism | FP8 Quantization | Multi Token PredictionSenior-level Full Time北京9h ago
-
Mid-level Full Time北京 R9h ago
-
ANSYS | APDL | C Programming | Design of Experiments | DynamicsNone Full TimeWuhan, Hubei, China22h ago
-
Senior-level Full TimeChina, Shanghai1d ago
-
IT Dept. AI Engineer_Application (上海) CNY 240K-360KAI machine learning | Alibaba Cloud | Cloud Applications | Database Design | Language ModelsMid-level Full TimeAnting, CN, 2018051d ago
-
Sr Machine Learning Engineer III CNY 240K-480KAPI Design | AWS | Agent Frameworks | Azure DevOps | CI/CDAdoption leave | Annual Medical Checkup | Family leave | Flexible benefits | Life insuranceSenior-level Full TimeChina-Shanghai (Tianshan-W-Rd)1d ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Distributed Systems | FP8 | FasterTransformer | Flash AttentionOn-site workEntry-level Full Time InternshipCHN - Minhang, China1d ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batchingOn-site workEntry-level Full Time InternshipCHN - Minhang, China1d ago
-
AI Software Engineer Intern CNY 28K-50KAWQ | Cache optimization | DINOv2 | DeepSpeed | Diffusion ModelsEntry-level Full Time InternshipCHN - Minhang, China1d ago
-
Entry-level Full Time InternshipCHN - Minhang, China1d ago
-
Ai多模态研究实习生(有留用机会) CNY 25K-37KClustering | DBSCAN | Data Visualization | Embeddings | FaissMentorship | Real world production data exposure | Return OfferEntry-level Internship广州、北京1d ago
-
Recommendation Algorithm Engineer CNY 25K-37KDeep learning | Java | Recommendation Systems | SpringTeam collaborationEntry-level InternshipGuangzhou1d ago
-
Mid-level Full Time深圳1d ago
-
Mid-level Full Time上海1d ago
-
Ai多模态研究实习生(有留用机会) CNY 25K-37KAttention | Clustering | DBSCAN | Data Visualization | EmbeddingsConversion to full time offer | Mentorship | On-the-job trainingEntry-level Internship广州、北京1d ago
-
Entry-level Full Time广州1d ago
-
Ai多模态研究实习生(有留用机会) CNY 25K-37KAttention Mechanisms | Clustering | DBSCAN | Data Analysis | Data ProcessingCareer growth | Full-time conversion opportunity | Mentorship | Real world production dataEntry-level Internship广州、北京1d ago
-
Senior-level Full Time上海2d ago