AI Frameworks Software Engineer – Model Compression Algorithm
Tasks
- Develop model compression products
- Implement compression for text-to-image/video models
- Implement quantization for large language models
- Accelerate inference and fine-tuning
- Optimize model compression for Intel AI platforms
- Research compression techniques
- Research quantization techniques
- Track efficient model deployment research directions
Skills/Tech-stack
C++ | CPU architecture | Deep learning | Fine Tuning | GPU Computing | Inference Optimization | Language Models | Large Language Models | Model Compression | Neural Networks | Pruning | Python | Quantization