On-Device Model Deployment Engineer
Tasks
- Create test cases for deployments
- Deploy deep learning models
- Deploy inference services across platforms
- Develop and optimize CUDA operators
- Develop deployment automation scripts
- Distill models
- Export and convert models for deployment
- Monitor deployed model performance
- Optimize deployment for heterogeneous compute platforms
- Optimize model performance on target hardware
- Package models as inference services
- Prune models
- Quantize models
- Research and implement model compression
- Tune inference for stability and consistency
Skills/Tech-stack
C++ | CUDA | DSP | Heterogeneous computing | Inference engine | Knowledge Distillation | Linux | Model Pruning | Model Quantization | NPU | ONNX | ONNX Runtime | OpenVINO | PyTorch | Python | TensorFlow | TensorRT