Generative AI - ML System Engineering
Tasks
- Build and scale high-throughput 3D data pipelines
- Build data loading framework and libraries
- Build end-to-end machine learning framework for 3D
- Debug and monitor hardware platform performance
- Design training pipelines for pretraining and fine-tuning
- Develop inference pipelines for diffusion models
- Implement custom operators with CUDA
- Implement custom operators with Triton
- Optimize distributed model training across GPUs
- Optimize models using compilation
- Optimize models using fusion
- Optimize models using quantization
Perks/Benefits
Skills/Tech-stack
C++ | CUDA | Compilation | Data Pipeline | Diffusion Models | Distributed Training | Fusion | GPU Performance Optimization | JAX | Machine Learning | Model Parallelism | Performance Optimization | PyTorch | Python | Quantization | Tensor Programming | Triton
Education
N/A