Deep Learning Performance Architect
Tasks
- Analyze deep learning workloads
- Collaborate across teams on next gen deep learning hardware and software
- Define hardware and software performance measurements
- Develop fastest GPU kernels
- Optimize deep learning inference performance
- Prototype performance opportunities
Perks/Benefits
- N/A
Skills/Tech-stack
AI Compiler | C# | C++ | CUDA | Deep learning | GPU | JAX | Kernel development | MLIR | Machine Learning | Performance Tuning | PyTorch | TensorFlow | TensorRT
Education
Related jobs
-
Senior-level Full Time北京6h ago
-
GenAI Software Architect CNY 240K-480KAutogen | Bayesian analysis | Chroma | Deep learning | EmbeddingsSenior-level Full TimeCHN - Minhang, China2d ago
-
Senior Manager, AI Infrastructure CNY 435K-500KAI Agents | Carbon Aware Scheduling | Cluster management | DeepSpeed | Distributed TrainingSenior-level Full TimeShanghai Jing'An Office, China8d ago
-
JMP-Chief AI Software Technologist (BCSC) CNY 240K-390KA2A | Agent systems | CI/CD | Data Governance | Data ModelingExecutive-level Full TimeWuxi, Jiangsu, China8d ago
-
Senior Manager, Solution Architect CNY 300K-480KAI architecture | Agentic Workflows | Algorithm Integration | Architecture governance | Artificial IntelligenceSenior-level Full TimeAIA ED (Shanghai) Hongkou, China24d ago
-
数据挖掘平台工程架构开发工程师 CNY 180K-420KAlerting | Automated Deployment | CI/CD | Database Design | Distributed SystemsMid-level Full Time北京、上海、苏州24d ago
-
Autonomous Vehicles | C# | C++ | CPU | Code optimizationSenior-level Full TimeChina, Shanghai30d ago
-
AWS | Agentic AI | Apache Iceberg | Azure | CI/CDEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysSenior-level Full TimeHangzhou, China1mo ago
-
Information Technology Manager (Data Analytics) HKD 360K-612KAlteryx | Amazon Web Services | Artificial Intelligence | Big Data | Business IntelligenceMid-level Full TimeHong Kong, China1mo ago
-
Senior Solutions Architect - KV Cache and AI Storage CNY 460K-600KBluefield | CMX | Caching | Cassandra | CephSenior-level Full TimeChina, Beijing1mo ago
-
Solutions Architect - Top AI Labs CNY 435K-500KArtificial Intelligence | C++ | Computer Systems | Data Structures | Distributed ComputingSenior-level Full TimeChina, Beijing1mo ago
-
Deep Learning Solution Architect CNY 337K-490KCUDA | Distributed Training | Evaluation Pipelines | Experiment Management | Language ModelsSenior-level Full TimeChina, Beijing1mo ago
-
Agentic Inference | CUDA | Distributed Training | Docker | GPU ComputingSenior-level Full TimeChina, Beijing1mo ago
-
Senior Deep Learning Solution Architect CNY 367K-490KC++ | Caching | Computer Architecture | Data Structures | Data transferSenior-level Full TimeChina, Beijing1mo ago
-
Agentic AI | Artificial Intelligence | GPU Computing | Generative AI | Human FeedbackSenior-level Full TimeChina, Shenzhen1mo ago
-
Solution Architect - Top AI Labs CNY 435K-500KAIGC | C++ | CUDA | Cloud Computing | Computer VisionSenior-level Full TimeChina, Beijing1mo ago
-
GenAI Software Architect CNY 240K-480KAutogen | Bayesian analysis | Chroma | Deep learning | Edge ComputingSenior-level Full TimeCHN - Minhang, China1mo ago