Senior System Software Engineer - AI Performance and Efficiency Tools
Tasks
- Build debugging tools for memory and networking issues
- Build profiling and analysis tools for AI workloads
- Create benchmarking and simulation technologies for AI systems
- Partner with hardware architects to propose and improve features
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | C++ | CUDA | Deep learning | Distributed Systems | GPU Cluster | Job Scheduling | Kubernetes | Linux | NCCL | Networking | PyTorch | Python | Simulation | Slurm | Storage | TensorFlow