Find jobs in AI/ML, Data Science and Big Data
24 results for Tensor Parallelism (Skill/Tech stack)
-
Embodied World Model Inference Infra Engineer - XiaomiRobotics · CNY 240K-480K · CFG Parallelism | Diffusion Models | Expert parallelism | FP8 Quantization | Multi Token Prediction · Senior-level · Full Time · Beijing · 1d ago
-
Intern Researcher – AI Foundation Model Training · CAD 58K-104K · AI Agent | AI agent systems | Agent systems | Architecture Search | Computational Graph Optimization · Entry-level · Internship · Markham, Ontario, Canada · 2d ago
-
Senior Machine Learning Research Scientist · USD 200K-220K · AWS SageMaker | Apache Airflow | CUDA | Data parallelism | Distributed Training · 401k match | Medical/Dental/Vision insurance | Paid Holidays | Paid parental leave | Remote-first team · Senior-level · Full Time · Remote (United States) R · 2d ago
-
C++ | CUDA | CUDA profiling | Collective communication | Communication Compute Overlap · Senior-level · Full Time · Israel, Tel Aviv R · 3d ago
-
Principal High-Performance LLM Training Engineer · USD 272K-431K · Activation checkpointing | Benchmarking | CUDA | Communication and Computation Overlap | Compilers · Benefits | Equity · Senior-level · Full Time · US, CA, Santa Clara, United States · 3d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeed · Entry-level · Full Time · Hong Kong · 6d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeed · Entry-level · Full Time · Singapore · 6d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | Infiniband · Entry-level · Full Time · China · 6d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | Infiniband · Entry-level · Full Time · Boston, USA · 6d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeed · Entry-level · Full Time · Seattle, USA · 6d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeed · Entry-level · Full Time · Oregon, USA · 6d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | Infiniband · Entry-level · Full Time · San Francisco Bay Area, USA · 6d ago
-
Senior Software Engineer, RL Post-Training Frameworks · USD 184K-356K · Actor Based Programming | C# | C++ | Consistency models | DPO · Comprehensive benefits | Equity · Senior-level · Full Time · US, CA, Santa Clara, United States · 8d ago
-
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles · USD 184K-356K · C++ | CUDA | Cutlass | Efficient Attention | GPU Architecture · Senior-level · Full Time · US, CA, Santa Clara, United States · 10d ago
-
Senior Software Engineer, Machine Learning, Core ML · USD 174K-252K · C++ | Compiler optimization | Data Processing | Data parallelism | Debugging · Senior-level · Full Time · Mountain View, CA, USA · 16d ago
-
Principal Deep Learning Communication Architect · USD 272K-431K · 3D Parallelism | CUDA | Context Parallelism | Data parallelism | DeepSpeed · Senior-level · Full Time · US, CA, Santa Clara, United States · 17d ago
-
Senior Software Engineer, AI Inference · CAD 135K-220K · C++ | Chunked prefill | Continuous batching | Cutlass | Docker · Senior-level · Full Time · Canada, Toronto · 21d ago
-
Research Engineer, Infrastructure · USD 255K-400K · C++ | Checkpointing | Compute efficiency | Data Pipelines | Data parallelism · Senior-level · Full Time · San Francisco Bay Area · 22d ago
-
Software Engineer, Inference – AMD GPU Enablement · USD 295K-555K · CUDA | Collective communication | Distributed Systems | GPU Kernels | HIP · Mid-level · Full Time · San Francisco · 24d ago
-
Senior Engineering Manager, AI Runtime · USD 228K-297K · Checkpointing | Cluster Lifecycle Management | Cluster lifecycle | DeepSpeed | Distributed Training · Senior-level · Full Time · Mountain View, California; San Francisco, California · 29d ago
-
Software Engineering Manager, LLM Training · USD 170K-277K · CUDA | CUDA profiling | Containerization | Context Parallelism | Data I/O · Health and wellness programs | Hybrid work | Time away from work · Entry-level · Full Time · Mountain View, CA, United States · 1mo ago
-
Software Engineering Manager, LLM Training · USD 170K-277K · CUDA | Containerization | Data parallelism | Distributed Systems | Docker · Flexible-hybrid work | Health and wellness programs | Time off · Entry-level · Full Time · Mountain View, CA, United States · 1mo ago
-
Sr. Staff Software Engineer, AI Infra · USD 198K-326K · C++ | CUDA | DeepSpeed | Distributed Training | GNN · Senior-level · Full Time · Mountain View, CA, United States · 1mo ago
-
Senior Engineer 2: Inference Data Plane · USD 167K-209K · AI | Databases | Distributed Systems | GPU Optimization | GRPC · Benefits support | Educational courses | Equity compensation | Flexible time off | Reimbursement for training · Senior-level · Full Time · San Francisco R · 1mo ago