Find jobs in AI/ML, Data Science and Big Data
28 results
for Pipeline parallelism
(Skill/Tech stack)
-
C++ | CUDA | CUDA profiling | Collective communication | Communication Compute OverlapSenior-level Full TimeIsrael, Tel Aviv R3d ago
-
Principal High-Performance LLM Training Engineer USD 272K-431KActivation checkpointing | Benchmarking | CUDA | Communication and Computation Overlap | CompilersBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States3d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeHong Kong6d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeSingapore6d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeChina6d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeBoston, USA6d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeSeattle, USA6d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeOregon, USA6d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeSan Francisco Bay Area, USA6d ago
-
Senior Software Engineer, RL Post-Training Frameworks USD 184K-356KActor Based Programming | C# | C++ | Consistency models | DPOComprehensive benefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States8d ago
-
Senior Machine Learning Engineer USD 170K-240KAWS | Azure | Debugging | Distributed Computing | FSDPSenior-level Full TimeGM Automation - Sunnyvale - GM …11d ago
-
Generative AI Executive Director USD 150K-210KComputer Vision | DAG | Data parallelism | Deep learning | DeepSpeedBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersExecutive-level Full TimeNew York, NY, United States14d ago
-
Data parallelism | Deep learning | Distributed Training | Model Acceleration | Model BenchmarkingSenior-level Full TimeSan Jose, California, United States14d ago
-
Computational optimization | Data parallelism | Deep learning | Distributed Training | Generative AIMid-level Full TimeSan Jose, California, United States14d ago
-
Communication optimization | Data parallelism | Deep learning | Distributed Training | Generative AISenior-level Full TimeSeattle, Washington, United States14d ago
-
Benchmarking | CUDA | Data parallelism | Distributed Training | Model ParallelismSenior-level Full TimeSan Jose, California, United States14d ago
-
Benchmarking | CUDA | Communication optimization | Data parallelism | Deep learningMid-level Full TimeSeattle, Washington, United States15d ago
-
Data parallelism | Deep learning | Distributed Training | GPU Acceleration | Model BenchmarkingMid-level Full TimeSan Jose, California, United States15d ago
-
Senior Software Engineer, Machine Learning, Core ML USD 174K-252KC++ | Compiler optimization | Data Processing | Data parallelism | DebuggingSenior-level Full TimeMountain View, CA, USA16d ago
-
Principal Deep Learning Communication Architect USD 272K-431K3D Parallelism | CUDA | Context Parallelism | Data parallelism | DeepSpeedSenior-level Full TimeUS, CA, Santa Clara, United States17d ago
-
Senior Software Engineer, AI Inference CAD 135K-220KC++ | Chunked prefill | Continuous batching | Cutlass | DockerSenior-level Full TimeCanada, Toronto21d ago
-
Research Engineer, Infrastructure USD 255K-400KC++ | Checkpointing | Compute efficiency | Data Pipelines | Data parallelismSenior-level Full TimeSan Francisco Bay Area22d ago
-
Generative AI - Vice President GBP 144K-181KComputer Vision | DAG | Data parallelism | DeepSpeed | Distributed TrainingExecutive-level Full TimeLONDON, LONDON, United Kingdom22d ago
-
AI acceleration | Communication optimization | Data parallelism | Deep learning | Distributed TrainingSenior-level Full TimeSeattle, Washington, United States24d ago
-
Sr. Large Model Training Acceleration Engineer USD 194K-359KBenchmarking | Data parallelism | Deep learning | Distributed Training | Inference OptimizationSenior-level Full TimeSan Jose, California, United States27d ago
-
Senior Engineering Manager, AI Runtime USD 228K-297KCheckpointing | Cluster Lifecycle Management | Cluster lifecycle | DeepSpeed | Distributed TrainingSenior-level Full TimeMountain View, California; San Francisco, California29d ago
-
Software Engineering Manager, LLM Training USD 170K-277KCUDA | CUDA profiling | Containerization | Context Parallelism | Data I/OHealth and wellness programs | Hybrid work | Time away from workEntry-level Full TimeMountain View, CA, United States1mo ago
-
Software Engineering Manager, LLM Training USD 170K-277KCUDA | Containerization | Data parallelism | Distributed Systems | DockerFlexible-hybrid work | Health and wellness programs | Time offEntry-level Full TimeMountain View, CA, United States1mo ago