Find jobs in AI/ML, Data Science and Big Data
31 results
for Tensor Parallelism
(Skill/Tech stack)
-
Data parallelism | Diffusion Models | Efficient Attention | Expert parallelism | FlaxSenior-level Full TimeMountain View, California, United States, New …19h ago
-
C++ | CUDA | Cluster scheduling | Compute scheduling | Deep learningSenior-level Full TimeIsrael, Tel Aviv6d ago
-
具身世界模型推理INFRA工程师 - XiaomiRobotics CNY 240K-480KCFG Parallelization | Diffusion Models | Expert parallelism | FP8 Quantization | Inference OptimizationSenior-level Full Time北京6d ago
-
Async execution | Batch inference | C++ | CUDA | CachingEntry-level Full TimeSan Francisco, California, United States8d ago
-
Software Engineer, Systems ML USD 141K-208KC plus plus | CUDA | Co-design | Compiler optimization | Deep learningSenior-level Full TimeBellevue, WA | Menlo Park, CA …12d ago
-
Attention Mechanisms | Batching | CUDA | Data parallelism | Distributed SystemsSenior-level Full TimeSan Jose, California, United States13d ago
-
Attention Mechanisms | Data parallelism | Deep learning | Distributed Training | Language ModelsSenior-level Full TimeSan Jose, California, United States13d ago
-
AI Engineer EUR 60K-80KAWQ | AWS | Agent SDK | CI/CD | CUDACareer growth opportunities | Permanent employment | Remote work optionMid-level Full TimeRemote - Paris, France R13d ago
-
Staff Software Engineer, AI Runtime USD 190K-265KAlgorithms | Automatic Recovery | Checkpointing | Collective communication | Data StructuresSenior-level Full TimeMountain View, California; San Francisco, California14d ago
-
LLM Inference Frameworks and Optimization Engineer USD 160K-230KC++ | CUDA | CUDA graph | Cluster scheduling | CompilerEquity | Health insuranceMid-level Full TimeSan Francisco, Singapore, Amsterdam14d ago
-
Senior Inference Engineer, AIConfigurator for Dynamo USD 184K-356KBatching | Distributed Systems | Expert parallelism | GPU Computing | High PerformanceEquity | Health benefits | Hybrid workSenior-level Full TimeUS, CA, Santa Clara, United States18d ago
-
Architecture Search | C++ | CUDA | Computer Vision | Deep learningSenior-level Full TimeUS, CA, Santa Clara, United States19d ago
-
AI Engineer USD 100K-135KAWQ | AWS | AWS EC2 | Agent Frameworks | CI/CD401k match | Health insurance | Learning and development stipend | Paid parental leave | Paid time offMid-level Full TimeRemote USA - In Tandem R19d ago
-
Senior Software Engineer, AI Runtime USD 160K-225KAlgorithms | Checkpointing | Collective communication | Data Structures | Data parallelismSenior-level Full TimeMountain View, California; San Francisco, California21d ago
-
Staff Compiler Engineer - PyTorch + Kernel DSLPLATE USD 163K-253KAutotuning | Collective Primitives | Cost Based Compilation | Custom ISA | Cutlass401k | Adoption support stipend | Charitable giving match | Fertility care stipend | Flexible work environmentSenior-level Full TimeSan Jose, California, United States27d ago
-
Data/AI Engineer Intern SGD 40K-57KAI Job Scheduling | Automated testing | C++ | Checkpointing | DeepSpeedEntry-level Full Time InternshipSingapore-CapitaSky28d ago
-
Senior AI Engineer USD 209K-275KA/B | A/B Testing | Autoscaling | B testing | BashFour days in office | Hybrid work arrangement | Telecommuting one day per weekSenior-level Full TimeSan Jose (CA), United States1mo ago
-
Engineering Manager, Model Inference USD 220K-270KAPIs | Attention Mechanism | Batching | Distributed Systems | Docker401k matching | Commuter benefits | Flexible PTO | Flexible spending accounts | Generous time offMid-level Full TimeSF Office1mo ago
-
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism100 percent remoteSenior-level Full TimeRemote job R1mo ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KComputer Vision | Diffusion Models | Edge Computing | Expert parallelism | Flash AttentionRemote workSenior-level Full TimeRemote job R1mo ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KCompute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelismEnglish communication support | Remote workSenior-level Full TimeRemote job R1mo ago
-
Diffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash Attention100 percent remote | Worldwide remoteSenior-level Full TimeRemote job R1mo ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KCompute Shaders | Diffusion Models | Distributed Inference Systems | Distributed inference | Edge ComputingRemote workSenior-level Full TimeRemote job R1mo ago
-
Software Engineer, Inference - Multi Modal USD 295K-555KDistributed Systems | GPU | High Throughput | Inference | Language ModelsEntry-level Full TimeSan Francisco1mo ago
-
Activation checkpointing | Attention Mechanisms | CUDA | Collective operations | Data parallelismSenior-level Full TimeMountain View, California; San Francisco, California1mo ago
-
Senior Software Engineer, CUDA Deep Learning Systems USD 184K-356KC++ | CUDA | CUDA kernel | CUDA kernel optimization | Computer ArchitectureEquity | Health benefits | Paid time offSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Senior Deep Learning Frameworks CUDA Software Engineer USD 184K-356KAI compilers | C++ | CUDA | Distributed machine learning | HPC communicationSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
API Integration | Agent systems | Benchmarking | Computer Vision | Data DriftBare Metal GPU Cluster Access | Company bike leasing | Company events | Employer Subsidy | Flexible work hoursSenior-level Full TimeHeidelberg1mo ago
-
Software Engineering Manager, LLM Training USD 170K-277KCUDA | Containerization | Context Parallelism | Data I/O | Data parallelismEntry-level Full TimeMountain View, CA, United States1mo ago
-
AWQ | Audio codecs | Audio streaming | Autoscaling | Chunked prefill401k matching | Annual offsites | Dental coverage | Employer-paid training | Healthcare benefitsMid-level Full TimeSan Francisco, CA1mo ago
-
Research Engineer, Training & Inference USD 200K-450KC++ | CUDA | Cutlass | Distributed Training | FSDP401k matching | Employer-paid health insurance | Health Savings Account (HSA) | Unlimited PTOEntry-level Full TimePalo Alto1mo ago