Find jobs in AI/ML, Data Science and Big Data
35 results
for Megatron-LM
(Skill/Tech stack)
-
CUDA | DeepSpeed | Distributed Training | FSDP | Gradient CheckpointingEntry-level Full TimeBeijing, Beijing, China12d ago
-
Senior Machine Learning Engineer – LLMs EUR 62K-90KAccelerate | Axolotl | BF16 | DPO | Data DeduplicationAutonomy | Hybrid work model | Professional growth | Top-spec equipmentSenior-level Full TimeNetherlands - Amsterdam13d ago
-
Senior-level Full TimeNetherlands - Amsterdam13d ago
-
Entry-level Full Time深圳、北京、上海17d ago
-
Applied Scientist 5 INR 2475K-4500K3D Reconstruction | Adapters | CLIP | Computer Vision | ControlNetSenior-level Full TimeBangalore, India R19d ago
-
Applied Scientist 5.5 INR 2475K-4500K3D Reconstruction | Adapters | CLIP | Computer Vision | ControlNetSenior-level Full TimeBangalore, India R19d ago
-
Data/AI Engineer Intern SGD 40K-57KAI Job Scheduling | Automated testing | C++ | Checkpointing | DeepSpeedEntry-level Full Time InternshipSingapore-CapitaSky21d ago
-
Senior-level Full Time上海21d ago
-
Ai 院--多模态团队--多模态理解算法研究员-强化学习方向 CNY 240K-480KDPO | Data Preprocessing | Data cleaning | DeepSpeed | Distributed TrainingSenior-level Full Time北京 R21d ago
-
AI院--训练Infra工程师 CNY 180K-300KComputer Vision | Distributed Training | Language Models | Language Processing | Large Language ModelsMid-level Full Time北京21d ago
-
Research Scientist - LLM Training System as a Service - Global Frontier Tech Recruitment Program - 2027 Start (PhD) USD 212K-450KCUDA | Distributed Systems | GPU Performance | Language Models | Large Language ModelsEntry-level Full TimeSan Jose, California, United States25d ago
-
Accelerate | Autoregressive models | Custom Training Loops | DeepSpeed | Denoising DiffusionPermanent employmentSenior-level Full TimeMontreal, Quebec, Canada28d ago
-
Senior Quantum AI Research Scientist, Applied Research USD 192K-304KAdapters | CUDA | Calibration | Deep learning | Distributed TrainingSenior-level Full TimeUS, WA, Redmond, United States1mo ago
-
Solutions Architect - AI Technology Center, Foundation Model Building KRW 65000K-90000KAI model | AI model development | CUDA | Debugging | Fine TuningSenior-level Full TimeKorea, Seoul, Korea, Republic of1mo ago
-
Activation checkpointing | Attention Mechanisms | CUDA | Collective operations | Data parallelismSenior-level Full TimeMountain View, California; San Francisco, California1mo ago
-
Member of Technical Staff, AI Engineering USD 162K-297KAutogen | BF16 | C++ | CI/CD | CUDAIncome Protection for Illness or Injury | Medical, dental, vision plans | Paid Holidays | Paid family leave | Paid time offSenior-level Full TimeBoise, ID - Main Site, United …1mo ago
-
Solutions Architect, Pre-training and Post-training KRW 65000K-90000KArtificial Intelligence | Debugging | Deep learning | Fine Tuning | GPU ArchitectureSenior-level Full TimeKorea, Seoul, Korea, Republic of1mo ago
-
AI Platform Engineer INR 1500K-2500KAutomated Evaluation | CI/CD | CUDA | Continuous Checkpointing | Continuous batchingMid-level Full TimeBangalore, India1mo ago
-
Alignment | Benchmark design | Constitutional AI | Continued Pretraining | Data CurationSenior-level Full TimeDublin, CA (HQ)1mo ago
-
Alignment | Benchmark design | DPO | Data Curation | Data DeduplicationSenior-level Full TimeIndia/Bengaluru1mo ago
-
Constitutional AI | Continued Pretraining | DPO | Data Curation | DeduplicationSenior-level Full TimeBrazil/Remote R1mo ago
-
Senior Applied AI Researcher (India) INR 2500K-4500KArtificial Intelligence | DPO | Data parallelism | DataLoader | DeepSpeedSenior-level Full TimeIndia/Bengaluru1mo ago
-
Senior Applied AI Researcher (Brazil) BRL 271K-370KCI/CD | DPO | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeBrazil/Remote R1mo ago
-
Senior Applied AI Researcher (Dublin, CA) USD 190K-300KAutomated testing | Continuous Evaluation | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeDublin, CA (HQ)1mo ago
-
Senior Principal Machine Learning Engineer (Fulfilment) SGD 182K-240KDecision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic ModelsBirthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore1mo ago
-
Senior Principal Data Scientist (Fulfilment) SGD 224K-252KDecision Processes | DeepSpeed | Distributed Training | Dynamic Models | FSDPBirthday leave | Flexible work arrangements | Life insurance | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore1mo ago
-
Research Scientist - Multimodal Representation Learning USD 200K-300KCLIP | Computer Vision | Contrastive Learning | DINOv2 | DeepSpeedMid-level Full TimeFremont, California, United States1mo ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeHong Kong1mo ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeSingapore1mo ago
-
3D Parallelism | C++ | CUDA | DeepSpeed | InfinibandEntry-level Full TimeAustralia1mo ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeChina1mo ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeBoston, USA1mo ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeSeattle, USA1mo ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeOregon, USA1mo ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeSan Francisco Bay Area, USA1mo ago