Find jobs in AI/ML, Data Science and Big Data
40 results for Megatron-LM (Skill/Tech stack)
-
Senior-level · Full Time · Shanghai · 1d ago
-
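AI Institute -- Multimodal Team -- Multimodal Understanding Algorithm Researcher - Reinforcement Learning Track · CNY 240K-480K · DPO | Data Preprocessing | Data cleaning | DeepSpeed | Distributed Training · Senior-level · Full Time · Beijing · R · 1d ago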
-
AI Institute -- Training Infra Engineer · CNY 180K-300K · Computer Vision | Distributed Training | Language Models | Language Processing | Large Language Models · Mid-level · Full Time · Beijing · 1d ago
-
Entry-level · Full Time · Beijing · R · 3d ago
-
Mid-level · Full Time · Beijing · R · 3d ago
-
Research Scientist - LLM Training System as a Service - Global Frontier Tech Recruitment Program - 2027 Start (PhD) · USD 212K-450K · CUDA | Distributed Systems | GPU Performance | Language Models | Large Language Models · Entry-level · Full Time · San Jose, California, United States · 5d ago
-
Accelerate | Autoregressive models | Custom Training Loops | DeepSpeed | Denoising Diffusion · Permanent employment · Senior-level · Full Time · Montreal, Quebec, Canada · 7d ago
-
Senior Quantum AI Research Scientist, Applied Research · USD 192K-304K · Adapters | CUDA | Calibration | Deep learning | Distributed Training · Senior-level · Full Time · US, WA, Redmond, United States · 12d ago
-
Machine Learning Engineer (Training Optimization) · CNY 144K-240K · CUDA | DeepSpeed | Diffusion Models | Distributed Training | FSDP · Entry-level · Full Time · Beijing, Beijing, China · 13d ago
-
Solutions Architect - AI Technology Center, Foundation Model Building · KRW 65000K-90000K · AI model | AI model development | CUDA | Debugging | Fine Tuning · Senior-level · Full Time · Korea, Seoul, Korea, Republic of · 13d ago
-
Activation checkpointing | Attention Mechanisms | CUDA | Collective operations | Data parallelism · Senior-level · Full Time · Mountain View, California; San Francisco, California · 19d ago
-
Member of Technical Staff, AI Engineering · USD 162K-297K · Autogen | BF16 | C++ | CI/CD | CUDA · Income Protection for Illness or Injury | Medical, dental, vision plans | Paid Holidays | Paid family leave | Paid time off · Senior-level · Full Time · Boise, ID - Main Site, United … · 19d ago
-
Solutions Architect, Pre-training and Post-training · KRW 65000K-90000K · Artificial Intelligence | Debugging | Deep learning | Fine Tuning | GPU Architecture · Senior-level · Full Time · Korea, Seoul, Korea, Republic of · 20d ago
-
AI Platform Engineer · INR 1500K-2500K · Automated Evaluation | CI/CD | CUDA | Continuous Checkpointing | Continuous batching · Mid-level · Full Time · Bangalore, India · 20d ago
-
Machine Learning Engineer 5 · INR 2500K-4500K · 3D Reconstruction | Adapters | CLIP | Computer Vision | ControlNet · Senior-level · Full Time · Bangalore, India · R · 21d ago
-
C++ | DeepSpeed | Differential Privacy | Distributed Training | Federated Learning · Disability benefits | Educational assistance | Employee discounts | Flexible work hours | Generous vacation and holidays · Senior-level · Full Time · Oak Ridge, TN, US, 37830 · 25d ago
-
Alignment | Benchmark design | Constitutional AI | Continued Pretraining | Data Curation · Senior-level · Full Time · Dublin, CA (HQ) · 28d ago
-
Alignment | Benchmark design | DPO | Data Curation | Data Deduplication · Senior-level · Full Time · India/Bengaluru · 28d ago
-
Constitutional AI | Continued Pretraining | DPO | Data Curation | Deduplication · Senior-level · Full Time · Brazil/Remote · R · 28d ago
-
Senior Applied AI Researcher (India) · INR 2500K-4500K · Artificial Intelligence | DPO | Data parallelism | DataLoader | DeepSpeed · Senior-level · Full Time · India/Bengaluru · 28d ago
-
Senior Applied AI Researcher (Brazil) · BRL 271K-370K · CI/CD | DPO | Data parallelism | Deep learning | DeepSpeed · Senior-level · Full Time · Brazil/Remote · R · 28d ago
-
Senior Applied AI Researcher (Dublin, CA) · USD 190K-300K · Automated testing | Continuous Evaluation | Data parallelism | Deep learning | DeepSpeed · Senior-level · Full Time · Dublin, CA (HQ) · 28d ago
-
Senior-level · Full Time · Taichung - AATT, Taiwan · 28d ago
-
Senior Principal Machine Learning Engineer (Fulfilment) · SGD 182K-240K · Decision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic Models · Birthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leave · Executive-level · Full Time · Singapore, Singapore · 1mo ago
-
Senior Principal Data Scientist (Fulfilment) · SGD 224K-252K · Decision Processes | DeepSpeed | Distributed Training | Dynamic Models | FSDP · Birthday leave | Flexible work arrangements | Life insurance | Medical insurance | Parental leave · Executive-level · Full Time · Singapore, Singapore · 1mo ago
-
Research Scientist - Multimodal Representation Learning · USD 200K-300K · CLIP | Computer Vision | Contrastive Learning | DINOv2 | DeepSpeed · Mid-level · Full Time · Fremont, California, United States · 1mo ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeed · Entry-level · Full Time · Hong Kong · 1mo ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeed · Entry-level · Full Time · Singapore · 1mo ago
-
3D Parallelism | C++ | CUDA | DeepSpeed | Infiniband · Entry-level · Full Time · Australia · 1mo ago
-
C++ | CUDA | Data parallelism | DeepSpeed | Infiniband · Entry-level · Full Time · China · 1mo ago
-
C++ | CUDA | Data parallelism | DeepSpeed | Infiniband · Entry-level · Full Time · Boston, USA · 1mo ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeed · Entry-level · Full Time · Seattle, USA · 1mo ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeed · Entry-level · Full Time · Oregon, USA · 1mo ago
-
C++ | CUDA | Data parallelism | DeepSpeed | Infiniband · Entry-level · Full Time · San Francisco Bay Area, USA · 1mo ago
-
Asynchronous programming | Asyncio | Deep learning | DeepSpeed | Distributed Training · Senior-level · Full Time · China, Shanghai · 1mo ago
-
Senior Software Engineer, RL Post-Training Frameworks · USD 184K-356K · Actor Based Programming | C# | C++ | Consistency models | DPO · Comprehensive benefits | Equity · Senior-level · Full Time · US, CA, Santa Clara, United States · 1mo ago
-
Agentic AI | Autogen | BF16 | Big Data | CI/CD · Senior-level · Full Time · Fab 10A, Singapore · 1mo ago
-
Full Time · Shenzhen, Beijing, Shanghai · 1mo ago
-
Full Time · Shenzhen, Beijing, Shanghai · 1mo ago
-
Entry-level · Full Time · Shenzhen, Beijing, Shanghai · 1mo ago