Find jobs in AI/ML, Data Science and Big Data
43 results
for Megatron-LM
(Skill/Tech stack)
-
Entry-level Full Time北京 R1d ago
-
Active Learning | Artificial Intelligence | Curriculum learning | Data selection | Deep learning401k retirement plan | Contributory pension plan | Dental plan | Disability benefits | Educational assistanceSenior-level Full TimeOak Ridge, TN, US, 378304d ago
-
C++ | DeepSpeed | Differential Privacy | Distributed Training | Federated LearningDisability benefits | Educational assistance | Employee discounts | Flexible work hours | Generous vacation and holidaysSenior-level Full TimeOak Ridge, TN, US, 378304d ago
-
Ablation Studies | Automated Evaluation | CI/CD | Data Pipelines | Deep learningAccess to large scale compute | Career growth applied research | Career growth technical leadership | Hybrid work flexibility | Publication opportunitiesSenior-level Full TimeBrazil5d ago
-
Artificial Intelligence | CI/CD | DPO | Deep learning | DeepSpeedAccess to large scale compute | Career growth | Collaborative work environment | High ownership role | Opportunity to publish researchSenior-level Full TimeIndia5d ago
-
Alignment | Benchmark design | Constitutional AI | Continued Pretraining | Data CurationSenior-level Full TimeDublin, CA (HQ)7d ago
-
Alignment | Benchmark design | DPO | Data Curation | Data DeduplicationSenior-level Full TimeIndia/Bengaluru7d ago
-
Constitutional AI | Continued Pretraining | DPO | Data Curation | DeduplicationSenior-level Full TimeBrazil/Remote R7d ago
-
Senior Applied AI Researcher (India) INR 2500K-4500KArtificial Intelligence | DPO | Data parallelism | DataLoader | DeepSpeedSenior-level Full TimeIndia/Bengaluru7d ago
-
Senior Applied AI Researcher (Brazil) BRL 271K-370KCI/CD | DPO | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeBrazil/Remote R7d ago
-
Senior Applied AI Researcher (Dublin, CA) USD 190K-300KAutomated testing | Continuous Evaluation | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeDublin, CA (HQ)7d ago
-
Senior-level Full TimeTaichung - AATT, Taiwan7d ago
-
Senior Principal Machine Learning Engineer (Fulfilment) SGD 182K-240KDecision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic ModelsBirthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore12d ago
-
Senior Principal Data Scientist (Fulfilment) SGD 224K-252KDecision Processes | DeepSpeed | Distributed Training | Dynamic Models | FSDPBirthday leave | Flexible work arrangements | Life insurance | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore12d ago
-
AI Software Engineer Intern CNY 28K-50KAWQ | Cache optimization | DINOv2 | DeepSpeed | Diffusion ModelsEntry-level Full Time InternshipCHN - Minhang, China13d ago
-
Research Scientist - Multimodal Representation Learning USD 200K-300KCLIP | Computer Vision | Contrastive Learning | DINOv2 | DeepSpeedMid-level Full TimeFremont, California, United States15d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeHong Kong18d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeSingapore18d ago
-
3D Parallelism | C++ | CUDA | DeepSpeed | InfinibandEntry-level Full TimeAustralia18d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeChina18d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeBoston, USA18d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeSeattle, USA18d ago
-
3D Parallelism | C++ | CUDA | Data parallelism | DeepSpeedEntry-level Full TimeOregon, USA18d ago
-
C++ | CUDA | Data parallelism | DeepSpeed | InfinibandEntry-level Full TimeSan Francisco Bay Area, USA18d ago
-
Asynchronous programming | Asyncio | Deep learning | DeepSpeed | Distributed TrainingSenior-level Full TimeChina, Shanghai18d ago
-
Senior Software Engineer, RL Post-Training Frameworks USD 184K-356KActor Based Programming | C# | C++ | Consistency models | DPOComprehensive benefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States19d ago
-
GenAI Researcher USD 163K-307KAutoregressive models | Deep learning | DeepSpeed | Diffusion Models | Distributed Training401k plan | Dental insurance | Disability insurance | Life insurance | Medical insuranceMid-level Full TimeUS-California-Palo Alto, United States19d ago
-
GenAI Researcher USD 163K-307KAutoregressive models | Deep learning | DeepSpeed | Diffusion Models | Distributed Training401k plan | Dental insurance | Disability insurance | Life insurance | Medical insuranceMid-level Full TimeUS-California-Palo Alto, United States19d ago
-
Research Scientist - LLM Training System as a Service - Global Frontier Tech Recruitment Program - 2027 Start (PhD) USD 212K-450KCUDA | Deep learning | Distributed Systems | GPU Performance | GPU Performance OptimizationEntry-level Full TimeSan Jose, California, United States21d ago
-
Agentic AI | Autogen | BF16 | Big Data | CI/CDSenior-level Full TimeFab 10A, Singapore22d ago
-
Machine Learning Engineer (Training Optimization) CNY 240K-480KCUDA | Data Types | DeepSpeed | Diffusion Models | Distributed TrainingSenior-level Full TimeBeijing, Beijing, China26d ago
-
Audio Processing | Computer Vision | Data Ablation | DeepSpeed | Diffusion ModelsSenior-level Full TimeNoida, India R28d ago
-
None Full Time深圳、北京、上海1mo ago
-
None Full Time深圳、北京、上海1mo ago
-
Entry-level Full Time深圳、北京、上海1mo ago
-
Senior Principal Machine Learning Engineer USD 165K-296K3D Geometry | Apache Iceberg | BIM | CAD | CUDASenior-level Full TimeAMER - United States - Massachusetts …1mo ago
-
Senior Engineering Manager, AI Runtime USD 228K-297KCheckpointing | Cluster Lifecycle Management | Cluster lifecycle | DeepSpeed | Distributed TrainingSenior-level Full TimeMountain View, California; San Francisco, California1mo ago
-
Agentic AI | Autogen | BF16 | Big Data | CI/CDSenior-level Full TimeFab 10A, Singapore1mo ago
-
Senior Applied Scientist - Sovereign AI INR 2500K-4600KAblation Studies | Benchmarking | Knowledge Distillation | Machine Learning | Megatron-LMSenior-level Full TimeIndia, Bengaluru1mo ago
-
Principal Engineer, Machine Learning, SMAI SGD 96K-155KAgentic AI | Auto RL | Autogen | BF16 | Big DataSenior-level Full TimeFab 10A, Singapore1mo ago
-
Senior-level Full TimeFab 10A, Singapore1mo ago
-
AI Model Deployment | AI model | Artificial Intelligence | Debugging | Deep learningSenior-level Full TimeKorea, Seoul, Korea, Republic of1mo ago
-
AI Research Scientist – Datadog AI Research (DAIR) EUR 95K-120KAI Agents | CUDA | DeepSpeed | Distributed Training | Foundation ModelsConference attendance | Conference presentation | Employee stock purchase plan | Hybrid work | Mentor programSenior-level Full TimeParis, France1mo ago