Research Engineer - Distributed Training
Tasks
- Create technical blogs for customers and developers
- Develop open-source distributed training libraries and frameworks
- Lead and participate in research for decentralized training orchestration
- Optimize AI workload performance and costs
- Publish research in top AI conferences
- Stay updated with AI/ML infrastructure advances and identify platform enhancements
Perks/Benefits
- Conferences
- Equity incentives
- Flexible work
- Hackathons
- Learning opportunities
- Quarterly off-sites
- Relocation assistance
- Remote or in-office
- Visa sponsorship
Skills/Tech-stack
AI/ML | AI/ML engineering | CI/CD | Compute Optimization | Data parallelism | DeepSpeed | Distributed Training | Experiment tracking | ML Engineering | MLOps | Memory Optimization | Model Parallelism | MosaicML LLM Foundry | Performance Tuning | Pipeline parallelism | PyTorch distributed | Ray | Scalability | Tensor Parallelism | Versioning
Regions
Countries
States
Related jobs
-
Research Engineer, Media Data Research - MSL FAIR USD 170K-251KComputer Vision | Data Curation | Data Generation | Data Scaling Laws | Data mixingSenior-level Full TimeMenlo Park, CA18h ago
-
AI Research Engineer, Computer Vision USD 170K-210KAutoregressive models | CUDA | DDP | Data Pipelines | DeepSpeed401k retirement plan | Company equity | Dental insurance | Fertility support | Human Annotation SupportMid-level Full TimeRemote (U.S. or Canada) R1d ago
-
3D Reconstruction | AWS SageMaker | Amazon EC2 | Computer Vision | DDP401k eligibility | Annual cash bonus | Dental insurance | Medical insurance | Paid time offMid-level Full TimeLos Altos, CA4d ago
-
Computer Vision | Data Management | Deep learning | Edge AI | Experiment trackingFlexible scheduling | Professional development opportunitiesSenior-level Full TimeBaltimore, Maryland5d ago
-
Staff Machine Learning Engineer, Responsible AI USD 177K-387KA/B | A/B Testing | AI Safety | B testing | C++Flexible schedule | Health benefits | Learning & development opportunities | Remote workSenior-level Full TimeSeattle (WA), United States5d ago
-
Research Engineer – LLM Evaluation Systems (Seed Infra) USD 198K-416KBenchmarking | Distributed Systems | GPU Computing | High Performance | High-Performance ComputingMid-level Full TimeSeattle, Washington, United States8d ago
-
Lead AI Research Engineer USD 91K-175KCloud Platforms | Cloud platforms Azure | Cloud platforms Azure GCP | Cloud platforms Azure GCP AWS | Data PipelinesFlexible work arrangements | Health and well-being benefits | Inclusive culture | Professional development opportunities | Recognition programsSenior-level Full TimeWork at Home - Ohio - …11d ago
-
AI Research Engineer, Scaling USD 180K-300KC++ | CUDA | DeepSpeed | Distributed Training | FSDP401k matching | Dental insurance | Health insurance | Holidays | Paid time offSenior-level Full TimeSan Carlos, California, United States13d ago
-
Principal Research Engineer USD 163K-331KAI | Agent systems | Bias Mitigation | Data Engineering | Deep learningCareer development opportunities | Flexible work arrangements | Health benefitsSenior-level Full TimeRedmond, WA, US14d ago
-
Senior Research Engineer USD 119K-258KAgent systems | Bias Mitigation | CI/CD | Data Engineering | Deep learningSenior-level Full TimeRedmond, WA, US14d ago
-
Research Engineer, Multimodal USD 225K-400KAudio Processing | DeepSpeed | FSDP | Image Generation | Model CompressionSenior-level Full TimeRedwood City, CA15d ago
-
Senior Research Engineer USD 119K-258KAI Deployment | AI Safety | Bias Mitigation | C# | C++Career development | Flexible work arrangements | Health benefits | Inclusive cultureSenior-level Full TimeRedmond, WA, US15d ago
-
Audio ML Engineer (Research) USD 134K-196KAI-assisted coding | AI-assisted coding tools | Audio signal processing | Coding Tools | DSPEmployee discounts | Flexible work environment | Recognition program | Training opportunities | Tuition reimbursementMid-level Full TimeUS Northridge 8500 Balboa Blvd, United …19d ago
-
Research Engineer / Research Scientist, Tokens USD 350K-500KData Processing | Distributed Training | Kubernetes | Large Scale Data | Large-scale Data ProcessingFlexible working hours | Generous vacation and parental leave | Option to donate equityMid-level Full TimeNew York City, NY; New York …20d ago
-
Senior Research Engineer USD 119K-258KAI Deployment | Agent systems | Data Management | Deep learning | Fine TuningBenefits | Growth environment | Inclusive cultureSenior-level Full TimeRedmond, WA, US28d ago
-
Senior Research Engineer USD 119K-258KAI Fairness | Agent architectures | Data evaluation | Experimentation | Large Scale TrainingSenior-level Full TimeRedmond, WA, US30d ago
-
Senior LLM Research Engineer - Artificial Intelligence USD 165K-260KAlgorithms | Data Structures | Deep learning | DeepSpeed | Financial NLP401k match | Bonuses | Comprehensive benefits | Disability benefits | Medical/Dental/VisionSenior-level Full TimeNew York30d ago
-
Senior Research Engineer USD 119K-258KAI | AI Search | Agent architectures | AzureAI | Bias MitigationBenefits | Collaborative culture | Growth opportunities | Work on innovative projectsSenior-level Full TimeRedmond, WA, US1mo ago
-
AI Data Foundation Research Engineer USD 126K-240KAI frameworks | Big Data | C++ | Computer Vision | Container OrchestrationHealth & wellbeing benefits | Inclusive work environment | Personal & professional developmentMid-level Full TimeFt. Collins, Colorado, United States of …1mo ago
-
AI Data Foundation Research Engineer USD 126K-240KAI Model Architectures | AI/HPC workflows | Algorithms | Big Data | Big Data PipelinesHealth benefits | Inclusion and diversity policies | Professional developmentMid-level Full TimeFt. Collins, Colorado, United States of …1mo ago
-
Principal Research Engineer USD 139K-304KAI research | Agent systems | Aggregation | Azure Machine Learning | Azure OpenAIGrowth mindset culture | Impactful projects | Inclusive environment | Mentoring opportunitiesSenior-level Full TimeRedmond, WA, US1mo ago
-
Research Engineer - Reinforcement Learning USD 310K-425KAI Models | AI/ML | AI/ML engineering | CI/CD | Data GenerationConferences | Equity incentives | Flexible remote or in-office work | Hackathons | Learning opportunitiesMid-level Full TimeSan Francisco1mo ago
-
AI Research Engineer, Media - MSL PAR USD 177K-251KAI/ML | Computer Vision | Data inference | Deep learning | Efficiency optimizationEntry-level Full TimeMenlo Park, CA1mo ago
-
AI Safety | Computer Vision | Data Curation | Distributed Training | Machine LearningSenior-level Full TimeSunnyvale, CA | Bellevue, WA | …1mo ago
-
Senior Research Engineer / Scientist - Storage for LLM USD 177K-341KAI infrastructure | CUDA | Caching | Distributed Storage | GPU ProgrammingCompetitive salary | Conference participation | Innovative culture | Open Source contribution | Research resourcesSenior-level Full TimeSeattle, Washington, United States1mo ago