Research Engineer - Reinforcement Learning
Tasks
- Contribute to open-source libraries and frameworks
- Lead research on synthetic data generation pipeline
- Optimize AI inference workload performance and cost
- Publish research papers in top-tier AI conferences
- Stay updated on latest AI/ML infrastructure advancements
- Write technical blogs for customers and developers
Perks/Benefits
- Conferences
- Equity incentives
- Flexible remote or in-office work
- Hackathons
- Learning opportunities
- Quarterly off-sites
- Relocation assistance
- Visa sponsorship
Skills/Tech-stack
AI Models | AI/ML | AI/ML engineering | CI/CD | Data Generation | Distributed inference | Experiment tracking | Large AI models | ML Engineering | MLOps | Model versioning | Reinforcement Learning | SGLang | Scaling large AI models | Synthetic Data Generation | Synthetic data | VLLM
Education
Bachelor's | Master's | PhD
Regions
Countries
States
Related jobs
-
Code review | Contamination Checking | Data Generation | Data Pipelines | Data ProcessingEntry-level Full TimeMenlo Park, CA1d ago
-
Research Engineer, Media Data Research - MSL FAIR USD 170K-251KComputer Vision | Data Curation | Data Generation | Data Scaling Laws | Data mixingSenior-level Full TimeMenlo Park, CA1d ago
-
Research Engineer, Gemini AutoRater USD 166K-244KData collection | Fine Tuning | Foundation Models | Human evaluation | Language ModelsSenior-level Full TimeMountain View, California, US2d ago
-
Systems Engineering Intern (Machine Learning) USD 62K-124KAgentic AI | C# | C++ | DPO | Deep learningEntry-level Full Time InternshipDallas, TX, United States3d ago
-
Software Engineer III, AI/ML, Google Research USD 147K-211KAlgorithms | C++ | Data Processing | Data Structures | Deep learningBenefits | Bonus | EquitySenior-level Full TimeMountain View, CA, USA; Cambridge, MA, …3d ago
-
Staff ML Engineer, Frontier AI USD 250KDeep learning | Language Processing | Machine Learning | Model Optimization | Natural Language401k match | Flexible time off | Medical/Dental/Vision | Parental leave | Remote workSenior-level Full TimeSan Francisco4d ago
-
3D Reconstruction | AWS SageMaker | Amazon EC2 | Computer Vision | DDP401k eligibility | Annual cash bonus | Dental insurance | Medical insurance | Paid time offMid-level Full TimeLos Altos, CA4d ago
-
Applied Research Engineer, Agents USD 250K-300KAI research | Autonomous Agents | Benchmarking | Data Pipelines | Data ScienceGrowth opportunities | Hybrid work environment | Impact focusMid-level Full TimeSan Francisco Bay Area5d ago
-
Applied Research Engineer USD 250K-300KAI Model Training | AI model | Data Quality | Data Quality Measurement | Deep learningFlexible work options | Health insurance | Paid time off | Professional development budgetSenior-level Full TimeSan Francisco Bay Area5d ago
-
Computer Vision | Data Management | Deep learning | Edge AI | Experiment trackingFlexible scheduling | Professional development opportunitiesSenior-level Full TimeBaltimore, Maryland5d ago
-
Staff Machine Learning Engineer, Responsible AI USD 177K-387KA/B | A/B Testing | AI Safety | B testing | C++Flexible schedule | Health benefits | Learning & development opportunities | Remote workSenior-level Full TimeSeattle (WA), United States5d ago
-
Research Engineer, Reinforcement Learning USD 295K-440KData Curation | Language Models | PyTorch | Reinforcement Learning | Research401k | Dental insurance | Gym membership | Health insurance | Vision insuranceMid-level Full TimeSan Francisco, CA8d ago
-
Research Engineer Intern, Evaluations USD 75K-114KData Engineering | Data Lakes | Data Warehouses | Evaluation Methodologies | JAXDental insurance | Gym membership | Health insurance | Internship stipend | Vision insuranceEntry-level Full Time InternshipSan Francisco, CA8d ago
-
Research Engineer USD 100K-300KComputer Vision | Imitation Learning | Machine Learning | PyTorch | PythonMid-level Full TimePittsburgh, San Mateo11d ago
-
Lead AI Research Engineer USD 91K-175KCloud Platforms | Cloud platforms Azure | Cloud platforms Azure GCP | Cloud platforms Azure GCP AWS | Data PipelinesFlexible work arrangements | Health and well-being benefits | Inclusive culture | Professional development opportunities | Recognition programsSenior-level Full TimeWork at Home - Ohio - …11d ago
-
Frontier Data Lead - Code USD 250K-350KAutomation | C++ | Data Generation | Data Management | Data PipelinesCollaborative work environment | Competitive compensation | Flexible working hours | Opportunity to work with top AI labsSenior-level Full TimeSan Francisco, California, United States11d ago
-
AI Research Engineer, Reinforcement Learning USD 180K-250KC++ | Control Systems | PyTorch | Python | Reinforcement Learning401k match | Dental insurance | Health insurance | Paid time off | Vision insuranceSenior-level Full TimeSan Carlos, California, United States13d ago
-
Principal Research Engineer USD 163K-331KAI | Agent systems | Bias Mitigation | Data Engineering | Deep learningCareer development opportunities | Flexible work arrangements | Health benefitsSenior-level Full TimeRedmond, WA, US15d ago
-
Senior Research Engineer USD 119K-258KAgent systems | Bias Mitigation | CI/CD | Data Engineering | Deep learningSenior-level Full TimeRedmond, WA, US15d ago
-
Senior Research Engineer USD 119K-258KAI Deployment | AI Safety | Bias Mitigation | C# | C++Career development | Flexible work arrangements | Health benefits | Inclusive cultureSenior-level Full TimeRedmond, WA, US15d ago
-
Research Engineer USD 180K-370KC++ | Cloud infrastructure | Data Pipelines | Deep learning | Distributed SystemsCareer growth | Collaborative environment | Impactful workSenior-level Full TimeSan Francisco17d ago
-
Audio ML Engineer (Research) USD 134K-196KAI-assisted coding | AI-assisted coding tools | Audio signal processing | Coding Tools | DSPEmployee discounts | Flexible work environment | Recognition program | Training opportunities | Tuition reimbursementMid-level Full TimeUS Northridge 8500 Balboa Blvd, United …19d ago
-
Research Engineer / Research Scientist, Tokens USD 350K-500KData Processing | Distributed Training | Kubernetes | Large Scale Data | Large-scale Data ProcessingFlexible working hours | Generous vacation and parental leave | Option to donate equityMid-level Full TimeNew York City, NY; New York …21d ago
-
Code review | Data Filtering | Data Generation | Data Pipelines | Distributed SystemsSenior-level Full TimeMenlo Park, CA21d ago
-
Code review | Data Filtering | Data Generation | Data-Centric AI | Data-centricSenior-level Full TimeMenlo Park, CA21d ago