AI Research Engineer (Multi-Modal Reinforcement Learning) - 100% Remote Worldwide
Tasks
- Analyze policy performance bottlenecks across modalities
- Benchmark reinforcement learning performance across multimodal tasks
- Curate multimodal simulation environments and datasets
- Design reinforcement learning infrastructure for distributed training
- Develop reinforcement learning paradigms from environment feedback
- Develop reward modeling to improve training stability
- Publish research findings in top-tier conferences
- Research reinforcement learning algorithms for multimodal models
Perks/Benefits
Skills/Tech-stack
Audio Processing | Autoregression | Autoregressive models | Computer Vision | Deep learning | Diffusion Models | Distributed Training | Exploration/exploitation | Generative Models | Language Processing | Multi-Modal | Multi-Modal Learning | Natural Language | Natural Language Processing | Policy Optimization | PyTorch | Reinforcement Learning | Reward Hacking | Reward Modeling | Sample efficiency | Training stability
Education
Roles
Related jobs
-
Mid-level Full Time北京 R11h ago
-
Miclaw-端云协同调度专家 (Hybrid AI Architect) CNY 240K-360K5G | API Integration | Claude 3.5 | Distributed Systems | GPT-4oHybrid workSenior-level Full Time北京 R11h ago
-
Software Engineer USD 149K-211KAlgorithms | C# | C++ | Cause analysis | Code reviewBonus | Equity | Health benefits | Hybrid scheduleMid-level Full TimeMountain View, CA, USA R18h ago
-
Software Engineer USD 149K-211KAlgorithms | C# | C++ | Code review | Data AnalysisBenefits | Bonuses | Equity | Hybrid work scheduleMid-level Full TimeMountain View, CA, USA R18h ago
-
Software Engineer USD 149K-211KAlgorithms | Android Development | C# | C++ | Data StructuresHybrid scheduleMid-level Full TimeMountain View, CA, USA R18h ago
-
Software Engineer USD 149K-211KAlgorithms | C# | C++ | Code review | Data AnalysisBonus | Equity | Hybrid work scheduleMid-level Full TimeMountain View, CA, USA R18h ago
-
Software Engineer USD 149K-211KC# | C++ | Cause analysis | Data Processing | Data StructuresHybrid scheduleMid-level Full TimeSunnyvale, CA, USA R18h ago
-
Senior Software Engineer USD 189K-252KAlgorithms | Audio Processing | C++ | Cause analysis | Data StructuresHybrid scheduleSenior-level Full TimeNew York, NY, USA R18h ago
-
Software Engineer USD 149K-211KAlgorithms | C# | C++ | Cause analysis | Data AnalysisHybrid scheduleMid-level Full TimeMountain View, CA, USA R18h ago
-
Senior Research Engineer USD 174K-252KC plus plus | Code Reviews | Data Curation | Deep learning | JAXHybrid scheduleSenior-level Full TimeNew York, NY, USA R18h ago
-
Staff Software Engineer USD 207K-300KAdversarial Testing | C++ | Data pipeline | Learning evaluation | Machine LearningEquity compensation | Health benefits | Hybrid scheduleSenior-level Full TimeNew York, NY, USA R18h ago
-
Senior Research Engineer USD 174K-252KC plus plus | Cause analysis | Code Reviews | Dataset curation | Deep Neural NetworksBenefits | Bonus | Equity | Hybrid work scheduleSenior-level Full TimeMountain View, CA, USA R18h ago
-
Software Engineer USD 147K-211KAlgorithms | C# | C++ | Cause analysis | Data AnalysisHybrid work scheduleMid-level Full TimeNew York, NY, USA R18h ago
-
Consultant.e AI Engineer EUR 50K-60KAI Foundry | APIs | Apache Spark | Azure AI | Azure AI FoundryHybrid work | RTT | Restaurant ticket | TrainingSenior-level Full TimeNiort, Deux-Sèvres, Nouvelle-Aquitaine, FR R21h ago
-
Consultant.e AI Engineer EUR 50K-60KAI Foundry | Apache Spark | Azure | Azure AI | Azure AI FoundryCareer growth | Human-sized company | Hybrid work | Individualized coaching | Meal cardSenior-level Full TimeNantes, Loire-Atlantique, Pays de la Loire, … R21h ago
-
Consultant.e AI Engineer EUR 60K-70KAI Foundry | API Development | Azure AI | Azure AI Foundry | Azure CognitiveHybrid work | Meal card | RTT | Training opportunitiesSenior-level Full TimeParis, Paris, Île-de-France, FR R21h ago
-
AI Engineer (m/f/n) PLN 282K-402KAWS | AWS Lambda | Apache Kafka | CI/CD | Cloud platformB2B contract | Flexible office or remote work | International cross functional environment | Remote work | Training opportunitiesSenior-level Full TimeWarszawa, Województwo mazowieckie, Poland R21h ago
-
Senior-level Full TimeBangalore, Karnataka, India R22h ago
-
Senior-level Full TimeBangalore, Karnataka, India R22h ago
-
ML Engineer EUR 45K-60KCI/CD | Cloud Computing | Data Pipelines | Data Quality | Data VisualizationFlexible working hours | Hybrid work model | Work from anywhere up to 3 weeks per yearMid-level Full TimeVilnius, Vilnius City Municipality, Lithuania R1d ago
-
Junior AI Engineer - Computer Vision INR 300K-324KComputer Vision | Convolutional Neural Networks | Deep learning | Detectron2 | DockerCollaborative engineering environment | GPU infrastructure access | Opportunities for growth | Remote workEntry-level Full TimeIndia - Remote R1d ago
-
Hugging Face | LLM orchestration | Langchain | Language Models | Large Language ModelsCareer growth potential | Early stage technical hire | Equity compensation | High ownership role | Hybrid workMid-level Full TimeSan Francisco, CA; Hybrid R1d ago
-
Deep learning | Diffusion Models | Foundation Models | Generative Models | Image GenerationAcademic internship | Hybrid work setup | Mentorship | Onsite Days Per WeekEntry-level Internship Part TimeGM Israel - Technical Center Israel … R1d ago
-
AI Solutions Architect USD 144K-200KAI RMF | Angular | Django | Drift Detection | FedRAMPCareer development | Employee resource groups | Flexible WFH | Generous PTO | Paid volunteer timeSenior-level Full TimeUS-Washington DC-Remote, United States R1d ago
-
Batching | C# | C++ | CUDA | FP16Dental insurance | Disability insurance | Flexible spending account | Flexible vacation | Health insuranceMid-level Full TimeAnywhere, USA R1d ago