AI Research Engineer (Multi-Modal Reinforcement Learning) - 100% Remote Worldwide
Tasks
- Analyze policy performance bottlenecks across modalities
- Benchmark reinforcement learning performance across multimodal tasks
- Curate multimodal simulation environments and datasets
- Design reinforcement learning infrastructure for distributed training
- Develop reinforcement learning paradigms from environment feedback
- Develop reward modeling to improve training stability
- Publish research findings in top-tier conferences
- Research reinforcement learning algorithms for multimodal models
Skills/Tech-stack
Audio Processing | Autoregression | Autoregressive models | Computer Vision | Deep learning | Diffusion Models | Distributed Training | Exploration/exploitation | Generative Models | Language Processing | Multi-Modal | Multi-Modal Learning | Natural Language | Natural Language Processing | Policy Optimization | PyTorch | Reinforcement Learning | Reward Hacking | Reward Modeling | Sample efficiency | Training stability