Applied Reinforcement Learning Engineer
Tasks
- Architect multi step reasoning agents with tool calling
- Build RL training pipelines from human labeled traces
- Design and build RL environments
- Design reward functions verifiers and validation frameworks
- Train LLM based agents with RLHF
- Translate RL research into production systems
Perks/Benefits
- Collaborative research culture
- Hybrid remote work
- Open to Publications
- Research to Production Opportunities
Skills/Tech-stack
A2C | A3C | Actor-critic | Agent systems | BCQ | Behavioral cloning | CQL | DPO | Deep Q-Network | Domain Randomization | Double Deep Q Network | Dreamer | Dueling Deep Q Network | Eligibility Traces | Entropy Regularization | GAIL | GRPO | Gymnasium | Hierarchical reinforcement learning | IQL | JAX | MDP | MuZero | Multi-Agent | Multi-Agent Systems | Multi-step reasoning | Offline RL | OpenAI Gym | Options Framework | PPO | Policy Gradient | PyTorch | Python | Q-learning | RLHF | RLOO | Reinforcement Learning | Reward Modeling | Reward engineering | Rllib | SAC | Simulation to Real | Stable Baselines | TD Lambda | TRPO | TensorFlow | Tool use | Trust Region | World Models
Education
Related jobs
-
Adobe Campaign | Automated testing | Azure | Azure Data | Azure Data FactorySenior-level ContractSan Francisco, United States3h ago
-
ANSYS | CAD | Git | NVIDIA Omniverse | PipEntry-level InternshipPennsylvania, Canonsburg4h ago
-
ANSYS Mechanical | Ansys Twin Builder | C++ | Control Systems | DrakeFinancial benefits | Health benefits | Internship program | Remote work | Wellness benefitsEntry-level InternshipCalifornia, US (California) Off-Site4h ago
-
Software Engineer, Data License Monitoring & Resiliency USD 160K-240KAIOps | Anomaly Detection | C# | C++ | Capacity Planning401k match | Dental insurance | Life insurance | Long-term disability | Medical insuranceMid-level Full TimeNew York4h ago
-
Senior Software Engineer - Data & Analytics Federation USD 160K-240KApache Arrow | Asynchronous frameworks | C++ | Distributed Systems | Linux401k match | Dental insurance | Life insurance | Medical insurance | Paid HolidaysSenior-level Full TimeNew York4h ago
-
API Design | Agentic Workflows | C plus plus | Code review | Context ManagementSenior-level Full TimeRedmond, WA5h ago
-
Business Engineer, Business AI USD 147K-203KAgent Orchestration | Agentic AI | Agile | Bias Mitigation | Context engineeringCross-functional collaboration | Mentorship | Travel opportunitySenior-level Full TimeMenlo Park, CA | Seattle, WA …5h ago
-
Data Engineer USD 185K-196KData Modeling | Data Warehousing | Dimensional Modeling | ETL | Object-OrientedMid-level Full TimeMenlo Park, CA5h ago
-
Data Engineer USD 183K-203KAnomaly Detection | Data Drift | Data Quality | Data Visualization | Feature DriftEntry-level Full TimeMenlo Park, CA5h ago
-
Data Engineer USD 185K-196KC++ | Data Migration | Data Modeling | Data Warehousing | Dimensional dataMid-level Full TimeMenlo Park, CA5h ago
-
Software Engineer, Compiling, Quantum AI USD 147K-211KC++ | Classical coding theory | Coding theory | Compiler architecture | Compiler designMid-level Full TimeLos Angeles, CA, USA; Mountain View, …5h ago
-
C++ | Data Processing | Debugging | Information Retrieval | Language ModelsSenior-level Full TimeMountain View, CA, USA5h ago
-
Algorithms | C++ | Capacity Planning | Code Reviews | Data StructuresSenior-level Full TimeSeattle, WA, USA5h ago
-
Senior Software Engineer, Google Cloud AI USD 174K-252KArtificial Intelligence | C++ | Data Structures | Data Structures and Algorithms | Design and ArchitectureSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Senior Staff Software Engineer, AI/ML, Google Cloud USD 262K-365KAlgorithms | Data Processing | Data Structures | Debugging | Distributed SystemsSenior-level Full TimeSunnyvale, CA, USA5h ago
-
A/B | A/B Testing | B testing | C++ | Content Understanding401k match | Commuter benefits | Disability insurance | Life insurance | Medical/Dental/Vision insuranceSenior-level Full TimeSunnyvale, CA11h ago
-
Forward Deployed AI Engineer/Data Scientist USD 78K-195KA/B | A/B Testing | B testing | Chatbot Platforms | Clustering401k matching | Basic life insurance | Employee stock purchase plan | Health, dental, vision coverage | Long-term disabilityMid-level Full TimeUnited States (Remote) R12h ago
-
Senior-level Full TimeSan Francisco16h ago
-
AWS Cloud ETL Engineer - Cleared USD 118K-178KAWS CDK | AWS Glue | AWS IAM | AWS Lambda | AWS Step FunctionsSecurity ClearanceMid-level Full TimeWashington, DC, US16h ago
-
Staff AI Researcher / Engineer USD 200K-240KAttention Mechanisms | Data Modeling | Debugging | Deep learning | Diffusion ModelsDiversity and inclusionSenior-level Full TimeSan Jose, California, United States16h ago
-
Agent systems | Machine Learning | Multi-Agent | Multi-Agent Systems | Offline LearningEntry-level InternshipBay Area, California16h ago
-
Robotics Engineer, Maritime USD 191K-253KAnomaly Detection | C++ | Cameras | Computer Vision | Data Analysis401k retirement plan | Commuter benefits | Dental benefits | Disability insurance | Healthcare benefitsSenior-level Full TimeBoston, Massachusetts, United States16h ago
-
API Integration | AWS | Azure | Cloud Computing | Data PipelinesDental insurance | Health insurance | On-site collaboration | Vision insurance | Work with AI LabsMid-level Full TimeSan Francisco, CA; Onsite16h ago
-
Senior-level Full TimeRedmond, WA, US16h ago
-
AI Search | AWS Bedrock | Agentic Workflows | Amazon SageMaker | Anthropic401k | Dental insurance | Medical insurance | Paid sick hours | Vision insuranceSenior-level Contract Full TimeRidgefield Park, NJ, United States16h ago