Applied Reinforcement Learning Engineer
Tasks
- Architect multi step reasoning agents with tool calling
- Build RL training pipelines from human labeled traces
- Design and build RL environments
- Design reward functions verifiers and validation frameworks
- Train LLM based agents with RLHF
- Translate RL research into production systems
Perks/Benefits
- Collaborative research culture
- Hybrid remote work
- Open to Publications
- Research to Production Opportunities
Skills/Tech-stack
A2C | A3C | Actor-critic | Agent systems | BCQ | Behavioral cloning | CQL | DPO | Deep Q-Network | Domain Randomization | Double Deep Q Network | Dreamer | Dueling Deep Q Network | Eligibility Traces | Entropy Regularization | GAIL | GRPO | Gymnasium | Hierarchical reinforcement learning | IQL | JAX | MDP | MuZero | Multi-Agent | Multi-Agent Systems | Multi-step reasoning | Offline RL | OpenAI Gym | Options Framework | PPO | Policy Gradient | PyTorch | Python | Q-learning | RLHF | RLOO | Reinforcement Learning | Reward Modeling | Reward engineering | Rllib | SAC | Simulation to Real | Stable Baselines | TD Lambda | TRPO | TensorFlow | Tool use | Trust Region | World Models
Education
Related jobs
-
ANSYS | CAD | Git | NVIDIA Omniverse | PipEntry-level InternshipPennsylvania, Canonsburg2h ago
-
ANSYS Mechanical | Ansys Twin Builder | C++ | Control Systems | DrakeFinancial benefits | Health benefits | Internship program | Remote work | Wellness benefitsEntry-level InternshipCalifornia, US (California) Off-Site2h ago
-
Software Engineer, Data License Monitoring & Resiliency USD 160K-240KAIOps | Anomaly Detection | C# | C++ | Capacity Planning401k match | Dental insurance | Life insurance | Long-term disability | Medical insuranceMid-level Full TimeNew York2h ago
-
Senior Software Engineer - Data & Analytics Federation USD 160K-240KApache Arrow | Asynchronous frameworks | C++ | Distributed Systems | Linux401k match | Dental insurance | Life insurance | Medical insurance | Paid HolidaysSenior-level Full TimeNew York2h ago
-
API Design | Agentic Workflows | C plus plus | Code review | Context ManagementSenior-level Full TimeRedmond, WA3h ago
-
Business Engineer, Business AI USD 147K-203KAgent Orchestration | Agentic AI | Agile | Bias Mitigation | Context engineeringCross-functional collaboration | Mentorship | Travel opportunitySenior-level Full TimeMenlo Park, CA | Seattle, WA …3h ago
-
Data Engineer USD 185K-196KData Modeling | Data Warehousing | Dimensional Modeling | ETL | Object-OrientedMid-level Full TimeMenlo Park, CA3h ago
-
Data Engineer USD 183K-203KAnomaly Detection | Data Drift | Data Quality | Data Visualization | Feature DriftEntry-level Full TimeMenlo Park, CA3h ago
-
Data Engineer USD 185K-196KC++ | Data Migration | Data Modeling | Data Warehousing | Dimensional dataMid-level Full TimeMenlo Park, CA3h ago
-
Software Engineer, Compiling, Quantum AI USD 147K-211KC++ | Classical coding theory | Coding theory | Compiler architecture | Compiler designMid-level Full TimeLos Angeles, CA, USA; Mountain View, …3h ago
-
C++ | Data Processing | Debugging | Information Retrieval | Language ModelsSenior-level Full TimeMountain View, CA, USA3h ago
-
Algorithms | C++ | Capacity Planning | Code Reviews | Data StructuresSenior-level Full TimeSeattle, WA, USA3h ago
-
Senior Software Engineer, Google Cloud AI USD 174K-252KArtificial Intelligence | C++ | Data Structures | Data Structures and Algorithms | Design and ArchitectureSenior-level Full TimeSunnyvale, CA, USA3h ago
-
Senior Staff Software Engineer, AI/ML, Google Cloud USD 262K-365KAlgorithms | Data Processing | Data Structures | Debugging | Distributed SystemsSenior-level Full TimeSunnyvale, CA, USA3h ago
-
A/B | A/B Testing | B testing | C++ | Content Understanding401k match | Commuter benefits | Disability insurance | Life insurance | Medical/Dental/Vision insuranceSenior-level Full TimeSunnyvale, CA10h ago
-
Forward Deployed AI Engineer/Data Scientist USD 78K-195KA/B | A/B Testing | B testing | Chatbot Platforms | Clustering401k matching | Basic life insurance | Employee stock purchase plan | Health, dental, vision coverage | Long-term disabilityMid-level Full TimeUnited States (Remote) R11h ago
-
Senior-level Full TimeSan Francisco14h ago
-
AWS Cloud ETL Engineer - Cleared USD 118K-178KAWS CDK | AWS Glue | AWS IAM | AWS Lambda | AWS Step FunctionsSecurity ClearanceMid-level Full TimeWashington, DC, US14h ago
-
Staff AI Researcher / Engineer USD 200K-240KAttention Mechanisms | Data Modeling | Debugging | Deep learning | Diffusion ModelsDiversity and inclusionSenior-level Full TimeSan Jose, California, United States14h ago
-
Agent systems | Machine Learning | Multi-Agent | Multi-Agent Systems | Offline LearningEntry-level InternshipBay Area, California14h ago
-
Robotics Engineer, Maritime USD 191K-253KAnomaly Detection | C++ | Cameras | Computer Vision | Data Analysis401k retirement plan | Commuter benefits | Dental benefits | Disability insurance | Healthcare benefitsSenior-level Full TimeBoston, Massachusetts, United States14h ago
-
Senior-level Full TimeRedmond, WA, US15h ago
-
AI Search | AWS Bedrock | Agentic Workflows | Amazon SageMaker | Anthropic401k | Dental insurance | Medical insurance | Paid sick hours | Vision insuranceSenior-level Contract Full TimeRidgefield Park, NJ, United States15h ago
-
AWS S3 | Access Control | Active IQ | Ansible | Audit Logging401k | Dental insurance | Medical insurance | Paid sick hours | Vision insuranceSenior-level Contract Full TimeRidgefield Park, NJ, United States15h ago
-
Senior-level Full TimeRedmond, WA, US; Atlanta, GA, US; …15h ago