Applied Reinforcement Learning Engineer 2
Tasks
- Architect multi-step reasoning agents with tool calling and closed learning loops
- Build end-to-end pipelines from human-labeled traces to RL training data
- Design and build RL environments for enterprise workflows
- Design reward functions, verifiers, and validation frameworks
- Train LLM-based agents using PPO, GRPO, DPO, and RLHF
- Translate RL research into production systems
Perks/Benefits
- N/A
Skills/Tech-stack
Actor-Critic | BCQ | Behavioral Cloning | CQL | DQN | Deep Reinforcement Learning | Direct Preference Optimization | Distributed Training | Domain Randomization | Double DQN | Dreamer | Dueling DQN | GAIL | Gymnasium | Hierarchical Reinforcement Learning | IQL | JAX | Large Language Models | Markov Decision Process | Model-Based Reinforcement Learning | MuZero | Multi-Agent Systems | Offline Reinforcement Learning | OpenAI Gym | Options Framework | PPO | Policy Gradient | Preference Learning | PyTorch | Python | Q-learning | Reinforcement Learning | Reinforcement Learning from Human Feedback | Reward Modeling | RLlib | SAC | Sim-to-Real | Simulation | Stable Baselines | TD Lambda | TRPO | TensorFlow | Tool Use | World Models
Education