Principal AI Research Engineer - RL

NYC | SF

āš ļø We'll shut down after Aug 1st - try foošŸ¦ for all jobs in tech āš ļø

Reflex Robotics

We're building affordable general-purpose robots to free humanity from the drudgery of boring and repetitive tasks. Schedule a pilot today.

View all jobs at Reflex Robotics

Apply now Apply later

Company Overview

Reflex Robotics is building affordable ($10k) wheeled humanoid robots to automate dangerous and repetitive tasks in manufacturing and logistics.

We envision a future where intelligent robots are doing all kinds of boring work that people hate doing—loading chicken nuggets into Costco boxes, lifting forty pound bags of dog food at Petco stores, and cleaning up cranberry juice spills in your apartment.

We are a three-year-old startup backed by Khosla Ventures, with $60M/year of revenue lined up pending successful pilots with e-commerce warehouses in 2025.


How Does It Work?

Our robots are designed and built entirely in-house by an engineering team that led development of the Stretch robot at Boston Dynamics and key systems on the Tesla Model S, X, and Y production lines. Reflex robots are high-performance, low-inertia, and optimized for low-cost manufacturing.

We’ve built the fastest, real-time teleoperation system in the world, allowing a remote operator in South America to ā€œplay a video gameā€ to control our robots. This has allowed us to already ship robots with positive unit economics, and enables us to create a powerful human-intervention + RL product feedback loop. Our system allows us to collect high-quality demonstrations at scale—giving us the proprietary data engine needed to train increasingly capable AI systems. We're on track to build the largest robotics dataset in the world, which will serve as an important long-term advantage.


Key Company Beliefs

  • High-quality, proprietary robotics data is the next foundation for generational AI companies (like Tesla FSD and ChatGPT).

  • Being nerd-sniped by maximizing an engineering metric is way less important than solving our customers’ biggest pain points.

  • An insane work ethic is required for outsized success—and you'll be rewarded for it.


What We’re Looking For

We’re looking for stellar on-policy RL engineers to work on creating robust robot policies.

We’re still a small team—which means high ownership, high equity, and the chance to shape the product from the ground up.

VLAs and other great ā€œbase policiesā€ for robotics achieve ~80% success rates, but in real robot deployments, it’s essential to achieve 99.99% success rates. We can’t ask our customers to tolerate our robots packing three socks into a bin instead of four, or swapping shipping labels between two packages—not even once!

You should apply for this role if:

  • You’ve re-implemented core RL algorithms (SAC, DDPG) from scratch and can debug unstable gradients / tune hyperparameters correctly

  • You’ve made meaningful intellectual contributions to sample-efficient RL algorithms (e.g., DreamerV3 and MuZero)

  • You’ve shipped on-policy RL on hardware that learns in the real-world (e.g., for quadruped walking or drone racing)

You’d be joining a company that already has a solid core business—with working hardware, delighted customers, and profitable unit economics. Reflex is de-risked enough to see the hazy outlines of success, but still small enough that there’s enormous upside up for grabs.


Come Join Us

This is a rare chance to help build a flagship robotics company from first principles – and do work that will redefine what people think is possible in robotics.

We love to see the things you’ve worked on. Have a portfolio or insane project you’ve worked on? Share it. We’re looking for people who push past the status quo, are passionate at work and in their own time—we’re looking for people who want to win.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index šŸ’°

Job stats:  0  0  0

Tags: ChatGPT E-commerce Economics Engineering GPT Research Robotics

Perks/benefits: Equity / stock options Startup environment

Region: North America
Country: United States

More jobs like this