Principal AI Research Engineer - RL
NYC | SF
ā ļø We'll shut down after Aug 1st - try fooš¦ for all jobs in tech ā ļø
Reflex Robotics
We're building affordable general-purpose robots to free humanity from the drudgery of boring and repetitive tasks. Schedule a pilot today.Company Overview
Reflex Robotics is building affordable ($10k) wheeled humanoid robots to automate dangerous and repetitive tasks in manufacturing and logistics.
We envision a future where intelligent robots are doing all kinds of boring work that people hate doingāloading chicken nuggets into Costco boxes, lifting forty pound bags of dog food at Petco stores, and cleaning up cranberry juice spills in your apartment.
We are a three-year-old startup backed by Khosla Ventures, with $60M/year of revenue lined up pending successful pilots with e-commerce warehouses in 2025.
How Does It Work?
Our robots are designed and built entirely in-house by an engineering team that led development of the Stretch robot at Boston Dynamics and key systems on the Tesla Model S, X, and Y production lines. Reflex robots are high-performance, low-inertia, and optimized for low-cost manufacturing.
Weāve built the fastest, real-time teleoperation system in the world, allowing a remote operator in South America to āplay a video gameā to control our robots. This has allowed us to already ship robots with positive unit economics, and enables us to create a powerful human-intervention + RL product feedback loop. Our system allows us to collect high-quality demonstrations at scaleāgiving us the proprietary data engine needed to train increasingly capable AI systems. We're on track to build the largest robotics dataset in the world, which will serve as an important long-term advantage.
Key Company Beliefs
High-quality, proprietary robotics data is the next foundation for generational AI companies (like Tesla FSD and ChatGPT).
Being nerd-sniped by maximizing an engineering metric is way less important than solving our customersā biggest pain points.
An insane work ethic is required for outsized successāand you'll be rewarded for it.
What Weāre Looking For
Weāre looking for stellar on-policy RL engineers to work on creating robust robot policies.
Weāre still a small teamāwhich means high ownership, high equity, and the chance to shape the product from the ground up.
VLAs and other great ābase policiesā for robotics achieve ~80% success rates, but in real robot deployments, itās essential to achieve 99.99% success rates. We canāt ask our customers to tolerate our robots packing three socks into a bin instead of four, or swapping shipping labels between two packagesānot even once!
You should apply for this role if:
Youāve re-implemented core RL algorithms (SAC, DDPG) from scratch and can debug unstable gradients / tune hyperparameters correctly
Youāve made meaningful intellectual contributions to sample-efficient RL algorithms (e.g., DreamerV3 and MuZero)
Youāve shipped on-policy RL on hardware that learns in the real-world (e.g., for quadruped walking or drone racing)
Youād be joining a company that already has a solid core businessāwith working hardware, delighted customers, and profitable unit economics. Reflex is de-risked enough to see the hazy outlines of success, but still small enough that thereās enormous upside up for grabs.
Come Join Us
This is a rare chance to help build a flagship robotics company from first principles ā and do work that will redefine what people think is possible in robotics.
We love to see the things youāve worked on. Have a portfolio or insane project youāve worked on? Share it. Weāre looking for people who push past the status quo, are passionate at work and in their own timeāweāre looking for people who want to win.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index š°
Tags: ChatGPT E-commerce Economics Engineering GPT Research Robotics
Perks/benefits: Equity / stock options Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.