Engineering Manager - Models
Remote (Pacific/Mountain time zones)
Full Time Mid-level / Intermediate USD 235K - 300K
The models team keeps Replicate’s model library stocked with the latest generative AI models. We make popular models fast, reliable, and easy to use. We also add features — things people ask for and things they didn’t know they needed.
We’re hiring an engineering manager to help lead this team of six to eight engineers working at the edge of open-source AI and high-performance computing. You’ll support the team, shape the technical direction, and stay close to the code. The team focuses on three things:
Turn research into APIs. We make it easy to package models with cog and run them on Replicate.
Make models faster. CUDA, quantization, parallelism — we use whatever works to make models run faster and cheaper.
Build new model features. This could mean training video models, adding inpainting or ControlNet-style conditioning, or inventing new ways to use models.
We build in the open. That means contributing upstream, releasing internal tools, and sharing what we learn.
What we’re looking for
You’re a strong leader. You bring energy and clarity. You help people do their best work. You like solving real problems and moving fast. If that sounds like you, we’d love to hear from you.
What you’ll do
Lead and grow a team that packages, optimizes, and improves generative models. Push model quality and performance every day.
Collaborate with other teams to improve performance, tooling, and usability across the platform. Represent and advocate for our customers as an internal user of our platform.
Bring momentum and clarity to projects: set goals, unblock the team, and keep things moving.
Work with company leadership to prioritize and align on strategy; help shape the technical roadmap — from ML tooling to infrastructure.
Give back to the Replicate community — contribute to open-source projects, and support the team in doing the same.
You should apply if…
You’ve helped teams grow and thrive, especially in fast-paced or startup environments. You mentor engineers and know how to scale a team.
You’ve worked in machine learning, data science, or adjacent fields. You know a bit about model optimization and performance.
You’re great at communication and project management. You bring order to ambiguity and keep the team focused.
You care about AI and want to build practical tools that help real people.
You want to make generative AI easier for developers and creators to use.
You’re part of the generative AI or open-source infrastructure community.
You’ll get to work on some of the most interesting problems in AI infrastructure — while contributing to the open-source communities that make this work possible.
This role can be remote anywhere in the US (or other countries that align with US time zones) or in-person. If you're local to the Bay Area, we would like you to work out of our San Francisco office at least 3 days a week.
Tags: APIs ControlNet CUDA Engineering Generative AI Generative modeling Machine Learning ML infrastructure Open Source Research
Perks/benefits: Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.