Engineering Manager - Models

Remote (Pacific/Mountain time zones)

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Applications have closed

Replicate

Run open-source machine learning models with a cloud API

Find more jobs like this Jobs in the United States

Posted 1 month ago

The models team keeps Replicate’s model library stocked with the latest generative AI models. We make popular models fast, reliable, and easy to use. We also add features — things people ask for and things they didn’t know they needed.

We’re hiring an engineering manager to help lead this team of six to eight engineers working at the edge of open-source AI and high-performance computing. You’ll support the team, shape the technical direction, and stay close to the code. The team focuses on three things:

Turn research into APIs. We make it easy to package models with cog and run them on Replicate.
Make models faster. CUDA, quantization, parallelism — we use whatever works to make models run faster and cheaper.
Build new model features. This could mean training video models, adding inpainting or ControlNet-style conditioning, or inventing new ways to use models.

We build in the open. That means contributing upstream, releasing internal tools, and sharing what we learn.

What we’re looking for

You’re a strong leader. You bring energy and clarity. You help people do their best work. You like solving real problems and moving fast. If that sounds like you, we’d love to hear from you.

What you’ll do

Lead and grow a team that packages, optimizes, and improves generative models. Push model quality and performance every day.
Collaborate with other teams to improve performance, tooling, and usability across the platform. Represent and advocate for our customers as an internal user of our platform.
Bring momentum and clarity to projects: set goals, unblock the team, and keep things moving.
Work with company leadership to prioritize and align on strategy; help shape the technical roadmap — from ML tooling to infrastructure.
Give back to the Replicate community — contribute to open-source projects, and support the team in doing the same.

You should apply if…

You’ve helped teams grow and thrive, especially in fast-paced or startup environments. You mentor engineers and know how to scale a team.
You’ve worked in machine learning, data science, or adjacent fields. You know a bit about model optimization and performance.
You’re great at communication and project management. You bring order to ambiguity and keep the team focused.
You care about AI and want to build practical tools that help real people.
You want to make generative AI easier for developers and creators to use.
You’re part of the generative AI or open-source infrastructure community.

You’ll get to work on some of the most interesting problems in AI infrastructure — while contributing to the open-source communities that make this work possible.

This role can be remote anywhere in the US (or other countries that align with US time zones) or in-person. If you're local to the Bay Area, we would like you to work out of our San Francisco office at least 3 days a week.

Find more jobs like this Jobs in the United States

Job stats: 1 0 0

Categories: Engineering Jobs Leadership Jobs

Tags: APIs ControlNet CUDA Engineering Generative AI Generative modeling Machine Learning ML infrastructure Open Source Research