Engineering Manager - Models
San Francisco
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Full Time Mid-level / Intermediate USD 235K - 300K
Replicate makes it easy for software engineers to run and customize machine learning models in the cloud. With a library of thousands of open-source models, you can get started with one line of code—or fine-tune and deploy your own models when you need something custom. We handle the infrastructure, so you can focus on building.
Our team comes from places like Docker, GitHub, and NVIDIA, and we’re obsessed with making AI as intuitive as deploying a web app. We build in public, ship fast, and care about getting the details right.
The Models team keeps Replicate’s model library stocked with the latest generative AI models. We make popular models fast, reliable, and easy to use. We also add features — things people ask for and things they didn’t know they needed.
We’re hiring an Engineering Manager to help lead this team of six to eight engineers working at the edge of open-source AI and high-performance computing. You’ll support the team, shape the technical direction, and stay close to the code. The team focuses on three things:
Turn research into APIs. We make it easy to package models with cog and run them on Replicate.
Make models faster. CUDA, quantization, parallelism — we use whatever works to make models run faster and cheaper.
Build new model features. This could mean training video models, adding inpainting or ControlNet-style conditioning, or inventing new ways to use models.
We build in the open. That means contributing upstream, releasing internal tools, and sharing what we learn.
What we’re looking for
You’re a strong leader. You bring energy and clarity. You help people do their best work. You like solving real problems and moving fast. If that sounds like you, we’d love to hear from you.
What you’ll do
Lead and grow a team that packages, optimizes, and improves generative models. Push model quality and performance every day.
Collaborate with other teams to improve performance, tooling, and usability across the platform. Represent and advocate for our customers as an internal user of our platform.
Bring momentum and clarity to projects: set goals, unblock the team, and keep things moving.
Work with company leadership to prioritize and align on strategy; help shape the technical roadmap — from ML tooling to infrastructure.
Give back to the Replicate community — contribute to open-source projects, and support the team in doing the same.
You should apply if…
You’ve helped teams grow and thrive, especially in fast-paced or startup environments. You mentor engineers and know how to scale a team.
You’ve worked in machine learning, data science, or adjacent fields. You know a bit about model optimization and performance.
You’re great at communication and project management. You bring order to ambiguity and keep the team focused.
You care about AI and want to build practical tools that help real people.
You want to make generative AI easier for developers and creators to use.
You’re part of the generative AI or open-source infrastructure community.
You’ll get to work on some of the most interesting problems in AI infrastructure — while contributing to the open-source communities that make this work possible.
This is a hybrid role and requires you to work from our office in the Mission District, San Francisco at least 3 days a week.
U.S. Benefits*
100% paid coverage for medical, dental, vision, long-term disability, and life insurance
Company-paid laptop and work setup — use your expense card for anything you need to be happy and productive (screen, desk, chair, etc.)
Medical and Dependent Care FSAs
12 weeks of paid parental leave
401k through Guideline
Pre-tax commuter benefits
Unlimited paid time off (we encourage at least 20 days off each year)
*We do our best to match our international benefits to our U.S. benefits
Equal Opportunity Statement
Replicate is committed to diversity in its workforce and is proud to be an equal opportunity employer. Replicate considers qualified applicants without regard to race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other legally protected class.
Replicate is an equal employment opportunity employer offering opportunities to all job seekers, including individuals with disabilities. If you believe you need a reasonable accommodation in order to search for a job opening or to apply for a position, please contact us by sending an email to careers@replicate.com.
Tags: APIs ControlNet CUDA Docker Engineering Generative AI Generative modeling GitHub Machine Learning ML infrastructure ML models Open Source Research
Perks/benefits: 401(k) matching Career development Gear Health care Insurance Medical leave Parental leave Startup environment Unlimited paid time off
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.