Senior / Principal Recommendations Infrastructure Engineer - ML Platform
San Mateo, CA, United States
Roblox
Roblox is the ultimate virtual universe that lets you create, share experiences with friends, and be anything you can imagine. Join millions of people and discover an infinite variety of immersive experiences created by a global community!
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
As a Senior / Principal Recommendations Infrastructure Engineer on ML Platform, you will build the next generation of ML ecosystem tooling for recommendation systems. ML Platform today supports billions of requests per day across our homepage, marketplace, economy, and more. We are looking for accomplished engineers to help build out this tooling in a rapidly evolving space.
You Are:
- An engineer with 4+ years of professional experience and a tool chest of system design experience to draw on when building scalable, reliable platforms for all of Roblox.
- Significantly experienced in running large-scale recommendation systems that recommend hundreds of millions of items to millions of users.
- Experienced in building complex distributed systems that scale to real-time ML inference serving millions of QPS, particularly for real-time recommendation systems.
- Passionate about supporting and working cross-functionally with internal partners (Data Scientists and ML Engineers) to understand and meet their needs.
- A reliability nut: you love digging into tricky postmortems and identifying and fixing weaknesses in complicated systems.
- Ideally familiar with ML model inference frameworks like Triton Inference Server, TensorRT, KServe.
- A holder of a Bachelor's degree or higher in Computer Science, Computer Engineering, Data Science, or a similar technical field.
You Will:
- Set technical strategy and oversee development of high-scale, reliable infrastructure for recommender systems, especially as we scale up both inference QPS and model size.
- Dig into performance bottlenecks all along the recommendation inference stack, spanning from model optimizations to infrastructure optimizations.
- Stay abreast of industry trends in machine learning and infrastructure to ensure the adoption of leading-edge technologies and practices.
- Bootstrap and maintain infrastructure for ML Platform components: Serving Layer, Metadata Store, Model Registry, and Pipeline Orchestrator.
- Partner across organizations to build tooling, interfaces, and visualizations that make ML@Roblox a delight to use.
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits.
Annual Salary Range: $233,840–$283,780 USD
Roles that are based in our San Mateo, CA Headquarters are in-office Tuesday, Wednesday, and Thursday, with optional in-office on Monday and Friday (unless otherwise noted).
You’ll Love:
- Industry-leading compensation package
- Excellent medical, dental, and vision coverage
- A rewarding 401k program
- Flexible vacation policy (varies by exemption status)
- Roflex - Flexible and supportive work policy
- Roblox Admin badge for your avatar
- At Roblox HQ:
  - Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
  - Onsite fitness center and fitness program credit
  - Annual Caltrain Go Pass
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations for all candidates during the interview process.
Tags: Computer Science, Distributed Systems, Engineering, KServe, Machine Learning, Model inference, Recommender systems, TensorRT
Perks/benefits: Career development, Equity / stock options, Flex hours, Flex vacation, Health care, Unlimited paid time off