Principal GenAI Data Infrastructure Engineer, 3D Foundation Model
San Mateo, CA, United States
Roblox
Roblox is the ultimate virtual universe that lets you create, share experiences with friends, and be anything you can imagine. Join millions of people and discover an infinite variety of immersive experiences created by a global community! Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and we’re looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
As a Principal GenAI Data Infrastructure Engineer on the Data Infrastructure team for Roblox Cube, you will be responsible for building the foundational data systems that enable the creation and training of our cutting-edge 3D and 4D generative AI models. You will design, implement, and scale robust, high-performance infrastructure to crawl, curate, store, and serve the massive datasets required for these models. We are seeking accomplished software engineers with a passion for data, experience building large distributed systems, and a commitment to writing high-quality, well-tested code to solve complex data challenges at scale.
You Will:
- Design, build, and own critical components of the data infrastructure supporting our generative AI efforts, focusing on reliability, scalability, and performance.
- Develop sophisticated systems for crawling and extracting diverse data from the Roblox platform, including complex 3D asset data and 4D functional data.
- Implement robust pipelines and tooling for cleaning, transforming, and curating petabyte-scale datasets to meet the specific needs of multimodal AI model training and evaluation.
- Design and optimize data storage solutions that provide efficient, high-throughput access patterns for distributed model training workloads.
- Collaborate closely with ML Engineers and Data Scientists to understand their data requirements, build tooling that streamlines their data workflows, and troubleshoot data-related system issues.
- Drive improvements in data infrastructure architecture, engineering best practices, and operational excellence.
- Ensure data quality and governance are implemented at the system level through automated checks and monitoring.
You:
- Have 8+ years of professional experience and a tool chest of system design experience to draw on when building scalable, reliable platforms for all of Roblox.
- Have significant experience working with and processing very large datasets (petabytes or more).
- Are passionate about the potential of generative AI, particularly in creative domains like 3D/4D content.
- Thrive in building complex, high-scale distributed systems from the ground up.
- Excel at writing clean, efficient, well-tested code in languages like Python, C++, or Go.
- Have experience with cloud data platforms and distributed processing technologies (e.g., Spark, Ray, Kubeflow, S3).
- Value collaboration and enjoy working closely with cross-functional teams, especially researchers and ML engineers.
- Are comfortable operating in a dynamic, fast-paced research environment where challenges and requirements can evolve.
- Hold a Bachelor's degree or higher in Computer Science, Computer Engineering, Data Science, or a similar technical field.
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits.
Annual Salary Range: $289,460–$338,270 USD
Roles that are based in our San Mateo, CA Headquarters are in-office Tuesday, Wednesday, and Thursday, with optional in-office on Monday and Friday (unless otherwise noted).
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations for all candidates during the interview process.