Principal Software Engineer, ML Infrastructure

San Francisco, CA (Hybrid)

Apply now Apply later

Tubi is a global entertainment company and the most watched free TV and movie streaming service in the U.S. and Canada. Dedicated to providing all people access to all the world’s stories, Tubi offers the largest collection of on-demand content, including over 250,000 premium movies and TV episodes and over 300 exclusive originals. With a passionate fanbase and over 80 million monthly active viewers, the company is committed to putting viewers first with free, accessible entertainment for all.

About the Team:

The Machine Learning Infrastructure team is where cutting-edge technology meets real-world impact. We build a powerful, scalable foundation for our content recommendation engine, directly influencing the user experience. Working closely with ML, Product, and Data Science, we solve complex technical challenges to deliver innovative ML-powered solutions that delight millions.

About the Role:

As a Principal Engineer on the ML Infrastructure team, you will be a technical leader and visionary, driving the evolution of our machine learning platform. You will tackle the most complex and impactful technical challenges, shaping the architecture and technology choices that enable our ML capabilities to scale and deliver exceptional user experiences. You will be a key influencer, bridging the gap between engineering and product, and a mentor to senior engineers, fostering a culture of technical excellence and continuous improvement. Your work will be used by millions of users.

Responsibilities:

Technical Leadership & Strategy:

  • Vision & Influence: Define and champion the long-term vision for ML infrastructure, aligning it with company goals and industry best practices. Influence technical direction and technology selection across the ML platform
  • Strategic Roadmap: Develop and maintain a roadmap (6-12 months) for the ML Infra team, anticipating future needs and proactively addressing emerging trends
  • Innovation & Optimization: Identify opportunities to improve ML infrastructure efficiency, scalability, and performance. Research and advocate for new technologies and approaches to optimize the ML development lifecycle

Architecture, Design & Engineering:

  • System Design: Lead the architecture and design of complex ML systems, ensuring scalability, reliability, security, and maintainability
  • Distributed Systems Expertise: Design and build scalable, high-throughput, and/or low-latency distributed systems using Scala and related technologies. This includes expertise in areas like distributed databases, message queues, and stream processing
  • Quality & Standards: Champion and enforce engineering best practices, including code quality, testing, and documentation. Contribute to the development and implementation of ML infrastructure standards

Problem Solving & Delivery:

  • Technical Problem Solving: Resolve critical and complex technical challenges related to ML infrastructure, demonstrating expertise in debugging, performance optimization, and system troubleshooting
  • Project Execution: Lead and deliver complex ML infrastructure projects, effectively managing scope, timelines, and dependencies. Mentor engineers on project management best practices
  • Collaboration & Mentorship: Foster a collaborative environment and provide technical mentorship to other engineers, enabling their growth and development

Communication & Collaboration:

  • Cross-functional Partnership: Collaborate effectively with data scientists, ML engineers, and product managers to understand their needs and translate them into infrastructure solutions
  • Stakeholder Management: Communicate effectively with stakeholders at all levels, including senior leadership. Clearly articulate technical concepts, progress updates, and roadblocks
  • Knowledge Sharing: Promote knowledge sharing and best practices across the organization through documentation, presentations, and mentorship

Qualifications:

  • 10+ years of experience in software engineering, with a significant focus on building and scaling large-scale distributed systems
  • Bachelor's or Master's degree in Computer Science or a related field
  • Proven experience as a technical leader, architecting and designing complex systems, preferably in the ML infrastructure domain
  • 5+ years of experience with databases, caching technologies, and message brokers
  • Expertise in Scala, Java, Python programming languages
  • Extensive experience with cloud platforms (preferably AWS)

Bonus:

  • Experience in the media or streaming industry
  • Contributions to open-source projects related to ML infrastructure

Pursuant to state and local pay disclosure requirements, the pay range for this role, with final offer amount dependent on education, skills, experience, and location is listed annually below. This role is also eligible for an annual discretionary bonus, long-term incentive plan, and various benefits including medical/dental/vision, insurance, a 401(k) plan, paid time off and other benefits in accordance with applicable plan documents.

California Compensation

Base ($246,000 to $352,000 / year) + Bonus + Long-Term Incentive Plan + Benefits

#LI-MQ1 #LI-Hybrid 

Tubi is a division of Fox Corporation, and the FOX Employee Benefits summarized here, covers the majority of all US employee benefits.  The following distinctions below outline the differences between the Tubi and FOX benefits:

  • For US-based non-exempt Tubi employees, the FOX Employee Benefits summary accurately captures the Vacation and Sick Time.
  • For all salaried/exempt employees, in lieu of the FOX Vacation policy, Tubi offers a Flexible Time off Policy to manage all personal matters.
  • For all full-time, regular employees, in lieu of FOX Paid Parental Leave, Tubi offers a generous Parental Leave Program, which allows parents twelve (12) weeks of paid bonding leave within the first year of birth, adoption, surrogacy, or foster placement of a child in addition to applicable government leave program(s) and FOX’s short-term disability policy. This time is 100% paid through a combination of any applicable state, city, and federal leaves and wage-replacement programs in addition to contributions made by Tubi.
  • For all full-time, regular employees, Tubi offers a monthly wellness reimbursement.

Tubi is proud to be an equal opportunity employer and considers qualified applicants without regard to race, color, religion, sex, national origin, ancestry, age, genetic information, sexual orientation, gender identity, marital or family status, veteran status, medical condition, or disability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider employment for qualified applicants with arrest and conviction records. We are an E-Verify company.

Apply now Apply later
Job stats:  0  0  0

Tags: Architecture AWS Computer Science Distributed Systems Engineering Java Machine Learning ML infrastructure Open Source Python Research Scala Security Streaming Testing

Perks/benefits: Career development Flex hours Flex vacation Health care Insurance Medical leave Parental leave Salary bonus Startup environment Wellness

Region: North America
Country: United States

More jobs like this