Head of Data Science & Engineering, Data-as-a-Service

Redwood City, CA (Hybrid); San Francisco, CA (Hybrid)

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Snorkel AI

Snorkel AI powers specialized AI with expert data—accelerate evaluation, training, and deployment with our full AI data development platform.

View all jobs at Snorkel AI

Apply now Apply later

About Snorkel

At Snorkel, we believe meaningful AI doesn’t start with the model, it starts with the data.

We’re on a mission to help enterprises transform expert knowledge into specialized AI at scale. The AI landscape has gone through incredible changes between 2015, when Snorkel started as a research project in the Stanford AI Lab, to the generative AI breakthroughs of today. But one thing has remained constant: the data you use to build AI is the key to achieving differentiation, high performance, and production-ready systems. We work with some of the world’s largest organizations to empower scientists, engineers, financial experts, product creators, journalists, and more to build custom AI with their data faster than ever before. Excited to help us redefine how AI is built? Apply to be the newest Snorkeler!

About the Role

Snorkel AI is hiring a hands-on technical leader to build and lead the machine learning team within our Expert Data-as-a-Service organization. This is a foundational leadership role focused on scaling machine learning and data science excellence across our delivery workflows. You’ll be responsible for hiring and mentoring a high-performing team, establishing best practices for data quality and validation, designing novel ML approaches to make human-in-the-loop techniques to make data generation and review more efficient. You will build the systems and tools that enable consistent, scalable, and high-quality data delivery to our customers.

Sitting at the critical intersection of machine learning and operations, you’ll partner closely with the Delivery Operations team and cross-functional stakeholders to define quality standards, develop measurement frameworks, drive ML-based workflows to improve data pipelines and unblock projects through technical innovation. As the founding member, you’ll also roll up your sleeves to define and own the workflows and processes that are needed to deliver exceptional data at scale.

Main Responsibilities

  • Build and lead the Data Science and Engineering, Data-as-a-Service organization setting a clear vision and scaling its impact across Snorkel’s Expert Data-as-a-Service workflows
  • Own and evolve the ML components of the DaaS stack, including model-assisted labeling, quality estimation, and data-centric feedback loops that guide human input
  • Define and implement scalable processes for data generation and validation, quality measurement, and delivery-readiness across a range of annotation projects
  • Develop robust systems for request intake, task distribution, SLA tracking, and progress monitoring—ensuring critical delivery support doesn’t fall through the cracks
  • Prototype and deploy LLM-based workflows to assess annotation quality, augment human review and data generation, and accelerate delivery timelines
  • Collaborate cross-functionally with research and engineering teams to develop and productionize HITL data generation methods, quality techniques and improve internal delivery tooling
  • Drive continuous improvement by developing reusable workflows, surfacing operational insights, and helping the org scale faster with higher quality

 

What We’re Looking For

  • 6+ years of experience in machine learning roles, with 2+ years in technical management positions
  • Proven track record of managing technical teams in fast-paced, delivery-focused environments with competing priorities
  • Experience as a player-coach—comfortable being hands-on while supporting and scaling a small team
  • Proven ability to thrive in fast-paced, ambiguous environments with cross-functional stakeholders
  • Strong practical experience with LLM-based workflows, Python, SQL, and data tooling (e.g., pandas, Plotly, Streamlit, Dash)
  • Bonus: experience working with labeling workflows or internal tooling for data delivery orgs

Why Join Snorkel AI?

At Snorkel AI, we're building the future of data-centric AI. Our Expert Data-as-a-Service organization partners with world-class customers to solve some of the hardest data challenges — creating training and evaluation data that power the next generation of LLMs and AI systems. You'll work directly on projects that impact real production systems, while shaping how internal teams deliver faster, better, and more intelligently. This is a rare opportunity to found a function and have a broad, lasting impact across the company.

The salary range for this position based in the San Francisco Bay Area is $240,000 - $300,000.  All offers include equity compensation in the form of employee stock options.

#LI-CG1

Be Your Best at Snorkel

Joining Snorkel AI means becoming part of a company that has market proven solutions, robust funding, and is scaling rapidly—offering a unique combination of stability and the excitement of high growth. As a member of our team, you’ll have meaningful opportunities to shape priorities and initiatives, influence key strategic decisions, and directly impact our ongoing success. Whether you’re looking to deepen your technical expertise, explore leadership opportunities, or learn new skills across multiple functions, you’re fully supported in building your career in an environment designed for growth, learning, and shared success.

Snorkel AI is proud to be an Equal Employment Opportunity employer and is committed to building a team that represents a variety of backgrounds, perspectives, and skills. Snorkel AI embraces diversity and provides equal employment opportunities to all employees and applicants for employment. Snorkel AI prohibits discrimination and harassment of any type on the basis of race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local law. All employment is decided on the basis of qualifications, performance, merit, and business need.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Apply now Apply later
Job stats:  1  0  0

Tags: Data pipelines Data quality Engineering Generative AI LLMs Machine Learning Pandas Pipelines Plotly Python Research SQL Streamlit

Perks/benefits: Career development Equity / stock options Salary bonus Startup environment

Region: North America
Country: United States

More jobs like this