Machine Learning Engineer III

Waltham, Massachusetts, United States

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

ZoomInfo

It’s our business to grow yours! Own your market with leading B2B contact data combined with sales intelligence, engagement software, and workflow tools.

View all jobs at ZoomInfo

Apply now Apply later

At ZoomInfo, we encourage creativity, value innovation, demand teamwork, expect accountability and cherish results. We value your take charge, take initiative, get stuff done attitude and will help you unlock your growth potential. One great choice can change everything. Thrive with us at ZoomInfo.

About ZoomInfo

ZoomInfo is building the next generation go-to-market platform using high-quality GTM data, agentic workflows, and a robust intelligence layer to give sales, marketing, and revenue operations teams a competitive advantage.

About the Applied AI Team

The Applied AI team builds the intelligence layer that sits between ZoomInfo's high-quality data and the application layer through which customers engage. Using a product-led growth model, this team leverages customer engagement as input to build better recommendations, scoring, classification, and generative models.

 

What you will do :

Foundation Data Quality Enhancement

  • Improve data quality for ZoomInfo's foundation datasets including firmographics, demographics, C-suite profiles, workforce information, titles, skill sets, scoops, intent signals, and web-extracted data
  • Design and implement data validation pipelines and quality metrics to ensure high-fidelity information across millions of records

Embedding and Model Development

  • Build and fine-tune embedding models using large language models (Llama) and small language models (*BERT*) for various text understanding tasks
  • Develop language-agnostic clustering and classification models using vector search technologies
  • Optimize embedding models for production deployment at petabyte scale

Named Entity Recognition & Data Extraction

  • Build high-recall NER models to extract people, organizations, locations, and industry-specific entities from web-extracted data
  • Develop robust data extraction pipelines that process diverse web content and structure unstructured information

Agentic Workflows & Evaluation

  • Design and implement agentic workflows focused on web extraction, NER, and entity resolution
  • Create comprehensive evaluation frameworks for agent performance and reliability
  • Collaborate on agent optimization and performance tuning

Scalable Production Systems

  • Deploy and maintain ML models serving millions of users daily with sub-second latency requirements
  • Work with engineering teams to ensure models integrate seamlessly into ZoomInfo's platform architecture
  • Monitor model performance and implement automated retraining pipelines to design cost-aware training & inference workflows
  • Use integrated CI/CD and testing workflows for seamless deployment

Cross-Functional Collaboration & Prototyping

  • Partner with product managers and engineering teams to translate business requirements into ML solutions
  • Prototype and benchmark emerging AI/infra tech
  • Present findings and technical solutions to stakeholders across the organization

What you bring:

Experience & Education

  • 3 - 5 years (1+ years post-PhD) of hands-on ML/NLP experience with demonstrated impact on production systems. Preference for masters and background in Computer Science and other allied data science/engineering disciplines.
  • Strong background in transformer architectures, embedding models, and vector search technologies
  • Experience with named entity recognition, summarization and data extraction at scale is a plus

Technical Skills

  • Proficiency in PyTorch or TensorFlow for model development and fine-tuning
  • Experience with vector databases (Pinecone, Weaviate, FAISS, OpenSearch) and hybrid retrieval systems
  • Strong software engineering skills in Python; familiarity with Go/Java is a plus
  • Knowledge of MLOps tools: Docker, Kubernetes, GitOps, feature stores, model registries

Applied AI Expertise

  • Hands-on experience with LLM fine-tuning techniques (LoRA, quantization, distillation) is a plus
  • Understanding of agentic workflows and multi-agent systems
  • Experience building language-agnostic ML solutions and cross-lingual models
  • Knowledge of entity resolution and knowledge graph concepts

Collaboration & Communication

  • Ability to work effectively in cross-functional teams and communicate technical concepts to non-technical stakeholders
  • Experience mentoring junior team members and contributing to team knowledge sharing
  • Strong problem-solving skills and ability to work independently with guidance from team leads

Preferred Qualifications

  • Experience processing large-scale unstructured data
  • Background in information retrieval and search systems
  • Familiarity with MLOps concepts, A/B testing and experimental design for ML systems
  • Knowledge of data quality frameworks and validation methodologies

 

#LI-SK

#LI-Hybrid

Actual compensation offered will be based on factors such as the candidate’s work location, qualifications, skills, experience and/or training. Your recruiter can share more information about the specific salary range for your desired work location during the hiring process. We want our employees and their families to thrive.

In addition to comprehensive benefits we offer holistic mind, body and lifestyle programs designed for overall well-being. Learn more about ZoomInfo benefits here.

Below is the US base salary for this position. Additional compensation such as Bonus, Commission, Equity and other benefits may also apply.$153,600—$211,200 USD

About us: 

ZoomInfo (NASDAQ: GTM) is the Go-To-Market Intelligence Platform that empowers businesses to grow faster with AI-ready insights, trusted data, and advanced automation. Its solutions provide more than 35,000 companies worldwide with a complete view of their customers, making every seller their best seller.

ZoomInfo may use a software-based assessment as part of the recruitment process. More information about this tool, including the results of the most recent bias audit, is available here.

ZoomInfo is proud to be an equal opportunity employer, hiring based on qualifications, merit, and business needs, and does not discriminate based on protected status. We welcome all applicants and are committed to providing equal employment opportunities regardless of sex, race, age, color, national origin, sexual orientation, gender identity, marital status, disability status, religion, protected military or veteran status, medical condition, or any other characteristic protected by applicable law. We also consider qualified candidates with criminal histories in accordance with legal requirements.

 

For Massachusetts Applicants: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. ZoomInfo does not administer lie detector tests to applicants in any location.

Apply now Apply later
Job stats:  1  1  0

Tags: A/B testing Architecture BERT CI/CD Classification Clustering Computer Science Data quality Docker Engineering FAISS Generative modeling Java Kubernetes LLaMA LLMs LoRA Machine Learning ML models MLOps NLP OpenSearch PhD Pinecone Pipelines Prototyping Python PyTorch TensorFlow Testing Unstructured data Weaviate

Perks/benefits: Career development Competitive pay Equity / stock options Salary bonus

Region: North America
Country: United States

More jobs like this