Technical Product Manager - GenAI Inference Optimization

US, CA, Santa Clara

NVIDIA

NVIDIA erfindet den Grafikprozessor und fördert Fortschritte in den Bereichen KI, HPC, Gaming, kreatives Design, autonome Fahrzeuge und Robotik.

View all jobs at NVIDIA

Apply now Apply later

GenAI has unlocked significant capabilities for developers across industries, from coding copilots, to chatbots, or product recommendations. Each has significantly differing models, inputs, and constraints. Accelerating these models can be an ambitious and pivotal task for cost-effective deployment; good tooling is critical to a developer's success. At NVIDIA, we are crafting products to enable developers to understand and optimize their workloads across all of these use cases, as well as the ones that come next! As a Product Manager for Inference Optimization, you are the champion inside NVIDIA for developers looking to get the most out of GPUs for their specific needs. You will be driving cloud services & APIs that allow developers to articulate their unique requirements for many diverse applications, and learn how to best optimize the NVIDIA stack to suite their needs.

As Product Managers, we work directly with developers inside and outside of NVIDIA to identify key improvements, define a roadmap for availability, and stay alert on alternative solutions. In addition, these products are new to the space and will require a go-to-market strategy & clear product direction. The Product Management organization at NVIDIA is a small, strong, and impactful group. We focus on enabling deep learning across all GPU use cases and providing great solutions for developers. We are seeking a rare blend of product skills, technical depth, and passion for creating new technology. Does that sounds familiar? If so, we would love to hear from you!

What you'll be doing:

  • Create products to help developers analyze and improve their GenAI inference performance across the NVIDIA platform

  • Develop product strategy and go-to-market plans

  • Collaborate with internal and external GenAI developers to build product-based roadmaps for model optimization software

  • Work with NVIDIA leadership to align with and drive company strategy

What we need to see:

  • Experience working with API services for GenAI

  • Demonstrable knowledge of GenAI or machine learning concepts, particularly around performance optimization, and software development and delivery

  • BS or MS degree in Computer Science, Computer Engineering, or equivalent experience

  • 3+ years of technical product management, or similar, experience at a technology company

  • Strong communication and interpersonal skills

Ways to Stand Out from the crowd:

  • Experience working on emerging products and bringing them to market

  • Lead developer products & deep customer interactions

  • Knowledge of GPU architecture, HW/SW co-design, and performance profiling

The base salary range is 108,000 USD - 218,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Apply now Apply later
  • Share this job via
  • 𝕏
  • or

Tags: APIs Architecture Chatbots Computer Science Deep Learning Engineering Generative AI GPU Machine Learning

Perks/benefits: Career development Equity / stock options

Region: North America
Country: United States

More jobs like this