Senior Inference Technical Product Marketing Manager - Accelerated Computing

US, CA, Santa Clara, United States

NVIDIA

NVIDIA on grafiikkasuorittimen keksijä, jonka kehittämät edistysaskeleet vievät eteenpäin tekoälyn, suurteholaskennan.

View all jobs at NVIDIA

Apply now Apply later

We are looking for a Senior Technical Product Marketing Manager. This role will be located in our rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model architectures and deployment techniques (parallelisms, configurations, etc.). You will influence NVIDIA’s entire technical marketing strategy to showcase our leadership position in AI inference.

Want to join a fun, creative company that is at the forefront of outstanding Generative AI technologies? NVIDIA is developing groundbreaking solutions in some of the world’s most exciting areas including artificial intelligence and high performance computing. Come grow your career to new heights at one of the fastest growing technology companies!

What You’ll Be Doing:

  • Help drive NVIDIA’s inference platform technical go-to-market efforts

  • Work closely with engineering and product management teams to understand key technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model architectures and deployment techniques (e.g.parallelisms, configurations, etc.)

  • Diligently review and remain up to date on model architectures, frameworks, arxiv papers, whitepapers deployment techniques (e.g.disaggregated serving, KV cache implementations) and identify intersection points between the latest AI models and NVIDIA’s platform to maximize performance and minimize TCO

  • Develop crisp clear positioning, messaging and assets to highlight NVIDIA’s leadership position in inference. Assets (blogs, whitepapers, presentations, analyst briefings, seminars at developer conferences)

  • Closely follow competitive inference announcements and prepare appropriate responses for business and technical/developer audiences

  • Assist on building keynote slides for executives for areas that you’re a subject matter expert

What We Need to See:

  • A BS Degree in Computer Science or Engineering or related field or equivalent experience in a technical product marketing role; Masters Degree preferred.

  • 6+ years of experience in LLM, AI/ML development in an engineering role followed by 5+ years of experience in product management or technical product marketing of AI/ML products

  • Deep understanding of modern data center architectures, accelerated computing, distributed inference, deep learning frameworks (PyTorch, TensorFlow, JAX), and inference-specific frameworks & optimizations (Dynamo, Triton Inference Server, TensorRT-LLM, vLLM, SGLang)

  • Market Awareness – Experience conducting technical competitive analysis and synthesizing key insights

  • Collaboration & Influence – Proven ability to work cross-functionally across engineering, product management, sales, and marketing teams

  • Strong Communication, Asset Creation & Storytelling – Ability to translate sophisticated technical concepts into clear, compelling narratives for both technical and business audiences

  • Ability to present to executive audiences

Ways to Stand Out from the crowd:

  • Hands-on experience with AI inferencing workflows using NVIDIA or open-source serving frameworks running on accelerated computing in the data center

  • Experience developing LLM models

  • Experience working with hyperscale cloud providers

  • Hands-on Technical Competence – Background in software development, AI infrastructure, data center silicon

  • Demonstrated ability to engage with executive leadership and external partners

  • Published technical content or speaking experience at industry events

  • Have a portfolio of published marketing/launch assets

NVIDIA is widely considered to be one of high technology's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Our goal is to craft an environment where you can do your life's best work. If you're creative, self-motivated, and autonomous, we want to hear from you!

#LI-Hybrid

The base salary range is 144,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Apply now Apply later
Job stats:  1  0  0

Tags: Architecture Computer Science CUDA Deep Learning Engineering Generative AI HPC JAX LLMs Machine Learning ML infrastructure Open Source PyTorch TensorFlow TensorRT vLLM

Perks/benefits: Career development Competitive pay Conferences Equity / stock options Team events

Region: North America
Country: United States

More jobs like this