Senior Global Business Development Manager – AI Inference Infrastructure Services

US, CA, Santa Clara, United States

NVIDIA

NVIDIA on grafiikkasuorittimen keksijä, jonka kehittämät edistysaskeleet vievät eteenpäin tekoälyn, suurteholaskennan.

View all jobs at NVIDIA

Apply now Apply later

NVIDIA is the engine of modern artificial intelligence, the biggest technology breakthrough of our time, and is transforming industries. We are seeking a Global Business Development to join our outstanding team, preferably in Santa Clara, CA, USA, to support our Telecom NVIDA cloud partners (NCP) build AI Inference Infrastructure/Services leveraging NVIDIA Reference Architecture and technologies like NIMs, TensorRT, Triton and Dynamo.


You will have a vital role to create a new GenAI inference infrastructure and platform services to expand the use of our innovative technology. You will have a key role to evangelize AI-RAN and enable a common AI distributed infrastructure for both 5G/6G and AI workloads.


What you’ll be doing:

  • Drive new inference infrastructure investment with Telco NVIDIA cloud partners (NCP).

  • Enable new inference PaaS services with NCPs including Token as a Service and Serverless/Cloud Function/API services.

  • Explore inference deployment strategies to deploy services across distributed and edge data centers.

  • Engage NVIDIA Ecosystem of ISV applications to drive services demand.

  • Create, articulate, and present the business/technical value proposition for the above; identify and prioritize key partners and work to educate ecosystem partners as a multiplier.


What we need to see:

  • 12+ years of experience with Cloud business models, technology, value proposition and ecosystem.

  • Progressive experiences across Cloud and AI.

  • Technical background and experience in building and using cloud services.

  • Quality experience in collaborating with partner ecosystems.

  • Ability to present technical information both to engineering and to the C-Suite.

  • Bachelor’s degree from a leading University or equivalent experience.

  • Willingness to travel approximately 40% of the time.


Ways to stand out from the crowd:

  • Work experience with specialized AI GPU Cloud providers, NVIDIA cloud Partners (NCP) and  AI “Neoclouds”.

  • Work experience with AI Inference and cloud stack. 

  • Familiarity of NVIDIA AI Software platform (ex. NEMO, NIMs, Triton, Dynamo, Lepton and NVCF).

  • Understanding of RAN virtualization, ORAN technology and AI RAN alliance.

  • Master's degree and/or MBA is preferred, although equivalent experience will also be considered.

NVIDIA is widely considered to be one of the technology world’s most desirable employers! We have a diverse group of technology and industry specialists tackling today’s most exciting problems. If you are partner focused, technically astute, creative and driven, we want to hear from you!

The cash compensation range is 224,000 USD - 356,500 USD, with 85% paid through base salary and 15% variable compensation. Your cash compensation will be determined based on your location, experience and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Apply now Apply later
Job stats:  0  0  0

Tags: APIs Architecture Engineering Generative AI GPU TensorRT

Perks/benefits: Equity / stock options

Region: North America
Country: United States

More jobs like this