Senior Staff Software Engineer - Enterprise AI Platform
US, CA, Santa Clara, United States
â ď¸ We'll shut down after Aug 1st - try foođŚ for all jobs in tech â ď¸
Full Time Senior-level / Expert USD 168K - 322K
NVIDIA
NVIDIA on grafiikkasuorittimen keksijä, jonka kehittämät edistysaskeleet vievät eteenpäin tekoälyn, suurteholaskennan.NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Itâs a unique legacy of innovation thatâs fueled by great technologyâand amazing people. Today, weâre tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing whatâs never been done before takes vision, innovation, and the worldâs best talent. As an NVIDIAN, youâll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
NVIDIA is looking to hire a deeply technical, creative, and hands-on Senior Staff full-stack developer to build the next generation AI platforms and products that improve business efficiency and productivity. This engineer is expected to be familiar with concepts of RAG, agentic AI to be able to build AI products using SOTA agentic paradigms, third party platforms and open-source repos. As a key leader in our technology team, you will play a pivotal role in shaping the architecture, development, and scaling of our software systems. This role will give an opportunity to collaborate with Cloud, AI/ML & Generative AI workforce in a multifaceted and agile working environment, while meeting the immediate and evolving needs of our business.
What you will be doing:
Own the end-to-end lifecycle of software development, from concept to deployment, including architecture design, development, testing, and scaling
Understand internal micro-services, platforms, third party platforms and growing open-source code-repos to best leverage them during AI product development
Able to contribute to internal platforms and build re-usable components that can connect to enterprise data sources and power search, chatbots and other gen AI applications
Develop AI applications, platforms and systems enabling unified experience across applications and driving insights for end-to-end user experience
Build services that can support Inference, Training jobs, Ingestion Jobs
Understand the eco-system of data connectors and build secure AI applications which can access structured, unstructured data from a variety of databases at scale
Ensure system reliability, performance, and security at scale.
Help build and maintain our Continuous Delivery pipeline with the goal of moving changes to production faster and safer, while ensuring key operational standards.
Create and implement strategies to support business growth and technological advancements, ensuring flexibility and adaptability.
Provide peer reviews to other specialists including feedback on performance, scalability, and correctness.
Keep abreast of emerging trends and technologies in AI, software development, and system architecture.
Are a strong advocate of proven methods in software engineering and bring a detailed approach to testing, continuous delivery, and reducing technical debt.
What we need to see:
Bachelorâs or Masterâs degree in Computer Science, Engineering, or a related field, or equivalent experience.
8+ years of proven experience building sophisticated applications and APIs in On-prem, Cloud and hybrid cloud environments at large scale preferably in Python
Proven experience to build full stack applications including UI, backend, infrastructure
Proven expertise of performance, reliability in sophisticated distributed systems and the teams that build them
Strong proficiency in multiple programming languages and technologies relevant to AI and system development
Familiarity with gen AI application building, AI application deployments, Model Deployments (LLMs, Embeddings, Re-rankers, OCR etc)
Has delivered software with full understanding of deploying applications in Kubernetes clusters along with GPU and CPU pod scheduling (Ability to understand on Prem)
Proven track record to lead complex projects and deliver results in a fast-paced, multifaceted environment.
Extremely motivated, highly passionate, and curious about new technologies. Take pride in your work and strive to achieve incredible results and possess superb communication and planning skills.
Excellent leadership, problem-solving, analytical and communication skills, capable of inspiring and leading a technical team.
Ways to stand out from the crowd:
Experience enhancing enterprise efficiency and employee experience through the effective use of Generative AI based solutions.
Background with Kubernetes, Openshift, ML ops as well as experience with Model deployments (Inference, Training)
Self-motivation and a drive to get things to âdoneâ.
Excellent programming, debugging, performance analysis, and test design skills using python is a plus.
NVIDIA is widely considered to be one of the technology worldâs most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and enjoy learning while having fun, then what are you waiting for? Apply today!
#LI-Hybrid
The base salary range is 168,000 USD - 322,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.Tags: Agile APIs Architecture Chatbots Computer Science Distributed Systems Engineering Generative AI GPU Kubernetes LLMs Machine Learning OCR Open Source Python RAG Security Testing Unstructured data
Perks/benefits: Career development Equity / stock options
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.