Software Engineer, Fleet Health Instrumentation Intern - Fall 2025
US, CA, Santa Clara, United States
Applications have closed
NVIDIA
NVIDIA on grafiikkasuorittimen keksijĂ€, jonka kehittĂ€mĂ€t edistysaskeleet vievĂ€t eteenpĂ€in tekoĂ€lyn, suurteholaskennan.Our work at NVIDIA is dedicated towards a computing model focused on visual and AI computing. For two decades, NVIDIA has pioneered visual computing, the art and science of computer graphics, with our invention of the GPU. The GPU has also shown to be spectacularly effective at solving some of the most complex problems in computer science. Today, NVIDIAâs GPU simulates human intelligence, running deep learning algorithms and acting as the brain of computers, robots and self-driving cars that can perceive and understand the world. We are looking to grow our company and teams with the smartest people in the world and there has never been a more exciting time to join our team! Join the opportunity to design, prototype, and ship high-impact features that keep NVIDIA's GPU-accelerated platforms running smoothly at global scale.
Youâll enter the same engineering culture that powers NVIDIAâs services, applying modern software practicesâfrom service design and development to system instrumentation and data-pipeline engineering. Our internship focuses on writing robust, performant code (Golang / Python) and automating everything that can be automated, so NVIDIAâs cloud offerings deliver world-class reliability.
What you will do:
Design and build software that collects, transforms, and publishes health data about our global GPU fleet.
Develop micro-services and data pipelines in Go or Python that ingest and normalize data from many diverse sourcesârouting millions of records per day (Kafka, Airflow, Kinesis).
Instrument production infrastructure and workloads running on Kubernetes and bare-metal clusters; add tracing and metrics hooks for deeper insights.
Automate deployments and testing with CI/CD (GitLab, Argo) and IaC (Terraform), ensuring repeatable, low-touch releases.
Participate in the full lifecycle of cloud servicesâfrom design docs and code reviews through deployment, monitoring, and continuous improvement.
Collaborate with other engineers to debug live issues and turn post-incident insights into durable code fixes.
Contribute to internal tooling and dashboards that help engineers visualize fleet health, utilization, and capacity trends.
What we need to see:
Actively pursuing a BS or MS in Computer Science, Computer Engineering, or a closely related quantitative field (e.g., Physics or Mathematics).
Solid understanding of distributedâsystems fundamentals, modern softwareâengineering practices, and dataâmodeling principles.
Proficiency in at least one programming languageâpreferably Python or Go.
Working knowledge of Linux, basic networking concepts, and Kubernetes container orchestration.
Ways to stand out from the crowd:
A systematic, analytical problemâsolving approach paired with clear written and verbal communication skills and a strong sense of ownership.
Demonstrated ability to debug, optimize, and automate code or workflows with minimal guidance.
Handsâon experience building, deploying, and operating services in a publicâcloud or large onâprem environment.
NVIDIA is widely considered to be one of the technology worldâs most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
The hourly rate for our interns is 18 USD - 71 USD. Our internship hourly rates are a standard pay determined based on the position and your location, year in school, degree, and experience.You will also be eligible for Intern benefits. NVIDIA accepts applications on an ongoing basis. â
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.Tags: Airflow CI/CD Computer Science Data pipelines Deep Learning Engineering GitLab Golang GPU Kafka Kinesis Kubernetes Linux Mathematics Physics Pipelines Python Terraform Testing
Perks/benefits: Career development Health care
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.