Senior Developer - Data Center Server Management
Poland, Remote
NVIDIA
NVIDIA on grafiikkasuorittimen keksijä, jonka kehittämät edistysaskeleet vievät eteenpäin tekoälyn, suurteholaskennan.NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the most thoughtful people in the world.NVIDIA GH200 superchip provides performance and productivity required for strong scaling for HPC and generative AI workload. Scale out is inherent to design of this massive superchip. We are looking for skilled software engineers to help implement firmware and software components for next generation AI supercomputing platforms.We are looking for a strong senior developer to implement manageability components for these products in data centers. You will collaborate with various teams, understand customer requirements, and develop robust solutions to drive our products to market.
What you'll be doing:
Develop and optimize server management software for GPU and Grace solutions in large clusters
Implement firmware and software components based on performance requirements and architecture specifications
Collaborate with data center architects to understand requirements and ensure timely implementation
Work with cross-functional teams to align implementation with design requirements
Optimize firmware components for reliability in data center environments
Support cluster validation and resolve technical issues efficiently
Contribute to quality, reliability and telemetry performance of firmware delivered to data centers
What we need to see:
5+ years of relevant experience working on server firmware (BMC) and platform software development with BS, MS, or PhD in EE/CS or related field
Experience with data center health management implementation
Track record of delivering server firmware components
Knowledge of server architecture and manageability in data centers
Understanding of hardware management interfaces (USB, SMBus/I2C, PCIe) and familiarity with modern management protocols including Redfish, MCTP, and PLDM
Strong proficiency in C/C++ and Python
Strong programming and debugging skills for server platforms
Experience with SCM (e.g. Git, Perforce) and project management tools like Jira
Excellent written and oral communication skills, good work ethics, team-oriented mentality, and dedication to quality work
Self-starter who can solve sophisticated technical problems with effective coding solutions
Ways to stand out from the crowd:
Familiarity with x86 or ARM system architecture
Experience collaborating effectively within large engineering teams
Background with performance optimization in firmware components
Experience with RTOS and bare metal programming
Linux kernel and user space development experience
NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology. We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture Deep Learning Engineering Generative AI Git GPU HPC Jira Linux PhD Python
Perks/benefits: Career development Health care Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.