Enterprise Software Test Development Engineer

Taiwan, Taipei

NVIDIA

NVIDIA erfindet den Grafikprozessor und fördert Fortschritte in den Bereichen KI, HPC, Gaming, kreatives Design, autonome Fahrzeuge und Robotik.

View all jobs at NVIDIA

Apply now Apply later

NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. NVIDIA is also well positioned as the ‘AI Computing Company’, and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some of the most experienced and dedicated people in the world working for us. If you are dedicated, forward-thinking, and hard-working technical people across countries sounds exciting, this job is for you.

NVIDIA is looking for an outstanding individual who thrives in a diverse work environment, has outstanding interpersonal skills and possesses a strong sense of engagement and continuous process improvement. This candidate must have enterprise system integration, strong OS/FW experience, reliability testing with various telemetries, test plan development, CI/CD and DevOps experience to join our Enterprise Server QA team.

What you’ll be doing:

  • Responsible for the development and execution of NVIDIA HGX/DGX platform test plan on OS, FW and CUDA SW stack from design doc.

  • Installing and testing various systems OS, system firmware and software stack including Windows & Linux

  • Drive support for root cause analysis on reliability and validation test failures to identify root cause(s) and achieve mitigation.

  • Leverage AI (Language Model) skills to build automation front-end and back-end framework which could interaction with human

  • Review partner and supplier test results and prescribe additional reliability testing on components, systems, and packaging as needed.

  • Work in an agile software development team with very high production quality standards.

  • Manage bug lifecycle and collaborate with inter-groups to drive for solutions.

What we need to see:

  • Bachelor’s Degree (or equivalent experience) in a STEM (Science, Technology, Engineering, Math or Physics) field with 2+ years proven experience; or Master’s Degree.

  • 2+ years of meaningful work experience

  • Proven years of automation experience using Python, Shell Script, Ansible, Jenkins

  • Strong OS (Ubuntu, RedHat, CentOS, SuSE, Fedora, Windows, etc.) trouble-shooting and debugging experience in a bare-metal and KVM/VMWare environment.

  • Experience in using AI development tools for test plans creation, test cases development and test cases automation

  • Ability to write test plans focusing on functional, performance, stress and negative testing.

  • Experience in developing CI/CD automation processes and DevOps contribution with a real passion for automation and Good teamwork with ability to work independently.

Ways to stand out from the crowd:

  • Experience working with NVIDIA GPU hardware is a strong plus.

  • Have implemented error handling for x86 based servers, online and offline health monitoring tools.

  • Experience of developing x86/ARM based environment

  • Background in parallel programming ideally CUDA/OpenCL is a plus

  • Strong experience in FW, BMC/OpenBMC, SBIOS, Network protocol, enterprise storage devices, Redfish - huge plus

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0
Category: Engineering Jobs

Tags: Agile Ansible CI/CD CUDA Deep Learning DevOps Engineering GPU HPC Jenkins Linux Mathematics Physics Python STEM Testing

Perks/benefits: Career development Health care

Region: Asia/Pacific
Country: Taiwan

More jobs like this