Senior System Software Engineer - TAO ML Data Platforms

US, CA, Santa Clara

NVIDIA

NVIDIA erfindet den Grafikprozessor und fördert Fortschritte in den Bereichen KI, HPC, Gaming, kreatives Design, autonome Fahrzeuge und Robotik.

View all jobs at NVIDIA

Apply now Apply later

NVIDIA is hiring a Senior Systems Software Engineer for the ML Data Platform to join the TAO Toolkit Team. Our team builds frameworks, services, algorithms, and tools that power the largest NVIDIA Multi-Modal Foundation Models and their customization. We are seeking someone who can help develop novel, scalable, and automated pipelines to make sense of petabytes of unstructured data and collaborate with multiple deep-learning architects and engineers to enable the development of pioneering AI models.

What you’ll be doing:

  • Design, develop, and support a platform to access large datasets, integrating data from various sources.

  • Build scalable and robust ETL pipelines to deliver high-quality datasets.

  • Implement lake house structures and model data for optimized reporting to analyze the datasets.

  • Develop and maintain cloud services for data management and machine learning workflows in a production environment.

  • Help to find the right data for a Multi-Modal model with scalable query-retrieval systems.

  • Work with internal teams to define requirements, enhance products, and automate workflows.

What we need to see:

  • Bachelor's degree (or equivalent experience) in Computer Engineering, Computer Science, Electrical Engineering, Robotics, or related field

  • 5+ years of ML / related software engineering experience with strong architecture and design skills.

  • Proficient in Python and familiar with knowledge of the latest tools.

  • Experience with SQL databases and cloud infrastructure (AWS, GCP, Kubernetes).

  • Experience in high-performance computing environments and workflow automation frameworks (e.g., Airflow).

  • Knowledge of PyTorch, distributed machine learning, and distributed file systems.

  • 3+ years leading complex sometimes ambiguous projects, particularly in high-throughput services at supercomputing scale.

  • Proficient in running applications on cloud platforms using Kubernetes and Docker, and ML frameworks like Pytorch.

Ways to stand out from the crowd:

  • Familiar with GPU programming concepts, and writing custom CUDA kernels.

  • Experience with Dagster, Terraform.

  • Proficient in building systems and familiar with deep learning architectures and tools like Nvidia TensorRT-LLM, Multimodal-LLM, and Triton Server.

With a competitive salary package and benefits, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and autonomous Senior Software Engineer, who loves challenges? Do you have a genuine passion for advancing the state of Data Science across a variety of industries? If so, we want to hear from you.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you are creative and autonomous, we want to hear from you!

The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Apply now Apply later
  • Share this job via
  • 𝕏
  • or
Job stats:  4  1  0

Tags: Airflow Architecture AWS Computer Science CUDA Dagster Data management Deep Learning Docker Engineering ETL GCP GPU Kubernetes LLMs Machine Learning Pipelines Python PyTorch Robotics SQL TensorRT Terraform Unstructured data

Perks/benefits: Career development Competitive pay Equity / stock options

Region: North America
Country: United States

More jobs like this