Full Stack Developer, AI and LLM
US, CA, Remote, United States
NVIDIA
NVIDIA erfindet den Grafikprozessor und fördert Fortschritte in den Bereichen KI, HPC, Gaming, kreatives Design, autonome Fahrzeuge und Robotik.NVIDIA's Silicon Solutions Group is seeking a full-stack developer with AI/LLM expertise to help integrate AI into its data analysis and automation infrastructure. The solutions developed will support multiple critical large-scale automation initiatives. In this role, you will lead strategies and design of AI solutions to improve the efficiency of our existing and new automation workflows. The ideal candidate will combine technical expertise with hands-on experience to drive all AI planning, design, and implementation aspects. At NVIDIA, we strive for excellence, encourage innovation, and provide opportunities to explore new ways to succeed!
What You'll Be Doing:
Study and develop groundbreaking techniques in deep learning, graphs, machine learning, and data analytics, and perform in-depth analysis.
Collaborate with developers and cross-functional teams to identify current and emerging challenges.
Design and implement end-to-end generative AI solutions, specializing in Large Language Model (LLM) training, efficient deployment strategies, and sophisticated Retrieval-Augmented Generation (RAG) workflows.
What We Need to See:
MS (or equivalent experience) with 6+ years of software development; 2+ years relevant work experience in developing and deploying AI solutions
Proven full-stack development experience with a focus on improving application performance and user experience
Proficiency in Python, C++ programming, and Deep Learning frameworks
Ability to work independently and as part of a team
Motivated self-starter with strong analytical and debug skills
Ability to balance multiple simultaneous projects
Excellent verbal and written communication skills
Ways to Standout from the crowd:
Experience with CUDA programming and benchmarking and analyzing performance AI Agentic systems
Expertise in training, fine-tuning, and evaluating LLMs using popular frameworks such as TensorFlow or PyTorch
Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms
Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM
Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms · Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM
NVIDIA is widely considered to be one of the world's most desirable employers in the technology field. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
#LI-Hybrid
The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.Tags: CUDA Data analysis Data Analytics Deep Learning Generative AI LLMs Machine Learning Model deployment Python PyTorch RAG TensorFlow TensorRT
Perks/benefits: Career development Equity / stock options
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.