Tech Lead, ML Infrastructure
Mountain View, California (HQ)
Nuro, Inc.
Driving the future of AI-powered autonomy for all roads, all rides.Who We Are
Nuro exists to better everyday life through robotics. Founded in 2016, Nuro is a leading autonomous technology company with vehicles on road today in California and Texas. The company’s core technology is the Nuro Driver, an integrated autonomous driving system consisting of state-of-the-art, AI-first software and custom-built sense and compute hardware. The first commercial application of the Nuro Driver is autonomous goods delivery with our custom, electric, zero-occupant vehicles in partnership with some of the most recognized brands in the world including Uber and FedEx.
About the Role
The ML Infra team is growing and we are looking for talented engineers to join us. Nuro is pursuing an ML-first software stack. Our team provides solutions to empower machine learning development in Nuro and optimize on-cloud training and onboard inference. Our solutions include a distributed training platform, ML compiler, model components libraries, e.t.c. We are looking for a tech lead to drive the initiatives in distributed training and inference optimization, the key technology to scale our deep learning models to achieve higher performance.
About the Work
- Lead the exploration of the latest technology of distributed training and inference optimization.
- Dive deep into the pain points of ML teams and understand model development bottlenecks.
- Define and drive the technical strategy for model optimization.
- Collaborate with ML teams to ensure smooth adoption of the model optimization solutions.
- Promote engineering best practices and cultivate a culture of continuous improvement in the team.
About You
- 6+ years of work or research experience in ML Infra, distributed training, model inference or GPU programming.
- Knowledge in using at least one deep learning framework, e.g. Tensorflow, Pytorch, JAX. Be able to understand deep learning algorithms, e.g. computer vision, NLP, behavior planning.
- Proficient in C++/C and Python. Strong coding ability.
- Strong communication and teamwork skills. Be passionate about exploring and promoting cutting edge technology.
Bonus Points
- Experience with CUDA, Cublas, Cudnn or any other Nvidia SDKs.
- Experience with model quantization or pruning.
- Experience with compilers or ML compilers (e.g. TensorRT, Triton, XLA, Clang).
- Experience with AI algorithms and hardware codesign (e.g. Depthwise Conv, Flash Attention).
- Experience with distributed training speedup (e.g. FSDP, DeepSpeed).
At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $202,350 and $303,050 for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.
At Nuro, we celebrate differences and are committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics.
Tags: Autonomous Driving Computer Vision CUDA cuDNN Deep Learning Engineering FSDP GPU JAX Machine Learning ML infrastructure ML models Model inference NLP Python PyTorch Research Robotics TensorFlow TensorRT
Perks/benefits: Career development Competitive pay Equity / stock options Salary bonus
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.