Staff Machine Learning Engineer - Inference
Santa Clara, CA
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Full Time Senior-level / Expert USD 179K - 303K
XPeng Motors
XPENG's electric vehicles designed for performance, safety, and sustainability. Explore our range of smart EVs, advanced technology, and commitment to a greener future.
XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent mobility, XPENG is dedicated to reshaping the future of transportation through cutting-edge R&D in AI, machine learning, and smart connectivity.
We are looking for a full-time Staff Machine Learning Engineer - Inference, with deep knowledge of ML inference and strong enthusiasm towards optimize our models to utilize every FLOP and every byte of RAM of our AI accelerator hardware.
Our mission is to solve the autonomous driving problem. You will work with a team of talented software engineers, machine learning engineers and research scientists to push the boundary of state-of-art machine learning models which will enable the next-generation solution of autonomous driving.
Job Responsibilities:
- Optimization of models towards deployment on customized AI accelerators
- Write kernels for customized AI accelerators
- Develop performance estimates for critical kernels
- Master in CS/CE/EE, or equivalent, in industry experience
- Strong code skill in C/C++ and Python
- Experience with CUDA programming or related AI accelerator programming.
- Experience with enabling accuracy machine learning modeling inference using low precision data formats
- Familiarity with the fundamentals of deep learning
- Have strong engineering skills to unblock yourself and are willing to pick up whatever knowledge you are missing to get the job done
- Have an understanding of ML architecture and an intuition for how to reduce model latency
- Familiarity with GPU architecture or custom silicon chip architecture
- Experience training deep learning models
- A track record of efficiently solving complex problems collaboratively on larger teams of ML engineers, compiler engineers, kernel writers etc.
- A fun, supportive and engaging environment
- Infrastructures and computational resources to support your work.
- Opportunity to work on cutting edge technologies with the top talents in the field.
- Opportunity to make significant impact on the transportation revolution by the means of advancing autonomous driving
- Competitive compensation package
Job stats:
1
0
0
Categories:
Engineering Jobs
Leadership Jobs
Machine Learning Jobs
Tags: Architecture Autonomous Driving CUDA Deep Learning Engineering GPU Machine Learning ML models Python R R&D Research Robotics
Perks/benefits: Career development Competitive pay Equity / stock options Salary bonus
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Sr. Data Engineer jobsPrincipal Data Engineer jobsBusiness Intelligence Developer jobsPower BI Developer jobsData Scientist II jobsStaff Data Scientist jobsPrincipal Software Engineer jobsStaff Machine Learning Engineer jobsDevOps Engineer jobsData Science Intern jobsJunior Data Analyst jobsAI/ML Engineer jobsSoftware Engineer II jobsStaff Software Engineer jobsData Science Manager jobsData Manager jobsLead Data Analyst jobsData Analyst Intern jobsData Specialist jobsSr. Data Scientist jobsBusiness Data Analyst jobsData Governance Analyst jobsBusiness Intelligence Analyst jobsData Engineer III jobsSenior Backend Engineer jobs
Business Intelligence jobsAirflow jobsMLOps jobsOpen Source jobsKafka jobsEconomics jobsKPIs jobsGitHub jobsLinux jobsJavaScript jobsTerraform jobsRAG jobsPostgreSQL jobsBanking jobsPrompt engineering jobsStreaming jobsScikit-learn jobsData Warehousing jobsNoSQL jobsRDBMS jobsClassification jobsComputer Vision jobsPhysics jobsdbt jobsPandas jobs
Google Cloud jobsScala jobsHadoop jobsLangChain jobsGPT jobsData warehouse jobsMicroservices jobsR&D jobsBigQuery jobsCX jobsDistributed Systems jobsELT jobsReact jobsScrum jobsOracle jobsLooker jobsIndustrial jobsPySpark jobsOpenAI jobsJira jobsRobotics jobsRedshift jobsSAS jobsUnstructured data jobsTypeScript jobs