Machine Learning Engineer, Efficiency Engineering - USDS
San Jose, California, United States
The Efficiency Engineering team builds innovative tools and applications that help IT operations and DevOps teams reach new levels of efficiency. We're a tight-knit crew of experienced developers, engineers, and problem solvers with a shared vision: streamlining operations, reducing manual workload, and empowering teams to do their best work.
To enhance collaboration and cross-functional partnerships, among other things, our organization currently follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities:
- Design and develop large-scale ML system architecture, solving technical system problems in areas such as high concurrency, reliability, and scalability.
- Develop end-to-end deep model inference solutions for internal business units such as Search and for related Large Language Model (LLM) based systems.
- Provide highly automated, high-performance model optimization solutions for frameworks such as PyTorch and TensorFlow. Technical areas include subgraph matching, compilation optimization, model quantization, and heterogeneous hardware.
- Manage the large-scale GPU computing cluster for our global businesses, improving its utilization through methods such as elastic scheduling, GPU overselling, and task orchestration.
- Collaborate cross-functionally with the algorithm department on joint optimization of algorithms and systems.
Categories:
Engineering Jobs
Machine Learning Jobs
Tags: Architecture DevOps Engineering GPU LLMs Machine Learning Model inference PyTorch TensorFlow
Region:
North America
Country:
United States