Machine Learning Engineer, Efficiency Engineering - USDS
San Jose, California, United States
The Efficiency Engineering team is passionate about crafting innovative tools and applications that empower IT operations and DevOps teams to achieve new levels of efficiency. We're a tight-knit crew of experienced developers, engineers, and problem solvers fueled by a shared vision: streamlining operations, reducing manual workload, and empowering teams to do their best work.
To enhance collaboration and cross-functional partnerships, among other things, our organization currently follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities:
- Design and develop large-scale ML system architecture, solving technical system problems in high concurrency, reliability, scalability, etc.
- Develop end-to-end deep model inference solutions for internal business units such as Search, and for related Large Language Model (LLM) based systems.
- Provide highly automated, high-performance model optimization solutions for frameworks such as PyTorch and TensorFlow. Technical solutions include subgraph matching, compilation optimization, model quantization, and support for heterogeneous hardware.
- Manage the large-scale GPU computing cluster for our global businesses, improving its utilization through methods such as elastic scheduling, GPU overselling, and task orchestration.
- Collaborate cross-functionally with the algorithm department on joint optimization of algorithms and systems.