Tech Lead Machine Learning Ops Engineer, Global SRE
San Jose, California, United States
USD 187K-359K (estimate) | Senior-level | Full Time
Tasks
- Ensure stability of AIGC machine learning tasks
- Improve resource efficiency
- Improve training task success rate
- Maintain stability of offline machine learning training tasks
- Maintain stability of online machine learning serving systems
- Manage and plan machine learning resources
- Optimize cost and budget
- Roll out GPU model training in non-China regions
- Set SLOs for online machine learning serving systems
Perks/Benefits
- N/A
Skills/Tech-stack
Cost Optimization | GPU Computing | Machine Learning | Machine Learning Operations | Model Deployment | Model Serving | Model Training | Resource Management | SLO | SRE
Education
N/A