ML Runtime Optimization Engineer
Sunnyvale, California, United States
USD 159K-199K Senior-level Full Time
Tasks
- Apply model pruning and quantization
- Collaborate with ML and software engineers
- Deploy models to embedded runtime environments
- Develop compute usage strategies for inference
- Drive ML performance optimization
- Optimize model architecture for efficient deployment
- Profile model performance and identify bottlenecks
Perks/Benefits
- 401k match
- Dental insurance
- Disability insurance
- Health insurance
- Learning stipend
- Life insurance
- Paid time off
- Vision insurance
- Wellness stipend
Skills/Tech-stack
CPU | CUDA | Deep learning | Embedded Systems | GPU | Inference Optimization | JAX | Microarchitecture | Model Pruning | Model Quantization | ONNX | Performance Profiling | PyTorch | SoC | SoC Architecture | TensorRT | Triton | XLA
Education
Regions
Countries
States
Cities
Related jobs
-
AI Architect USD 130K-230KAgentic AI | Amazon Web Services | Azure | Bagging | CI/CDMentorship opportunities | Remote work environmentSenior-level Full TimeUnited States8h ago
-
Machine Learning Engineer Graduate (TikTok-Data-Search-Basic Ranking) - 2026 Start (BS/MS) USD 116K-177KComputer Vision | Deep learning | Language Models | Language Processing | Large Language ModelsEntry-level Full TimeSan Jose, California, United States9h ago
-
Senior Staff Machine Learning Engineer, Infrastructure USD 248K-310KA/B | A/B Testing | Agentic AI | Apache Airflow | Apache KafkaDisability accommodation support | Employee travel credits | Inclusion focused hiring | Remote eligibleSenior-level Full TimeUnited States16h ago
-
Anomaly Detection | Automated testing | Benchmarking | Computer Vision | Data Preparation401k | Education reimbursement program | Flexible schedules | Hybrid schedule | Relocation assistanceSenior-level Full TimeLivermore, CA, United States R20h ago
-
Design Optimization Engineer - Postdoctoral Researcher USD 122K-143KAdjoint method | C# | C++ | CUDA | Data Science401k | Education reimbursement program | Flexible schedule | Hybrid work schedule | Relocation assistanceEntry-level Full TimeLivermore, CA, United States R20h ago
-
Senior Lead Machine Learning Engineer USD 229K-286KAWS | Apache Spark | Bias Variance | Cloud Computing | Cloud platformHealth benefits | Long-term incentives | Performance-Based IncentivesSenior-level Full TimeMcLean, VA, United States21h ago
-
Responsible AI Engineer USD 68K-220KAWS | Azure | Bitbucket | Clustering | Computer Vision401k matching | Dental insurance | Life insurance | Long-Term Disability coverage | Medical insuranceMid-level Full TimeNew York, One Manhattan West, Corp, …21h ago
-
Principal or Senior Principal Embedded Software Engineer USD 108K-203KBitbucket | C++ | CUDA | Confluence | DebuggingHealth insurance | Paid time off | Relocation assistanceSenior-level Full TimeMDAN02, United States21h ago
-
Sr. Staff Software Engineer, Machine Learning USD 191K-315KContent Safety | Deep learning | Evaluation Pipelines | Fine Tuning | Harm TaxonomyHealth and wellness programs | Time away from workSenior-level Full TimeMountain View, CA, United States22h ago
-
Data Scientist USD 90K-115KAI Agents | ARIMA | Bayesian Inference | DBT | DagsterBackground checks required | Compliance with regulations | Hybrid workMid-level Full TimeChicago, Illinois, United States1d ago
-
Agentic AI Data Engineer USD 150K-150KAWS Glue | AWS Lambda | AWS Step Functions | Amazon Athena | Amazon BedrockMid-level Full TimeUnited States1d ago
-
Research Engineer, Asta USD 118K-178KAWS | Agentic Learning | Agentic Planning | Agentic Reasoning | Cloud ComputingFamily leave | Paid sick leave | Paid vacation | Work-life balanceMid-level Full TimeSeattle, WA1d ago
-
Deep Learning Engineer USD 140K-220KAnchor Free Detectors | C++ | Computer Vision | Data strategy | Deep learning401k plan | Commuter benefits | Flexible PTO | Life insurance | Long-term disabilityMid-level Full TimeSeattle, WA1d ago
-
Senior Machine Learning Engineer, Sentry Tower USD 220K-330KC plus plus | Computer Vision | Continuous integration | Data collection | Dataset curationEquity grants | Health benefits | Recovery BenefitsSenior-level Full TimeIrvine, California, United States; Remote R1d ago
-
Data Scientist / Software Engineer - REMOTE USD 100K-175KAPI Design | AWS | Agile | Azure | CI/CD401k match | Medical, dental & vision coverage | Remote-friendly | Training opportunitiesMid-level Full TimeDallas, TX, US R1d ago
-
Staff AI Engineer USD 170K-277KBig Data | Data Mining | Deep learning | Information Retrieval | JavaHealth and wellness programs | Time away from workSenior-level Full TimeSunnyvale, CA, United States1d ago
-
Sr Data Scientist - Gen AI ML - Irving USD 53K-188KAPI Security | Agent systems | Asynchronous programming | CI/CD | Docker401k retirement plan | Dental insurance | Health insurance | Paid Holidays | Paid time offSenior-level Full TimeUnited States1d ago
-
AI Intern- Summer 2026 USD 80K-100KAlgorithms | Algorithms and Data Structures | Claude | Computer Vision | Data StructuresNetworking events | Paid Company Holidays | Paid internship | Paid volunteer time | Professional development sessionsEntry-level InternshipSan Jose, CA, USA1d ago
-
Senior-level Full TimeTampa, FL1d ago
-
Amazon SageMaker | Azure ML | Computer Vision | Data Pipelines | Deep learning401k | Dental insurance | Disability insurance | Employee assistance program | Health insuranceSenior-level Full TimeAustin1d ago
-
API Design | Artificial Intelligence | CI/CD | Cloud Computing | Containerization401k savings plan | Company holidays | Employee assistance program | Employee stock purchase plan | Health benefitsMid-level Full TimeUnited States R1d ago
-
Computer Vision | Data Analysis | Deep learning | Language Processing | Machine LearningEntry-level Full TimeSeattle, Washington, United States1d ago
-
Machine Learning Engineer, TikTok - Business Governance USD 145K-250KAI Agents | Audio Processing | Content Moderation | Deep learning | Direct Preference OptimizationMid-level Full TimeSan Jose, California, United States1d ago
-
Large Model Training Acceleration Engineer USD 187K-387KBenchmarking | Data parallelism | Deep learning | Distributed Training | Distributed inferenceMid-level Full TimeSan Jose, California, United States1d ago
-
Staff Software Engineer, AI/ML, YouTube Ads USD 207K-300KA/B | A/B Testing | Advertising Systems | Algorithms | B testingSenior-level Full TimeMountain View, CA, USA1d ago