ML Runtime Optimization Engineer
Sunnyvale, California, United States
USD 159K-199K Senior-level Full Time
Tasks
- Apply model pruning and quantization
- Collaborate with ML and software engineers
- Deploy models to embedded runtime environments
- Develop compute usage strategies for inference
- Drive ML performance optimization
- Optimize model architecture for efficient deployment
- Profile model performance and identify bottlenecks
Perks/Benefits
- 401k match
- Dental insurance
- Disability insurance
- Health insurance
- Learning stipend
- Life insurance
- Paid time off
- Vision insurance
- Wellness stipend
Skills/Tech-stack
CPU | CUDA | Deep learning | Embedded Systems | GPU | Inference Optimization | JAX | Microarchitecture | Model Pruning | Model Quantization | ONNX | Performance Profiling | PyTorch | SoC | SoC Architecture | TensorRT | Triton | XLA
Education
Regions
Countries
States
Cities
Related jobs
-
C++ | Data Processing | Data Storage | Debugging | Distributed ComputingSenior-level Full TimeMountain View, CA, USA1h ago
-
Systems Engineer (Network / Storage / Systems) USD 335K-455KAutomation | Bash | Cause analysis | Cluster management | Configuration ManagementHybrid work model | Relocation assistanceSenior-level Full TimeSan Francisco9h ago
-
Software Engineer - Voice AI (Inference Runtime) USD 165K-330KAPI Development | CLI Development | Docker | Kubernetes | Language Processing401k matching | Fertility and family building stipend | Flexible PTO | Medical, dental, and vision insurance | Paid parental leaveSenior-level Full TimeSan Francisco13h ago
-
AI Search | AWS | AWS Bedrock | Azure | Azure AI401k | Dental insurance | Medical insurance | Paid sick hours | Vision insuranceSenior-level Contract Full TimeRidgefield Park, NJ, United States13h ago
-
Autonomy and Robotics Software Engineer USD 125K-220KC++ | CI/CD | Classification | Computer Vision | Dataset versioningE-Verify enrollment | Health insurance | Professional development | Retirement plansMid-level Full TimeHuntington Beach14h ago
-
Autonomy and Robotics Software Engineer USD 125K-220KC++ | CI/CD | Embedded Systems | Fault detection | GNSSHealth insurance | Professional development | Retirement plansMid-level Full TimeHuntington Beach14h ago
-
Early Career Software Engineer – Applied AI USD 100K-120KAWS | Cloud Computing | Cloud platform | Google Cloud | Google Cloud PlatformFlexible PTO | Lunch covered | Mental wellness days | Parental leave | Wellness reimbursementsEntry-level Full TimeSan Francisco14h ago
-
Software Engineer - Performance Optimization USD 199K-264KC++ | CPU Profiling | Concurrency | Debugging | Embedded SystemsSenior-level Full TimeMountain View, California, United States16h ago
-
Senior Staff ll, Machine Learning Engineer (Tech Lead) USD 187K-322KA/B | A/B Testing | B testing | Big Data | C++401k plan | Dental insurance | Disability insurance | Electric car charging | Employee assistance programSenior-level Full TimeMountain View, USA16h ago
-
Staff Machine Learning Engineer - Search USD 159K-208KAWS | Airflow | Autocomplete | Automation | Azure401k matching | Commuter benefits | Fitness benefits | Health insurance | Mental health supportSenior-level Full TimeAtlanta16h ago
-
Staff Machine Learning Engineer - Search USD 159K-208KAWS | AWS OpenSearch | Apache Airflow | Apache Flink | Apache Kafka401k match | Commuter benefits | Fitness benefits | Health insurance | Mental health supportSenior-level Full TimeNew York City16h ago
-
Software Engineer - Embedded Firmware USD 125K-210KAvionics | C++ | Debugging | Embedded Linux | Embedded SystemsDiscretionary annual bonus | Equity compensation | Medical/Dental/Vision insurance | Paid time off | Performance bonusSenior-level Full TimeSouth San Francisco, California, USA19h ago
-
3D Perception Engineer - Autonomy (Droid) USD 180K-265K3D Geometry | CNN | Camera Calibration | Computer Vision | Data PipelinesDental insurance | Equity compensation | Medical insurance | Paid time off | Vision insuranceMid-level Full TimeSouth San Francisco, California, USA19h ago
-
Autonomy Perception Engineer - CV / 3D Reconstruction USD 180K-265K3D Geometry | Camera Calibration | Computer Vision | Convolutional Neural Networks | Data AnnotationDental insurance | Equity compensation | Medical insurance | Overtime pay | Paid time offMid-level Full TimeSouth San Francisco, California, USA19h ago
-
Mid-level Full TimeRedford, MI, United States21h ago
-
ML Engineer, II - App Engine FRENCH USD 139K-166KC++ | CUDA | Distributed Systems | Embedded Software | EthernetDental insurance | Flexible schedule | Health insurance | Life insurance | Paid time offMid-level Full TimeMontreal, Canada, Ann Arbor, MI22h ago
-
ML Engineer, II - App Engine USD 153K-183KC++ | CUDA | Distributed Systems | GPU Programming | Linux401k match | Dental insurance | Disability insurance | Health insurance | Life insuranceMid-level Full TimeAnn Arbor, MI, Montreal, Canada22h ago
-
Machine Learning Engineer, Lyft Business USD 176K-211KAWS SageMaker | Amazon Bedrock | Anomaly Detection | Behavior detection | Cloud ML401k match | Child care benefits | Commuter benefits | Dental insurance | Family building benefitsEntry-level Full TimeNew York, NY; San Francisco, CA R22h ago
-
Research Engineer - LLM Infra training - Seed Infra USD 232K-427KCheckpointing | Data-Driven Optimization | Data-driven | Deep learning | Distributed TrainingMid-level Full TimeSeattle, Washington, United States1d ago
-
Causal Inference | Cross-modal fusion | DPO | Data Modeling | Deep learningMid-level Full TimeSeattle, Washington, United States1d ago
-
Machine Learning Engineer Graduate (E-Commerce Supply Chain & Logistics)- 2026 Start (BS/MS) USD 122K-256KData Mining | Deep learning | Knowledge graphs | Language Models | Language ProcessingEntry-level Full TimeSan Jose, California, United States1d ago
-
Computer Science Research - US - IC5 USD 166K-244KData Pipelines | Deep learning | Experimentation | Generative Models | Image-to-videoKnowledge sharing | Mentoring | Open source contributionsMid-level Full TimeBellevue, WA | Menlo Park, CA1d ago
-
API Design | Agentic Workflows | C plus plus | C# | Computer VisionSenior-level Full TimeRedmond, WA1d ago
-
Machine Learning Solutions Engineer, Google Cloud USD 153K-222KApache Beam | C++ | ELT | ETL | Generative AISenior-level Full TimeChicago, IL, USA; Atlanta, GA, USA1d ago
-
Audio Visual | Audio/visual processing | C# | C++ | Deep learningHealth insurance | Local commuting card | Professional development | Relocation stipend | Travel coveredEntry-level InternshipCambridge, MA1d ago