Senior / Staff ML Training Optimization Engineer
Tasks
- Build distributed training frameworks
- Create tooling and dashboards
- Design CUDA kernels
- Evaluate emerging training and inference technologies
- Identify performance bottlenecks
- Implement quantization aware training
- Profile model runtime and memory
Perks/Benefits
- Catered meals
- Dental insurance
- Flexible hours
- Health insurance
- Snacks
- Social events
- Team-building activities
- Unlimited vacation
- Vision insurance
- Work from home
Skills/Tech-stack
Bazel | C++ | CPU Profiling | CUDA | CUDA kernels | GPU Profiling | JAX | Kubernetes | NVIDIA Nsight | PyTorch | PyTorch Profiler | Python | Quantization | Rust
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Related jobs
-
Applied AI Engineer - AI Solutions USD 172K-300KAgentic Workflows | Airflow | Apache Spark | Chroma | CrewAIAnnual travel up to 25% | Employee stock options | Hybrid work | Professional developmentMid-level Full TimeNew York City, NY (Hybrid); Redwood … R17h ago
-
Product Analytics Engineer USD 130K-140KA/B | A/B Testing | Airflow | B testing | DBT401k retirement savings plan | Employer-sponsored healthcare | Flexible spending account | Health savings account | Paid parental leaveSenior-level Full TimeRemote, USA R22h ago
-
Edge AI Engineer USD 100K-150KC++ | Core ML | DSP | Embedded Systems | Federated LearningCareer growth | H1B transfer support | Remote workSenior-level Full TimeUnited States - Remote R22h ago
-
Senior-level Full TimeUnited States - Remote R22h ago
-
Senior-level Full TimeUnited States - Remote R22h ago
-
A2A protocols | API Integration | Agent Orchestration | Agentic Systems | AuthenticationRemote work | Training and support opportunitiesSenior-level Full TimeRemote - USA, United States R22h ago
-
AI Research Engineer USD 100K-150KAccelerator hardware | Agentic Systems | Computer Vision | Data Quality | Data quality monitoringMid-level Full TimeUnited States - Remote R22h ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Computer Vision | Data Quality | Data quality monitoringCareer growth | Remote workMid-level Full TimeUnited States - Remote R22h ago
-
AI Research Engineer USD 100K-150KAccelerator hardware | Computer Vision | Data Quality | Deep learning | Distributed TrainingBenefits package | Remote workMid-level Full TimeUnited States - Remote R22h ago
-
Hadoop Big Data Developer USD 100K-150KAWS EMR | Airflow | Apache Atlas | Apache Flink | Apache HBaseBenefits | Full-time W2 employment | Remote workSenior-level Full TimeUnited States - Remote R22h ago
-
Hadoop Big Data Developer USD 100K-150KAirflow | Apache Atlas | Apache Flink | Apache Hive | Apache HudiCareer growth | Remote workSenior-level Full TimeUnited States - Remote R22h ago
-
Mid-level Full TimeUnited States - Remote R22h ago
-
Principal Data Engineer USD 151K-220KAWS | Cloud Computing | Data Governance | Data Management | Data Modeling401k matching | Business resource groups | Dental insurance | Family and medical leave | Health insuranceSenior-level Full TimeKS Remote, United States R22h ago
-
Sr. Data Engineer (Remote) USD 163K-192KAccess Control | Amazon Web Services | Apache Iceberg | Apache Kafka | Apache Spark401k plan | Dental insurance | Disability insurance | Employee assistance program | FSA/HSASenior-level Full TimeRemote - United States R1d ago
-
AWS | Agent Orchestration | CI/CD | Cloud platform | Databricks401k match | Counseling membership | Employer subsidized medical dental and vision | Flexible time away program | Life insuranceMid-level Full Time-REMOTE, USA- R1d ago
-
AI Engineer, Ecosystem USD 171K-240KAPI Integration | Access Management | Audit Logging | Authentication | AuthorizationHybrid work | Remote work up to 4 weeks per yearMid-level Full TimeSan Francisco, California, United States R1d ago
-
Forward Deployed AI Solutions Engineer USD 95K-145KAPIs | Agentic Workflows | Audit Logging | Cloud Computing | Command Line401k benefits | Commuter benefits | Employee referral program | Fertility care benefits | Free testingMid-level Full TimeUS Remote R1d ago
-
Staff Analytics Engineer USD 159K-187KClaude | Claude Code | DBT | Data Contracts | Data Modeling401k company match | Accident insurance | Company funded HSA contributions | Critical illness insurance | Health, dental, vision coverageSenior-level Full TimeRemote (United States) R1d ago
-
AI Engineer, Product USD 171K-240KA/B | A/B Testing | API Design | B testing | Data ModelingHybrid work | Remote work up to four weeks per yearMid-level Full TimeSan Francisco, California, United States R1d ago
-
Senior Machine Learning Engineer, Shield USD 211K-263KAnomaly Detection | Apache Spark | Behavior analytics | BigQuery | Cloud DataflowSenior-level Full TimeRedwood City, CA, United States R1d ago
-
Senior Data Engineer USD 150K-175KAPI Development | AWS | Agile | Apache Airflow | Apache SparkHybrid/Remote flexibilitySenior-level Full TimeHerndon, VA R1d ago
-
Staff AI Engineer - Grafana AI/ML | USA | Remote CAD 186K-230KAWS | Agent Frameworks | Agent workflows | Alerting | AzureCompany funded AI coding assistant budget | Global annual leave policy | Remote workSenior-level Full TimeCanada (Remote) R1d ago
-
Staff AI Engineer - Grafana AI/ML | USA | Remote USD 174K-220KAWS | Azure | Cloud platform | Docker | Generative AIAnnual leave | Bonus | Equity | RSUs | Remote workSenior-level Full TimeUnited States (Remote) R1d ago
-
Senior AI Engineer - Grafana AI/ML | USA | Remote CAD 129K-217KAWS | Azure | Docker | GCP | GenAIAnnual leave policy | Company funded AI usage budget | Developer productivity support | Global culture | In-person onboardingSenior-level Full TimeCanada (Remote) R1d ago
-
Senior AI Engineer - Grafana AI/ML | USA | Remote USD 127K-203KAWS | Agent Frameworks | Agent workflows | Cloud Computing | Cloud platformBonus | Equity | Global annual leave policy | Remote workSenior-level Full TimeUnited States (Remote) R1d ago