Software Engineer, Training & Inference Infrastructure
Tasks
- Architect training infrastructure reliable scalable cost efficient
- Automate resource orchestration and fault recovery
- Build model serving infrastructure low latency high throughput
- Ensure infrastructure reliability security observability
- Optimize training and inference pipelines performance reliability cost
- Partner with researchers to productionize models and features
Perks/Benefits
- 401k match
- Daily meals
- Health insurance
- Learning and development stipend
- Relocation assistance
- Unlimited PTO
- Wellness stipend
Skills/Tech-stack
AWS EFA | CUDA | GPU Computing | HPC | Infiniband | Model Parallelism | NCCL | Observability | PyTorch | Python | Security Engineering | VLLM
Education
N/A
Roles
Regions
Countries
States
Cities
Related jobs
-
Senior-level Full TimeBoston, MA4h ago
-
Systems Engineer - Analytical chemistry USD 120K-130KCavity Ring-Down Spectroscopy | Computational Methods | Data Analysis | Data Visualization | Experimental uncertainty401k | Dental insurance | Employee referral program | Flexible spending account | Health savings accountMid-level Full TimeSanta Clara, CA5h ago
-
AI Engineer USD 120K-200KActive Learning | Data Flywheel | Data Generation | Dataset Construction | Deep learningIn-person collaboration | Medical, dental & vision coverageEntry-level Full TimeSan Francisco Office10h ago
-
AI Engineer (React UI) - Remote US USD 135K-170KAWS | Accessibility | Anthropic Claude | Apache Airflow | AzureRemote workSenior-level Full TimeWauwatosa, WI, United States R15h ago
-
Staff Machine Learning Engineer USD 192K-305K.NET | A/B | A/B Testing | API Design | AlertingCommute subsidy | Disability insurance | Employee assistance program | Employee resource groups | Employee stock ownershipSenior-level Full TimeMountain View, CA, USA1d ago
-
Staff Data Platform Engineer USD 164K-246KAWS | AWS CDK | AWS Lambda | Batch Processing | CI/CD401k match | Annual equity awards | Catered meals | Company paid life insurance | Flexible paid time offSenior-level Full TimeRemote, United States R1d ago
-
Embedded Software Engineer USD 140K-200KBare Metal | Buildroot | C++ | CI Pipelines | CI configurationSenior-level Full TimeLos Angeles, CA1d ago
-
Backup | Cooling | Data Integrity | Data pipeline | Disaster RecoverySenior-level Full TimeSan Francisco1d ago
-
Machine Learning Infrastructure Engineer USD 150K-350KCloud Storage | Cloud platform | Compute Engine | Deep learning | GPUsMid-level Full TimeRedwood City, CA1d ago
-
AI Engineer USD 200K-380KCI/CD | Cloud deployment | DVC | Deep learning | Distributed SystemsCompany provided lunch | Dental insurance | Fitness Membership Coverage | Medical insurance | Regular offsitesMid-level Full TimeNew York City1d ago
-
Senior Software Engineer, Storage USD 212K-318KCI/CD | Design Patterns | Kubernetes | PostgreSQL | PythonSenior-level Full TimeSan Francisco1d ago
-
Embedded Systems Engineer – Robotics Hardware USD 70K-300KARM | C++ | CAN | Consumption analysis | Device DriversHybrid work option | Onsite collaboration | Remote optionSenior-level Full TimeIrvine, CA1d ago
-
Machine Learning Engineer USD 170K-210KC++ | Computer Vision | Deep learning | Localization | MappingMid-level Full TimeRemote, US R1d ago
-
AI/Machine Learning Engineer USD 155K-200KData Visualization | Deep learning | Machine Learning | PyTorch | PythonMid-level Full TimeSunnyvale, CA1d ago
-
Space Operations Engineer (Embedded Software) USD 100K-150KARM | C# | C++ | Command and control | Communication ProtocolsMid-level Full TimeSan Francisco, CA1d ago
-
Computer Vision Engineer USD 170K-215K3D Mapping | 3D Reconstruction | C++ | CUDA | Camera Calibration401k matching | Catered lunches | Coffee | Dental insurance | Health insuranceMid-level Full TimeBoston1d ago
-
Senior Engineer, Software - Perception (R3771) USD 123K-194KData Fusion | Data synchronization | Debugging | Deep learning | Inertial odometrySenior-level Full TimeSan Diego, California1d ago
-
Sr. Back-End Software Engineer - Machine Learning USD 150K-250KAPIs | C++ | Computer Vision | Distributed Systems | Language Processing401k matching | Commuter benefits | Employee Medical Premium Coverage | Employee referral program | Flexible spending accountsSenior-level Full TimeSanta Clara, CA1d ago
-
Data Engineer - Mid USD 96K-179KAWS | Apache Spark | Azure | CI/CD | Cloud platform401k match | Certification reimbursement | Dental insurance | Disability coverage | Flexible work arrangementsMid-level Full TimeBolling, AFB, DC1d ago
-
Machine Learning Engineer USD 150K-220KAWS | Azure | CI/CD | Docker | Experiment tracking401k match | Certification reimbursement | Health, dental, vision coverage | Life insurance and disability coverage | Paid HolidaysMid-level Full TimeWashington, DC1d ago
-
Senior / Staff ML Optimization Engineer USD 141K-249KBazel | C++ | CPU Profiling | CUDA | Distributed TrainingCatered meals | Dental insurance | Flexible hours | Health insurance | Social eventsSenior-level Full TimeRemote US & Canada R1d ago
-
Senior / Staff Perception Engineer USD 158K-269K3D Object Detection | C++ | CUDA | Computer Vision | Data setsDental insurance | Equity awards | Flexible hours | Health insurance | Unlimited vacationSenior-level Full TimeRemote US & Canada R1d ago
-
Senior Machine Learning Engineer - Ads R&D USD 184K-262KAdtech | Apache Beam | Apache Spark | Causal Inference | Data Analysis401k retirement plan | Health insurance | Meal allowance | Paid flexible holidays | Paid parental leaveSenior-level Full TimeNew York, NY1d ago
-
Senior-level Full TimeUS TX Austin1d ago
-
AI Solutions Specialist USD 113K-197KAI Search | API Development | Azure AI | Azure AI Search | Azure FunctionsHybrid work scheduleMid-level Full TimeKansas City, MO, United States1d ago