Senior Software Engineer, Machine Learning Infrastructure - Generative AI
USD 137K-299K Senior-level Full Time
Tasks
- Architect scalable systems for model serving batch inference GPU autoscaling and GPU utilization
- Build platforms for rapid experimentation with production reliability and observability
- Drive cost and latency optimization for GPU inference and fine tuning
- Lead design of generative AI model platform infrastructure
- Own open weights serving stack real time GPU endpoints high throughput batch inference and fine tuning
- Partner with ML product and platform teams to deliver reusable platform capabilities
- Set technical direction for next generation centralized generative AI platform including post training and agent optimization
Perks/Benefits
- 401k plan with employer matching
- Basic life insurance
- Commuter benefits match
- Disability insurance
- Medical, dental, and vision benefits
- Mental health program
- Paid Holidays
- Paid parental leave
- Paid sick leave
- Paid time off
- Wellness benefits
Skills/Tech-stack
APIs | AWS | Backend Services | Batch inference | Cloud platform | Cold Start | Cold Start Optimization | Cost Optimization | DPO | Data Pipelines | Debugging | Distributed Systems | FP8 | Fine Tuning | GPU Utilization | GPU autoscaling | Google Cloud | Google Cloud Platform | INT8 | Incident Response | KV cache | Kubernetes | LLM Inference | LLM routing | LoRA | Machine Learning | Machine Learning Infrastructure | Model Serving | Monitoring | Observability | Performance Tuning | Python | Quantization | RAG | Reliability Engineering | SFT | SGLang | TensorRT-LLM | Tracing | VLLM | Vector Databases
Education
Regions
Countries
States
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R8d ago
-
SOFTWARE DATA ENGINEER - Enterprise Infrastructure - 5+ yrs of Experience - TS/SCI w/Poly clearance is required - ES A USD 168K-172KCloud Security | Cloud infrastructure | Linux | Network Automation | OpenStack401k retirement plan | Dental insurance | Health insurance | Life insurance | Long-term disabilityMid-level Full TimeLaurel, United States2h ago
-
Big Data | Data Modeling | Email Marketing | Graph Processing | Machine LearningEntry-level Full TimeSeattle, Washington, United States3h ago
-
AI Agents | Alerting | Automation | Capacity Planning | Change ManagementBlameless post incident review process | Mentorship | Rotational on call coverageSenior-level Full TimeSan Jose, California, United States3h ago
-
Senior Data Engineer USD 187K-321KAWS | Airflow | Apache Spark | Batch Processing | Data Modeling401k matching | Flexible work schedule | Health and wellness supportSenior-level Full TimeAustin, Texas15h ago
-
Senior Data Engineer USD 148K-361KAirflow | Apache Spark | Data Modeling | Data Quality | HDFS401k | Commuter benefits | Dental insurance | Disability benefits | Equity awardsSenior-level Full TimeSan Jose, California15h ago
-
Bash | Cloud platform | Data Ingestion | Data Processing | DockerMid-level Full TimeAustin, TX, USA15h ago
-
Bash | Data Processing | Docker | GCP | LinuxAsynchronous culture | Flexible remote work environment | Supportive entrepreneurial teamMid-level Full TimeAtlanta, GA, USA15h ago
-
Bash | Cloud infrastructure | Data Processing | Docker | GCPAsynchronous culture | Entrepreneurial team | Remote workMid-level Full TimeNew York, NY, USA15h ago
-
Bash | Cloud platform | Data Pipelines | Data Processing | DockerAsynchronous culture | Bonus | Equity | Laid-back atmosphere | Remote-friendlyMid-level Full TimeBoston, MA, USA15h ago
-
Bash | Cloud platform | Docker | Google Cloud | Google Cloud PlatformAsynchronous culture | Bonus | Equity | Flexible work environment | Laid-back atmosphereMid-level Full TimePortland, OR, USA15h ago
-
Bash | Cloud infrastructure | Docker | GCP | Infrastructure as CodeAsynchronous culture | Remote-friendlyMid-level Full TimeTempe, AZ, USA15h ago
-
Bash | Cloud platform | Data Ingestion | Data Processing | DockerAsynchronous culture | Competitive benefits | Laid-back atmosphere | Remote-friendlyMid-level Full TimeLas Vegas, NV, USA15h ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudAsynchronous culture | Bonus | Equity | Friendly work environmentMid-level Full TimeFrisco, TX, USA15h ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudAsynchronous culture | Bonuses | Equity | Friendly work environmentMid-level Full TimeMinneapolis, MN, USA15h ago
-
Bash | Data Processing | Docker | GCP | Infrastructure as CodeAsynchronous culture | Bonus | Equity | Flexible team environmentMid-level Full TimeRaleigh, NC, USA15h ago
-
Bash | Cloud platform | Data Pipelines | Docker | Google CloudAsynchronous culture | Flexible management approach | Friendly work environment | Opportunity to make impact | Remote/distributed teamMid-level Full TimeKansas City, MO, USA15h ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudAsynchronous culture | Entrepreneurial environment | Opportunity impact | Remote/distributed workMid-level Full TimeCincinnati, OH, USA15h ago
-
Bash | Data Processing | Docker | GCP | Infrastructure as CodeAsynchronous culture | Laid-back atmosphere | Portfolio support | Remote-friendlyMid-level Full TimeDetroit, MI, USA15h ago
-
Bash | Cloud infrastructure | Data Processing | Docker | GCPAsynchronous culture | Friendly laid-back atmosphereMid-level Full TimeEvanston, IL, USA15h ago
-
Bash | Cloud platform | Data Processing | Docker | GCPAsynchronous culture | Competitive benefits | Equity bonus | Remote-friendlyMid-level Full TimeRichmond, VA, USA15h ago
-
Bash | Data Processing | Docker | GCP | Infrastructure as CodeAsynchronous work culture | Entrepreneurial environment | Hands-off management | Remote-friendly, distributed teamMid-level Full TimeBakersfield, CA, USA15h ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudAsynchronous work culture | Friendly atmosphere | Handsoff managementMid-level Full TimeFort Collins, CO, USA15h ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudAsynchronous culture | Friendly laid-back atmosphereMid-level Full TimeCollege Station, TX, USA15h ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudAsynchronous work culture | Flexible priorities | Remote-friendly environment | Supportive teamMid-level Full TimeBirmingham, AL, USA15h ago