Lead Member of Technical Staff, Inference Infrastructure
Tasks
- Collaborate with cross-functional teams
- Deploy and operate large language model API endpoints
- Design and lead machine learning model serving architecture
- Drive infrastructure strategy for low latency high throughput systems
- Lead design of customized customer deployments
- Mentor engineers and set technical standards
- Own compute storage network resource and cost management
- Troubleshoot production Kubernetes and GPU workloads
Perks/Benefits
- Co-working stipend
- Health and dental benefits
- Mental health support budget
- Open and inclusive culture
- Parental leave top-up
- Remote-flexible work
- Vacation time
- Weekly lunch stipend
Skills/Tech-stack
AWS | Azure | C plus plus | Cost Optimization | Distributed Systems | GCP | GPU Computing | Golang | High Availability | High Throughput | Hybrid Cloud | Kubernetes | Linux | Low Latency | Multi-cloud | OCI | Resource Management
Education
N/A
Regions
Countries
States
Related jobs
-
Staff Engineer, Datacenter Server Lifecycle USD 320K-405KAWS | Asset tracking | Coreboot | Decommissioning | Firmware verificationFlexible working hours | Generous vacation | Hybrid work policy | Optional equity donation matching | Parental leaveSenior-level Full TimeSan Francisco, CA | New York …6h ago
-
AI & Robotics Data Collection Engineer (Intern) USD 68K-82KContainerization | Linux | Python | ROS2 | Shell ScriptingFully stocked kitchen | Meals provided | Onsite workEntry-level InternshipMilpitas, CA7h ago
-
Mid-level Full TimeTysons, VA, United States7h ago
-
Mid-level Full TimeRemote, United States R7h ago
-
Senior AI Infrastructure Engineer - Training Platform USD 216K-270KAWS | Admission controllers | C++ | CUDA | Custom ResourcesCommuter stipend | Comprehensive health, dental and vision coverage | Generous PTO | Learning and development stipend | Retirement benefitsSenior-level Full TimeSan Francisco, CA; Seattle, WA; New …8h ago
-
Mid-level Full TimeScottsdale, AZ11h ago
-
Principal Engineer, Data & ML Platform USD 120K-180KAPIs | Automated testing | Batch Processing | Cloud platform | Data ModelingSenior-level Full TimeScottsdale, AZ11h ago
-
Principle Data Engineer USD 220K-235KAWS | Airflow | BigQuery | Capacity Planning | Compliance401k | Equity | Essential equipment | Flexible PTO | Fully remoteSenior-level Full TimeCleveland, OH R11h ago
-
Data Science and AI Intern USD 50K-50KAWS | Cloud Computing | DBT | Data Visualization | ETLFree daily on site lunches | Free on site EV charging | Latest hardware | On-site gym | Open & transparent cultureEntry-level InternshipMenlo Park, CA12h ago
-
Software Engineer USD 153K-237KAPI | API Gateway | API Management | AWS | Apache Airflow401k employer match | Employer Disability Insurance | Employer health insurance | Employer life insurance | Paid HolidaysSenior-level Full TimeChantilly, VA12h ago
-
Machine Learning Engineer, PhD Intern USD 123K-161KAWS | Azure | Code review | Data Analysis | ExperimentationIn office work 5 days per week | Mentorship | Structured intern programming | Team eventsEntry-level InternshipSan Francisco, CA13h ago
-
Data Engineer II USD 150K-180KAWS | Apache Airflow | Apache Kafka | Apache Spark | Argo Workflows401k match | CLEAR Plus membership | Catered lunches | Family building benefits | Flexible time offMid-level Full TimeNew York, NY, United States14h ago
-
Software Engineer, Data Platform USD 105K-132KAPI | AWS | CI/CD | Code review | DBT401k | Baby bonding leave | Commuter benefits | Disability insurance | Employee referral programSenior-level Full TimeUS Remote R14h ago
-
Principal Software Engineer - Storage Cache USD 295K-345KActive/Active | Alertmanager | C++ | Chaos Engineering | Container OrchestrationEquity compensationSenior-level Full TimeSan Mateo, CA, United States R14h ago
-
Senior Software Engineer, Data Platform USD 125K-156KAPI Development | AWS | Automation | CI/CD | Data Engineering401k benefits | Baby bonding leave | Commuter benefits | Disability insurance | Employee referral programSenior-level Full TimeUS Remote R14h ago
-
Agents | Golang | Information Retrieval | Language Models | Language ProcessingRelocation support if required | Remote work flexibilitySenior-level Full TimeMountain View, CALIFORNIA, United States15h ago
-
Senior Software Engineer, Compute Infrastructure USD 164K-205KAWS | Azure | C++ | Distributed Systems | DockerSenior-level Full TimeSan Francisco, California16h ago
-
AWS | AWS CUR | Cloud Cost Management | Cost Management | DBT401k match | Flexible time off | Health savings account | MacBook laptop | Profit sharingMid-level Full TimeSt. Louis, MO16h ago
-
A/B | A/B Testing | AWS | Adversarial Testing | Amazon SQSHybrid work | W2 employmentSenior-level Contract Full TimeIrvine, CA, United States R17h ago
-
Data Engineer USD 112K-168KAWS | Airflow | Azure | CI/CD | CSV401k match | Dental insurance | Family leave | Health insurance | Legal insuranceMid-level Full TimeSanta Monica, CA, United States17h ago
-
Alerting | Backup and Restore | CI/CD | Canary deployments | ClickHouseSenior-level Full TimeSan Francisco17h ago
-
Principal Engineer, AI Architect USD 190K-210KCloud Architecture | Conversational AI | Enterprise SaaS | GCP | Generative AI401k match | Critical illness insurance | Dedicated WeWork space | Employee assistance program | Employee stock purchase programSenior-level Full TimeRemote - Colorado R17h ago
-
Principal Engineer, AI Architect USD 190K-210KCloud Native | Conversational AI | Enterprise Architecture | GCP | Generative AI401k match | Dental insurance | Development resources | Employee assistance program | Employee stock purchase programSenior-level Full TimeRemote - Colorado R17h ago
-
Data Engineer USD 130K-260KAWS | Amazon RDS | Apache NiFi | Apache Spark | Apache Superset401k match | Employer paid training budget | Employer-paid disability insurance | Employer-paid health insurance | Employer-paid life insuranceMid-level Full TimeMcLean, VA17h ago
-
Senior ADAS Data Infrastructure Engineer USD 122K-153KAmazon Elastic Kubernetes Service | Amazon S3 | Amazon Web Services | Apache Beam | Apache KafkaCollaboration culture | Continuous learning cultureSenior-level Full TimeAtlanta, GA17h ago