Lead Member of Technical Staff, Inference Infrastructure
Tasks
- Collaborate with cross-functional teams
- Deploy and operate large language model API endpoints
- Design and lead machine learning model serving architecture
- Drive infrastructure strategy for low-latency, high-throughput systems
- Lead design of customized customer deployments
- Mentor engineers and set technical standards
- Own compute, storage, and network resource and cost management
- Troubleshoot production Kubernetes and GPU workloads
Perks/Benefits
- Co-working stipend
- Health and dental benefits
- Mental health support budget
- Open and inclusive culture
- Parental leave top-up
- Remote-flexible work
- Vacation time
- Weekly lunch stipend
Skills/Tech-stack
AWS | Azure | C++ | Cost Optimization | Distributed Systems | GCP | GPU Computing | Golang | High Availability | High Throughput | Hybrid Cloud | Kubernetes | Linux | Low Latency | Multi-cloud | OCI | Resource Management
Education
N/A