Principal Engineer, GKE Platform for AI Inference Workloads
USD 307K-427K Senior-level Full Time
Tasks
- Build optimized scalable distributed LLM serving
- Collaborate with Kubernetes ecosystem for upstream initiatives
- Define GKE evolution for massive scale inference and RL
- Enable high throughput networking for inference workloads
- Lead architectural direction for llm d
- Partner with AI model builders to develop AI first roadmap
- Schedule multi host TPU GPU workloads
- Solve orchestration problems in dynamic resource allocation
Perks/Benefits
- N/A
Skills/Tech-stack
AI infrastructure | Accelerator Virtualization | Container Runtime | Distributed Systems | GPU | Google Kubernetes | Google Kubernetes Engine | Hardware Accelerators | High Performance | High-performance networking | Inference Serving | Kubernetes | Kubernetes Engine | LLM Debugger | Language Models | Large Language Models | Machine Learning | Multi host Scheduling | NCCL | RDMA | Reinforcement Learning | Storage caching | TPU
Education
Roles
Architect | Engineer | Platform | Platform Engineer | Principal | Principal Engineer | Software Architect
Regions
Countries
States
Related jobs
-
Automation Testing | CI/CD | CSS | Cypress | Feature DevelopmentMedical, dental & vision coverage | Paid time off | Parental leave | Reimbursement programs | Retirement planMid-levelRaleigh, United States R9d ago
-
Research Engineer, FAIR AI & Society USD 117K-173KAI alignment | Apache Hive | Apache Spark | Artificial Intelligence | DPOEntry-level Full TimeMenlo Park, CA | New York, …4h ago
-
AI Research Engineer, FAIR Chemistry USD 147K-208KApplied Mathematics | Artificial Intelligence | Computational modeling | Computational statistics | Data ScienceCollaborative research environment | Open source contributions | Reproducible researchSenior-level Full TimeSan Francisco, CA4h ago
-
AI Compression | AV1 | AV2 | Audio CODEC | Automated testingSenior-level Full TimeBellevue, WA | Menlo Park, CA …4h ago
-
Software Engineer, Video AI/ML Specialist USD 141K-208KAV synchronization | AV1 | AV2 | Audio CODEC | Automated testingMid-level Full TimeBellevue, WA | Menlo Park, CA4h ago
-
Software Engineer, Systems USD 221K-240KAlgorithms | CSS | Data Analysis | Data Modeling | Data ProcessingEntry-level Full TimeBellevue, WA4h ago
-
C++ | Debugging | Distributed Systems | Google Cloud | InfrastructureSenior-level Full TimeMadison, WI, USA4h ago
-
Staff Software Engineer, AI/ML, YouTube USD 207K-300KAudio Processing | Data Processing | Debugging | Distributed Systems | Fine TuningSenior-level Full TimeSan Bruno, CA, USA4h ago
-
Senior Staff Software Engineer, AI/ML, Google Cloud USD 262K-365KData Processing | Data Structures | Data Structures and Algorithms | Debugging | Fine TuningSenior-level Full TimeSunnyvale, CA, USA4h ago
-
Senior Software Engineer, AI/ML, Google Cloud AI USD 174K-252KC++ | Data Processing | Data Structures | Data structures algorithms | DebuggingSenior-level Full TimeMountain View, CA, USA4h ago
-
Senior Software Engineer, AI and Infrastructure USD 174K-252KArtificial Intelligence | C++ | Code review | Data Storage | Data StructuresSenior-level Full TimeSunnyvale, CA, USA4h ago
-
Staff Software Engineer, AI/ML Data Processing USD 207K-300KAPIs | Checkpointing | Cloud technologies | Data Processing | DebuggingSenior-level Full TimeSunnyvale, CA, USA4h ago
-
2026 Summer Internship | Data Analytics & AI USD 50K-90KDashboarding | Data Engineering | Data Pipelines | Data analytics | ETLCareer development opportunities | Community service opportunities | Executive spotlights | Hybrid schedule | Professional development sessionsEntry-level InternshipMiramar, FL, US, 331329h ago
-
Senior Machine Learning Engineer USD 150K-200KDistributed Systems | Feature Engineering | Feature Selection | Language Models | Language Processing401k matching | Cell phone and internet stipend | Employee stock purchase plan | Flexible time off | Learning programsSenior-level Full TimeRemote - USA R11h ago
-
Research Engineer, Gemini AutoRater USD 166K-244KData collection | Fine Tuning | Foundation Models | Human evaluation | Language ModelsSenior-level Full TimeMountain View, California, US13h ago
-
Software Engineer - Reference Data (Java/SQL) USD 175K-200KAPI Development | Distributed Systems | Git | HBase | HadoopMid-level Full TimeNew York13h ago
-
Senior LLM Software Engineer USD 191K-287KAWS | Agentic Automation | Data Pipelines | Event Driven | Event-driven architecture401k retirement plan | Caregiver leave | Commuter benefits | Dental insurance | Disability insuranceSenior-level Full TimeCosta Mesa, California, United States13h ago
-
Senior-level Full TimeHybrid (Salt Lake City, UT, US) R15h ago
-
Lead Data Architect USD 173K-279KAWS Redshift | Apache Flink | Apache Kafka | Apache Spark | Artificial IntelligenceSenior-level Full TimeChicago, Illinois, USA R15h ago
-
Machine Learning Engineer, Geometry Team USD 175K-215K3D Spatial Reasoning | C++ | Computer Vision | Distributed Training | JAXSenior-level Full TimeKirkland, Washington, United States15h ago
-
Lead- Full Stack Engineer USD 150K-190KAWS Fargate | AWS Lambda | Amazon RDS | Amazon S3 | CI/CDCareer development opportunities | Entrepreneurial environment | Equal employment opportunity | High responsibilityMid-level Full TimeMcLean, Virginia, United States16h ago
-
Robotics Perception Engineer USD 140K-204KC++ | CI/CD | Camera-based perception | Cameras | Cloud processingCompany-provided laptop | Dental insurance | Health insurance | Paid time off | Remote work optionMid-level Full TimeAtlanta, Georgia, United States - Remote R16h ago
-
Lead AI Engineer (ML Ops) USD 116K-170KAPI Development | AWS | Azure | CI/CD | Cloud services401k match | Charity match | Employee assistance program | Employee stock purchase plan | Health and wellness allowanceSenior-level Full TimeIrving - 6011 Connection, United States16h ago
-
AI Search | AWS | Amazon SageMaker | Azure AI | Azure AI Search401k | Dental insurance | Medical insurance | Paid sick hours | Vision insuranceSenior-level Contract Full TimeRidgefield Park, NJ, United States16h ago
-
Senior-level Full TimeRedmond, WA, US17h ago