aijobs.net

Staff Software Engineer, LLM Serving and GPU Performance, Google Distributed Cloud

Sunnyvale, CA, USA; Kirkland, WA, USA

USD 207K-300K | Senior-level | Full Time

Found 3h ago
Skills/Tech-stack

AI Model Serving | AI model | Benchmarking | Cache Management | Data Analysis | Data Visualization | Debugging | Disaggregated serving | Distributed Systems | GPU Performance | GPU performance tuning | High-Performance Computing | KV cache | KV-cache management | Low-level optimization | Memory Management | Model Serving | Performance Engineering | Performance Tuning | Quantization | Software Architecture | Speculative decoding | TPU optimization

Education

Bachelor of Science | Master of Science | PhD

Roles

Engineer | Software Engineer | Staff Software Engineer

Regions

North America

Countries

United States

States

California, US | Washington, US

Cities

Sunnyvale, California, US | Kirkland, Washington, US

