Staff Software Engineer, LLM Serving and GPU Performance, Google Distributed Cloud
USD 207K-300K Senior-level Full Time
Tasks
- Analyze performance of LLMs
- Benchmark large language models
- Build infrastructure and tooling for deep profiling
- Deploy models into production with SRE
- Enhance serving stack
- Identify bottlenecks in compute memory and networking
- Improve latency throughput and resource utilization
- Maximize hardware efficiency
- Optimize KV cache management
- Prototype disaggregated serving
- Prototype speculative decoding
Perks/Benefits
- N/A
Skills/Tech-stack
AI Model Serving | AI model | Benchmarking | Cache Management | Data Analysis | Data Visualization | Debugging | Disaggregated serving | Distributed Systems | GPU Performance | GPU performance tuning | High Performance | High-Performance Computing | KV cache | KV-cache management | Level optimization | Low-level optimization | Memory Management | Model Serving | Performance Computing | Performance Engineering | Performance Tuning | Quantization | Software Architecture | Speculative decoding | TPU optimization
Education
Roles
Regions
Countries
States
Related jobs
-
AWS | Agentic AI | Angular | CI/CD | DatabricksHybrid work | Technical mentorshipSenior-level Full TimeNormal, United States2h ago
-
Sr. Data Engineer USD 108K-158KAWS | Apache Spark | Automated testing | Azure Event | Azure Event Hubs401k matching | Dental insurance | Disability insurance | Educational growth | Employee discount programSenior-level Full TimeNew York-TONAWANDA2h ago
-
Research Scientist - LLM Training System as a Service - Global Frontier Tech Recruitment Program - 2027 Start (PhD) USD 212K-450KCUDA | Deep learning | Distributed Systems | GPU Performance | GPU Performance OptimizationEntry-level Full TimeSan Jose, California, United States3h ago
-
Artificial Intelligence | Data Modeling | Data Pipelines | Data Quality | Data Visualization401k match | Dental insurance | Life insurance | Medical insurance | Paid time offSenior-level Full TimeNew York3h ago
-
Data-Driven Decision Making | Data-driven | Decision Making | Deep learning | Distributed TrainingSenior-level Full TimeSunnyvale, CA4h ago
-
Production Engineer USD 178K-200KApache | Apache Spark | Application Programming | Application Programming Interfaces | C++Entry-level Full TimeMenlo Park, CA4h ago
-
Research Engineer - MSL FAIR Foundations USD 117K-173KBenchmarking | Code review | Data Pipelines | Distributed Systems | Language ModelEntry-level Full TimeMenlo Park, CA4h ago
-
Staff Software Engineer, AI Data Generation Platform USD 207K-300KComputer Vision | Data Engineering | Data Processing | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeSunnyvale, CA, USA4h ago
-
C plus plus | C++ | Cloud Spanner | Cloud Storage | Cloud platformSenior-level Full TimeSunnyvale, CA, USA4h ago
-
Machine Learning Researcher, Multimodal LLMs USD 140K-250KAudio codecs | Data Analysis | Experiment design | Fine Tuning | Language ModelsDental insurance | Equity | Health insurance | High autonomy | High impactSenior-level Full TimeSan Francisco14h ago
-
Deployed Engineer (Phoenix) USD 150K-250KAWS | Agent Frameworks | Azure | Cloud Computing | Containers401k plan | Dental insurance | Flexible vacation | Meals on in office days | Medical insuranceSenior-level Full TimePhoenix, AZ15h ago
-
Senior Data Engineer USD 239K-271KAirflow | Alerting | Amazon Redshift | Automated testing | Cost OptimizationFamily planning support | Flexible time off | Lifestyle stipend | Mental health support | Paid parental leaveSenior-level Full TimeSan Francisco, CA R15h ago
-
AI/ML Subject Matter Expert (SME) / Analytics Team Lead USD 128K-206KData Visualization | Feature Engineering | Language Processing | Machine Learning | Model Evaluation100 percent on site | Active secret clearance requiredSenior-level Full TimeArlington, VA15h ago
-
Data Engineer USD 124K-149KAutomation | Data Analysis | Data Modeling | Data Security | Data Validation401k match | Paid time offSenior-level Full TimeUSA DC Washington - 330 C …15h ago
-
Machine Learning Engineer, Customer Support Engineering USD 162K-186KArtificial Intelligence | Fine Tuning | Human Feedback | Language Models | Large Language ModelsSenior-level Full TimeRemote-USA R15h ago
-
Infrastructure Data Engineer USD 140K-180KApache Iceberg | Cloud Computing | Containers | Data Governance | Data LineageMid-level Full TimeBoston, MA17h ago
-
Senior-level Full TimePalo Alto18h ago
-
Senior-level Full TimePalo Alto18h ago
-
ARM Cortex | ARM Cortex-M | Audio Processing | C# | C++Entry-level InternshipAustin, Texas18h ago
-
Senior AI Engineer USD 170K-225KAPIs | AWS | Agent systems | Azure | CI/CDCross-functional team | Fast-paced environment | Healthcare innovation focus | In-office collaborationSenior-level Full TimePalo Alto18h ago
-
Access Control | Amazon Web Services | Apache Airflow | C++ | CI/CD401k match | Employee wellness programs | Family leave | Health care benefits | Parental leaveSenior-level Full TimeUnited States19h ago
-
Sr Gen AI Engineer - NY/NJ USD 40K-141KAPI Security | Asynchronous programming | CI/CD | Docker | Embedding Models401k | Medical/Dental/Vision | Paid Holidays | Paid time offSenior-level Full TimeUnited States19h ago
-
Analytical Engineer USD 60K-80KDAX | Data Governance | Data Modeling | Data Validation | Data VisualizationMid-level Full TimeLancaster, PA, United States20h ago
-
Sr. Staff Machine Learning Engineer USD 154K-220KCaching | Data Aggregation | Data Pipelines | Data Processing | Distributed SystemsEducation reimbursement | Health plans | Hybrid work | Paid time off | Parental leaveSenior-level Full TimeSan Jose, California, USA20h ago
-
Senior Data Engineer USD 115K-145KApache Airflow | Apache Flink | Apache Spark | Cloud Computing | Data Modelling401k | Dental insurance | Discounts | Medical insurance | Paid leaveSenior-level Full TimeNew York, NEW YORK, United States R21h ago