Staff Software Engineer, LLM Serving and GPU Performance, Google Distributed Cloud
USD 207K-300K Senior-level Full Time
Tasks
- Analyze performance of LLMs
- Benchmark large language models
- Build infrastructure and tooling for deep profiling
- Deploy models into production with SRE
- Enhance serving stack
- Identify bottlenecks in compute memory and networking
- Improve latency throughput and resource utilization
- Maximize hardware efficiency
- Optimize KV cache management
- Prototype disaggregated serving
- Prototype speculative decoding
Perks/Benefits
- N/A
Skills/Tech-stack
AI Model Serving | AI model | Benchmarking | Cache Management | Data Analysis | Data Visualization | Debugging | Disaggregated serving | Distributed Systems | GPU Performance | GPU performance tuning | High Performance | High-Performance Computing | KV cache | KV-cache management | Level optimization | Low-level optimization | Memory Management | Model Serving | Performance Computing | Performance Engineering | Performance Tuning | Quantization | Software Architecture | Speculative decoding | TPU optimization
Education
Roles
Regions
Countries
States
Related jobs
-
Artificial Intelligence | Data Modeling | Data Pipelines | Data Quality | Data Visualization401k match | Dental insurance | Life insurance | Medical insurance | Paid time offSenior-level Full TimeNew York1h ago
-
Data-Driven Decision Making | Data-driven | Decision Making | Deep learning | Distributed TrainingSenior-level Full TimeSunnyvale, CA2h ago
-
Production Engineer USD 178K-200KApache | Apache Spark | Application Programming | Application Programming Interfaces | C++Entry-level Full TimeMenlo Park, CA2h ago
-
Research Engineer - MSL FAIR Foundations USD 117K-173KBenchmarking | Code review | Data Pipelines | Distributed Systems | Language ModelEntry-level Full TimeMenlo Park, CA2h ago
-
Staff Software Engineer, AI Data Generation Platform USD 207K-300KComputer Vision | Data Engineering | Data Processing | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeSunnyvale, CA, USA3h ago
-
C plus plus | C++ | Cloud Spanner | Cloud Storage | Cloud platformSenior-level Full TimeSunnyvale, CA, USA3h ago
-
Machine Learning Researcher, Multimodal LLMs USD 140K-250KAudio codecs | Data Analysis | Experiment design | Fine Tuning | Language ModelsDental insurance | Equity | Health insurance | High autonomy | High impactSenior-level Full TimeSan Francisco13h ago
-
Deployed Engineer (Phoenix) USD 150K-250KAWS | Agent Frameworks | Azure | Cloud Computing | Containers401k plan | Dental insurance | Flexible vacation | Meals on in office days | Medical insuranceSenior-level Full TimePhoenix, AZ14h ago
-
Senior Data Engineer USD 239K-271KAirflow | Alerting | Amazon Redshift | Automated testing | Cost OptimizationFamily planning support | Flexible time off | Lifestyle stipend | Mental health support | Paid parental leaveSenior-level Full TimeSan Francisco, CA R14h ago
-
Machine Learning Engineer, Customer Support Engineering USD 162K-186KArtificial Intelligence | Fine Tuning | Human Feedback | Language Models | Large Language ModelsSenior-level Full TimeRemote-USA R14h ago
-
Infrastructure Data Engineer USD 140K-180KApache Iceberg | Cloud Computing | Containers | Data Governance | Data LineageMid-level Full TimeBoston, MA16h ago
-
Senior-level Full TimePalo Alto16h ago
-
Senior-level Full TimePalo Alto16h ago
-
ARM Cortex | ARM Cortex-M | Audio Processing | C# | C++Entry-level InternshipAustin, Texas16h ago
-
Senior AI Engineer USD 170K-225KAPIs | AWS | Agent systems | Azure | CI/CDCross-functional team | Fast-paced environment | Healthcare innovation focus | In-office collaborationSenior-level Full TimePalo Alto16h ago
-
Access Control | Amazon Web Services | Apache Airflow | C++ | CI/CD401k match | Employee wellness programs | Family leave | Health care benefits | Parental leaveSenior-level Full TimeUnited States17h ago
-
Analytical Engineer USD 60K-80KDAX | Data Governance | Data Modeling | Data Validation | Data VisualizationMid-level Full TimeLancaster, PA, United States18h ago
-
Sr. Staff Machine Learning Engineer USD 154K-220KCaching | Data Aggregation | Data Pipelines | Data Processing | Distributed SystemsEducation reimbursement | Health plans | Hybrid work | Paid time off | Parental leaveSenior-level Full TimeSan Jose, California, USA19h ago
-
Senior Data Engineer USD 115K-145KApache Airflow | Apache Flink | Apache Spark | Cloud Computing | Data Modelling401k | Dental insurance | Discounts | Medical insurance | Paid leaveSenior-level Full TimeNew York, NEW YORK, United States R19h ago
-
AI Search | Amazon SageMaker | Application development | Azure AI | Azure AI Search401k | Dental insurance | Medical insurance | Paid sick hours | Vision insuranceSenior-level Contract Full TimeRidgefield Park, NJ, United States20h ago
-
AI Automation | Apache Airflow | Apache Kafka | Apache Spark | Cloud Computing401k match | Child care benefits | Family building benefits | Lyft Pink membership | Lyft creditsSenior-level Full TimeSeattle, WA R21h ago
-
AI | Apache Airflow | Apache Kafka | Apache Spark | Automation401k match | Child care benefits | Commuter benefits | Dental insurance | Family building benefitsSenior-level Full TimeSan Francisco, CA R21h ago
-
IT AI&S Senior Data Engineer USD 150K-196KAWS | Apache Spark | Azure | Data Architecture | Data GovernanceSenior-level Full TimeDallas, TX, United States21h ago
-
Forward Deployed Engineer USD 120K-180KAI code generation | API Development | AWS | Anthropic API | Azure401k | Company equity | Health insurance | Life insurance | Mental health supportMid-level Full TimeUnited States21h ago
-
AI Engineer I USD 104K-156KAgentic AI | Apache Spark | Async Processing | Async Processing Pipelines | Backend systemsMid-level Full TimeBoston, MA21h ago