Staff Software Engineer, TPU Performance
Tasks
- Analyze performance and efficiency metrics
- Apply model co design quantization and sparsity techniques
- Collaborate with product teams to onboard ML models onto TPU hardware
- Design and implement performance solutions
- Identify bottlenecks at fleet wide scale
- Improve compiler and runtime performance
- Maintain ML training and serving benchmarks
- Optimize performance and throughput
- Perform TPU fleet efficiency analysis
- Use benchmarks to identify performance opportunities
Perks/Benefits
- N/A
Skills/Tech-stack
Algorithm Design | Benchmarking | Compiler | Data Analysis | Data Structures | Data Structures and Algorithms | Debugging | Distributed Systems | Hardware-aware algorithm design | JAX | Low-level programming | Machine Learning | Memory hierarchy | OpenXLA | Performance Modeling | Performance optimization | PyTorch | Quantization | Runtime | Software Architecture | Sparsity | Tensor Processing Unit | Tensor processing | Visualization Tools
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
AIOps | Anomaly Detection | C# | C++ | Chaos Engineering401k match | Dental insurance | Life insurance | Long-term disability | Medical insuranceSenior-level Full TimeNew York3h ago
-
Apache Flink | CSS | Distributed Systems | Docker | Go401k match | Dental insurance | Life insurance | Medical insurance | Paid time offSenior-level Full TimeNew York3h ago
-
Palantir Data Engineer USD 107K-145KData Analysis | Data Engineering | Data Pipelines | DevOps | Palantir FoundryProfessional development | Travel opportunitiesEntry-level Full TimeHuntsville, Alabama, United States3h ago
-
Container Orchestration | Distributed Systems | GPU Acceleration | Kubernetes | LLM InferenceCareer growth opportunities | Collaborative engineering environment | Global datacenter exposure | Hyper scale environment | Open source contribution opportunitiesEntry-level Full TimeSeattle, Washington, United States3h ago
-
Dataset Construction | Efficient Inference | Human Feedback | Instruction Tuning | Language ModelsSenior-level Full TimeSeattle, Washington, United States4h ago
-
Continuous Learning | Data Engineering | Efficient Inference | Human Feedback | Instruction TuningSenior-level Full TimeSan Jose, California, United States4h ago
-
Sr Machine Learning Engineer, Recommendations - USDS USD 177K-280KClick Through Rate | Click Through Rate Prediction | Conversion Rate | Conversion Rate Prediction | Data PipelinesSenior-level Full TimeSan Jose, California, United States4h ago
-
AI Research Engineer USD 177K-251KBenchmarks | Data Pipelines | Data Versioning | Evaluation | Fine TuningCross-functional collaboration | End-to-end ownership | High autonomyMid-level Full TimeBellevue, WA | Menlo Park, CA …4h ago
-
Data Engineer USD 185K-196KApache Spark | Artificial Intelligence | CSS | Data Governance | Data ModelingMid-level Full TimeMenlo Park, CA4h ago
-
Entry-level Full TimeMenlo Park, CA4h ago
-
Privacy Engineer USD 194K-217KApache Airflow | Apache Spark | Automated testing | C plus plus | Continuous DeploymentEntry-level Full TimeMenlo Park, CA4h ago
-
Software Engineer III, AI/ML GenAI, Google Cloud Compute USD 147K-211KAudio generation | C++ | Computer Vision | Data Processing | Data StorageSenior-level Full TimeKirkland, WA, USA5h ago
-
Senior Software Engineer, AI/ML, Google Cloud AI USD 174K-252KC++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Software Engineer, TPU Inference, AI/ML USD 147K-211KCloud Computing | Compilers | GPU | GPU Programming | InferenceMid-level Full TimeKirkland, WA, USA5h ago
-
Backup and Recovery | BigQuery | C++ | Consistency models | Copy-on-writeSenior-level Full TimeKirkland, WA, USA5h ago
-
Software Engineer, Machine Health USD 147K-211KAnalysis and Design | C++ | Data Processing | Data analytics | Distributed SystemsMid-level Full TimeSunnyvale, CA, USA5h ago
-
Networking AI Technical Lead USD 207K-300KAI | Algorithms | C++ | Compute Technologies | Data StructuresSenior-level Full TimeSunnyvale, CA, USA; Cambridge, MA, USA5h ago
-
Staff Software Engineer, BigQuery Managed Storage USD 207K-300KApache Iceberg | Data Lakes | Data Structures | Data Structures and Algorithms | Distributed ComputingSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA5h ago
-
Software Engineer III, AI/ML, Health and Home USD 147K-211KData Analysis | Data Processing | Data Storage | Debugging | Distributed ComputingSenior-level Full TimeMountain View, CA, USA5h ago
-
Customer Engineer II, Applied AI, Google Cloud USD 149K-216KApplication Architecture | Artificial Intelligence | C++ | Cloud Computing | Cloud platformSenior-level Full TimeNew York, NY, USA; Chicago, IL, …5h ago
-
Software Engineer III, AI/ML GenAI, Google Cloud AI USD 147K-211KC++ | Data Processing | Debugging | Generative AI | Language ModelsSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Senior Software Engineer, Full Stack, Gen AI Formats USD 174K-252KArtificial Intelligence | C++ | Data Storage | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeMountain View, CA, USA5h ago
-
Customer Engineer, Data Analytics, Google Cloud USD 153K-222KBatch Processing | Big Data | Cloud Native | Cloud Native Architecture | Cloud platformSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Senior Software Engineer, Managed Kafka USD 174K-252KApache Kafka | Cloud platform | Distributed Systems | Go | Google CloudSenior-level Full TimeNew York, NY, USA; Raleigh, NC, …5h ago
-
Senior Software Engineer, AI/ML, Retail Ads, Fullstack USD 174K-252KAlgorithms | C++ | Data Processing | Data Structures | DebuggingSenior-level Full TimeMountain View, CA, USA5h ago