Staff Software Engineer, GPU Performance
Kirkland, WA, USA; Sunnyvale, CA, USA
USD 207K-301K | Senior-level | Full Time
Tasks
- Analyze performance and efficiency metrics
- Design and implement fleet-wide performance solutions
- Drive XLA and Triton performance
- Identify and maintain LLM benchmarks
- Identify bottlenecks
- Perform roofline analysis
- Run GPU performance benchmarks
- Run architecture-level GPU simulations
- Solve ML model performance problems with cross-functional teams
- Use benchmarks to drive performance improvements
Skills/Tech-stack
C Programming | CUDA | CUDA C | CUDA C Programming | Code generation | Compiler optimization | CUTLASS | GPU Architecture | GPU Programming | LLM Deployment | Language Models | Large Language Models | Low-level GPU programming | MLIR | Memory hierarchies | OpenXLA | OpenXLA GPU | Performance Engineering | Roofline analysis | Runtime Systems | Triton | XLA