Staff Software Engineer, GPU Performance
Sunnyvale, CA, USA; Kirkland, WA, USA
USD 207K-300K Senior-level Full Time
Tasks
- Analyze performance and efficiency metrics
- Drive XLA to GPU and Triton performance toward XLA releases
- Identify and maintain LLM training and serving benchmarks
- Identify bottlenecks and design solutions
- Perform roofline analysis for GPU designs
- Run GPU performance benchmarks using TRT LLM vLLM SGLang
- Run architecture level GPU simulations
- Solve ML model performance problems with cross functional teams
Perks/Benefits
- N/A
Skills/Tech-stack
AMD | CUDA | Code generation | Compiler optimization | Cutlass | GPU Architecture | GPU Performance | LLM | MLIR | Memory hierarchy | NVIDIA | OpenXLA | Performance bottlenecks | Roofline analysis | Runtime Systems | Triton | XLA
Education
Regions
Countries
States
Related jobs
-
Amazon S3 | Data Engineering | Data Modeling | Data Pipelines | Data QualitySenior-level Full TimeNew York12h ago
-
Amazon S3 | Automation | Data Engineering | Data Modeling | Data Pipelines401k match | Dental insurance | Life insurance | Long-term disability | Medical insuranceSenior-level Full TimePrinceton12h ago
-
Lead Databricks Forward Deployed Engineer - GPS USD 189K-372KAPI Integration | AWS | Airflow | Apache Spark | AzureSenior-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …13h ago
-
Software Engineer, Systems ML - SW/HW Co-design USD 117K-173KAI infrastructure | Bias Mitigation | C# | C++ | Co-designSenior-level Full TimeSunnyvale, CA | Redmond, WA14h ago
-
Senior Staff Software Engineer, AI Innovation USD 262K-365KC++ | Cross-Functional Collaboration | Cross-functional | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeMountain View, CA, USA14h ago
-
Staff Software Engineer, AI/ML Performance USD 207K-300KAlgorithms | Auto sharding | C++ | Code debugging | Code generationSenior-level Full TimeSunnyvale, CA, USA14h ago
-
Principal AI/ML Engineer USD 165K-226KC# | C++ | CI/CD | CUDA | Computer Vision401k match | Dental insurance | Health insurance | Life insurance | Paid time offSenior-level Full TimeRemote PA - PA PAR, United … R1d ago
-
Data Engineer USD 122K-253KAI workloads | AWS | Access Control | Cloud Based AI Workloads | Cloud ComputingSenior-level Full TimeVA543: 22270 Pacific Blvd, Dulles 22270 …1d ago
-
Machine Learning Research Engineer USD 99K-225KBenchmarking | Code review | Computer Vision | Conformal Prediction | Contrastive LearningPaid leave | Professional development | Tuition assistanceMid-level Full TimeUSA, VA, Springfield (7500 Geoint Dr), …1d ago
-
Machine Learning Engineer, Specialist USD 131K-170KAPI Development | AWS Glue | Amazon S3 | Amazon SageMaker | AzureHybrid work modelMid-level Full TimeMalvern, PA, United States1d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | CUDA | Compiler optimization | Continuous batchingCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Perception Engineer, Machine Learning USD 166K-220KAutomated testing | C++ | CI/CD | CUDA | Camera CalibrationMid-level Full TimeSeattle, Washington, United States1d ago
-
AI Engineer - Nexus Black USD 90K-100KCI/CD | Cloud Computing | Embeddings | Evaluation | Function Calling401k | Community volunteering events | Dental insurance | Disability benefits | Flexible paid time offSenior-level Full TimeHouston, Texas, United States1d ago
-
Staff Software Engineer, Deep Learning Acceleration USD 189K-274KC++ | CUDA | Computer Vision | Deep learning | GPU Memory OptimizationAnnual bonus | Benefits | Equity compensation | Hybrid work environmentSenior-level Full TimeSan Francisco, California1d ago
-
Staff Software Engineer, Deep Learning Acceleration USD 189K-274KC++ | CUDA | Computer Vision | GPU Memory Optimization | GPU memoryHybrid work environmentSenior-level Full TimeMountain View, California1d ago
-
Staff Software Engineer, Deep Learning Acceleration USD 171K-247KC++ | CUDA | Computer Vision | GPU Memory Optimization | GPU memoryAnnual bonus | Equity compensation | Hybrid work environmentSenior-level Full TimePittsburgh, Pennsylvania1d ago
-
GTM Engineer USD 136K-150KAPIs | Agent Frameworks | Artificial Intelligence | BI Analytics | Data PipelinesSenior-level Full TimeRemote - US, PST preferred R1d ago
-
Senior Solution Engineer USD 165K-216KAPIs | AWS | Apache Airflow | Apache Kafka | Apache Spark401k | Flexible PTO | Health/Dental/Vision | Professional development budgetSenior-level Full TimeUS-TX-Remote R1d ago
-
Cybersecurity AI_ML Engineer USD 120K-145KAdversarial Machine Learning | Anomaly Detection | Application Firewall | Classification | Cloud Security401k matching | Bonding Leave | Community service pay | Flexible-hybrid work | GM employee discountMid-level Full TimeIrving, TX, United States1d ago
-
Entry-level Full TimeNew York, NY, United States1d ago
-
Entry-level Full TimeNew York, NY, United States1d ago
-
AI & Data Solutions Senior Manager (GCP) USD 119K-198KAgentic Frameworks | BigQuery | Cloud Native | Cloud Native Architecture | Cloud StorageSenior-level Full TimePhiladelphia, Pennsylvania, United States1d ago
-
AI Engineer USD 120K-180KAWS Bedrock | AWS SageMaker | Algorithms | Amazon ECS | ClassificationDental insurance | Health insurance | Paid time off | Retirement contributions | Vision insuranceMid-level Full TimeBoston, MA2d ago
-
Senior Software Engineer (AI/ML & Agentic) USD 83K-203KAI | API | Adjudication Systems | Agent systems | Agentic AISenior-level Full TimeRichardson-909 E Collins Blvd, United States2d ago
-
API | AWS | Amazon SageMaker | Azure | Azure Machine LearningContract position | Remote workMid-level ContractUnited States - Remote R2d ago