Staff Software Engineer, GPU Performance
Sunnyvale, CA, USA; Kirkland, WA, USA
USD 207K-300K Senior-level Full Time
Tasks
- Analyze performance and efficiency metrics
- Drive XLA to GPU and Triton performance toward XLA releases
- Identify and maintain LLM training and serving benchmarks
- Identify bottlenecks and design solutions
- Perform roofline analysis for GPU designs
- Run GPU performance benchmarks using TRT LLM vLLM SGLang
- Run architecture level GPU simulations
- Solve ML model performance problems with cross functional teams
Perks/Benefits
- N/A
Skills/Tech-stack
AMD | CUDA | Code generation | Compiler optimization | Cutlass | GPU Architecture | GPU Performance | LLM | MLIR | Memory hierarchy | NVIDIA | OpenXLA | Performance bottlenecks | Roofline analysis | Runtime Systems | Triton | XLA
Education
Regions
Countries
States
Related jobs
-
SYSTEM ENGINEER - Computer Network Support - AI/ML - 6+ yrs of Experience - TS/SCI w/Poly clearance is required - ES A USD 136K-140KArtificial Intelligence | Confluence | Jira | LLM | Machine Learning401k retirement plan | Dental insurance | Life insurance | Medical insurance | Paid time offMid-level Full TimeFort George G Meade, United States17h ago
-
APIs | CI/CD | Cloud platform | Compliance | ContainersAnnual leave | Dental coverage | Health coverage | High autonomy | Home office setup supportSenior-level Full TimeCanada R19h ago
-
AWS | Azure | C# | C++ | ExperimentationSenior-level Full TimeNew York, NY, United States1d ago
-
Mid-level Full TimeUnited States Remote, United States R1d ago
-
AI Agent | AI Agent Development | Agent Development | Analytics | BI reportingMid-level Full TimeUS-CA-Menlo Park1d ago
-
ML Engineer, Generative Video USD 175K-275KAutoregressive models | CUDA | Debugging | Deep learning | Diffusion Models401k match | Catered lunch | Commuter benefits | Dinner stipend | Grubhub subscriptionMid-level Full TimeUnion Square, New York City2d ago
-
C++ | Co-design | Compilers | Data Analysis | DebuggingSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA2d ago
-
Senior Software Engineer - AI Integrations USD 170K-240KAWS | Alerting | C++ | CSS | Continuous integrationSenior-level Full TimeMountain View, CA3d ago
-
AI Automation Engineer USD 96K-140KAPI Integration | Automation | Confidence Thresholding | Connectwise | DocumentationMid-level Full TimeGrand Rapids, Michigan3d ago
-
API Integration | Agent Orchestration | Bias Mitigation | C plus plus | C#Senior-level Full TimeMenlo Park, CA | Seattle, WA …3d ago
-
Senior Software Engineer, AI/ML GenAI, Core USD 174K-253KC++ | Computer Vision | Data Processing | Data Storage | Data StructuresHealth insurance | Paid time off | Parental leave | Retirement plansSenior-level Full TimeSan Jose, CA, USA3d ago
-
Senior Embedded Software Engineer USD 140K-190KARM | Azure | Build systems | C# | CI/CDDental insurance | Health insurance | Long-term stock incentives | Paid time off | Vision insuranceSenior-level Full TimeAllen, Texas, United States3d ago
-
GCP | LLM | Langgraph | Node.js | PythonCross-functional impact | Flexible work arrangements | Healthcare coverage | Leadership growth opportunities | Paid time offMid-level Full TimeCanada3d ago
-
AI Workflow Orchestration | AI workflow | AWS DynamoDB | AWS Lambda | AWS Step FunctionsArchitectural influence | Engineering Led Collaboration | High technical ownership | Learning opportunities | Remote-first work modelSenior-level Full TimeCanada R3d ago
-
Senior Software Engineer, GenAI USD 142K-195KAI Observability | AI orchestration | AWS | Agent-based | Agent-based systemsSenior-level Full TimeUS - Austin, United States4d ago
-
Staff Machine Learning Engineer, Adobe Firefly Services USD 172K-306KAdversarial Networks | CUDA | Diffusion Models | Distributed Systems | GANsSenior-level Full TimeSeattle, United States R4d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Cache optimization | Compiler optimization | Continuous batchingMid-level Full TimeUnited States - Remote R4d ago
-
Senior Principal Applied AI-Software Engineer USD 112K-155KAI Coding Agents | AI coding | Artificial Intelligence | C# | C++401k | Dental insurance | Employee assistance program | Employee resource groups | Flexible time offSenior-level Full TimeAustin, TX, United States4d ago
-
Local Intent Data Mining Specialist USD 112K-293KData Modeling | Embeddings | Experimentation | Information Retrieval | Intent ClassificationMid-level Full TimeMountain View, California, United States4d ago
-
Senior Machine Learning Platform Engineer USD 170K-220KAWS SageMaker | Alerting | Amazon ECS | CI/CD | Container OrchestrationNYC office onsite | On-call rotationSenior-level Full TimeNew York, NY4d ago
-
AI Engineer USD 99K-163KAPI Integration | AWS | Amazon Bedrock | Data Analysis | Embeddings401k match | Dental insurance | Disability insurance | Hybrid work model | Life insuranceMid-level Full TimeRemote, United States R4d ago
-
Senior AI Software Engineer (Agentic AI) USD 145K-196KAPI Gateway | Async request handling | Async/Await | Authentication | Automated testingCollaborative team | Health and wellness programs | Remote-friendly environmentSenior-level Full TimeBoston, MA4d ago
-
AI Engineer, Evaluation USD 150K-250KEvaluation Pipelines | Experimentation | Golden tests | Grading | LLM100 percent medical dental and vision coverage | 401k match | Commuter benefits | Hybrid work model | In-office lunchMid-level Full TimeSan Francisco4d ago
-
A/B | A/B Testing | AWS | Apache Kafka | B testingDental insurance | Equity | Medical insurance | Remote work environment | Vision insuranceSenior-level Full TimeRemote, USA; Remote, Canada R4d ago
-
Analyst II, Commercial Intelligence & Analytics USD 103K-189KData Quality | Data Validation | Databricks | LLM | Machine LearningHealthcare benefits | Paid Holidays | Paid parental leave | Paid sick time | Paid vacationMid-level Full TimeChicago; Los Angeles; New York4d ago