Software Engineer, Inference – AMD GPU Enablement
Tasks
- Build integrate and tune collective communication libraries for parallel model execution
- Debug and optimize distributed inference workloads
- Design and optimize high performance GPU kernels for accelerators
- Integrate model serving infrastructure into GPU backed systems
- Own bring up correctness and performance of inference stack on AMD hardware
- Validate correctness performance and scalability on large GPU clusters
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | Collective communication | Distributed Systems | GPU Kernels | HIP | Mixed Precision | Model Parallelism | NCCL | Profiling | RCCL | Tensor Parallelism | Triton
Education
N/A
Roles
Regions
Countries
States
Related jobs
-
Computational Biologist - Protein Engineering USD 150K-250KAWS | Amazon Web Services | CUDA | Conda | Deep learningRelocation supportEntry-level Full TimeSan Francisco, CA, US1h ago
-
Entry-level Full TimeSan Francisco, CA, US1h ago
-
Automated Orchestration | Backup and Restore | C# | C++ | CXLSenior-level Full TimeSeattle, Washington, United States4h ago
-
Software Engineer, Generative AI, Workspace USD 147K-211KC++ | Distributed Systems | Generative AI | Information Retrieval | Integration TestingBenefits | Bonus | EquityMid-level Full TimeBoulder, CO, USA5h ago
-
Staff Software Engineer, Machine Learning, Google Chat USD 207K-300KAgentic Workflows | Caching | Cloud Spanner | Continuous Delivery | Continuous integrationSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Software Engineer III, Database Internals AlloyDB USD 147K-211KACID | C# | C++ | CAP Theorem | Compiler TheoryEntry-level Full TimeSunnyvale, CA, USA5h ago
-
Agentic AI | C plus plus | C# | Cloud services | Data ProcessingMid-level Full TimeSan Francisco, CA, USA5h ago
-
Senior-level Full Time142019-NC-300 South Brevard, Charlotte, United States16h ago
-
Senior Machine Learning Engineer - AI/ML USD 148K-247KAWS | Airflow | Apache Flink | Apache Spark | Argo WorkflowsContinual development | Flexible work environment | Health and wellness benefits | Internal career growth | Paid relocation orientation processSenior-level Full TimeSan Mateo HQ, United States16h ago
-
Staff AI engineer USD 140K-160KAI Evaluation | AWS | Agent Orchestration | Caching | Data PipelinesFlexible working hours | Hybrid work culture | Unlimited time offSenior-level Full TimeSan Francisco1d ago
-
AWS | ArangoDB | Asynchronous programming | Context engineering | Distributed Systems401k | Health, dental, vision coverage | Learning stipend | Relocation assistanceSenior-level Full TimeGeorgia, Georgia, United States1d ago
-
AWS | Adapters | ArangoDB | Asynchronous programming | Context engineering401k | Dental insurance | Health insurance | Learning stipend | Relocation assistanceSenior-level Full TimeJacksonville, Florida, United States1d ago
-
AWS | ArangoDB | Asynchronous programming | Context engineering | Distributed Systems401k | Dental insurance | Health insurance | Learning stipend | On site work 5 days per weekSenior-level Full TimeMenlo Park, California, United States1d ago
-
AWS | Adapters | Asynchronous programming | Context engineering | Distributed Systems401k | Dental insurance | Health insurance | Learning stipend | Relocation assistanceSenior-level Full TimeWashington D.C., District of Columbia, United …1d ago
-
AWS | ArangoDB | Asynchronous programming | Context engineering | Distributed Systems401k | Dental insurance | Health insurance | Relocation assistance | Unlimited learning stipendSenior-level Full TimeCharlotte, North Carolina, United States1d ago
-
AWS | Adapters | ArangoDB | Asynchronous programming | Context engineering401k | Health, dental, vision coverage | Learning stipend | Relocation assistance | Visa sponsorshipSenior-level Full TimeMountain View, California, United States1d ago
-
Software Engineer, Inference - Multi Modal USD 295K-555KDistributed Systems | GPU | High Throughput | Inference | Language ModelsEntry-level Full TimeSan Francisco1d ago
-
APIs | Agent workflows | Authentication | Debugging | Distributed SystemsSenior-level Full TimeNew York City1d ago
-
Sr. Platform Software Engineer – AI and Innovation USD 142K-200KAgile | Apollo GraphQL | Cloud Architecture | Continuous Delivery | Distributed SystemsSenior-level Full TimeOak Brook, IL, United States1d ago
-
AI/HPC System Performance Engineer USD 152K-240KAI Workload Optimization | AI workload | Alerting | C++ | Capacity PlanningSenior-level Full TimeMenlo Park, CA2d ago
-
Cloud Computing | Computer Vision | Data Processing | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeKirkland, WA, USA; Seattle, WA, USA2d ago
-
Staff Software Engineer, AI/ML, Google Distributed Cloud USD 207K-300KAI-assisted coding | Agent Debugging | Assisted coding | Cloud platform | Cross-Functional CollaborationSenior-level Full TimeSunnyvale, CA, USA2d ago
-
Senior Staff Software Engineer, Google Cloud Storage USD 262K-365KAlgorithms | Data Structures | Distributed Systems | Large-Scale System Design | Large-scaleSenior-level Full TimeCambridge, MA, USA; Raleigh, NC, USA2d ago
-
Staff Software Engineer, AI/ML GenAI, YouTube USD 207K-300KAlgorithms | Computer Vision | Data Processing | Data Structures | Distributed SystemsSenior-level Full TimeMountain View, CA, USA2d ago
-
Senior Machine Learning Engineer, Search Assistant USD 361K-510KA/B | A/B Testing | Airflow | B testing | Bandit AlgorithmsDisability benefits | Equity awards | Health insurance | Life insurance | Paid time offSenior-level Full TimeSan Jose, California2d ago