AI/HPC Systems Performance Engineer
Tasks
- Benchmark communication system performance
- Debug host networking protocols
- Develop and deploy performance optimization solutions
- Identify performance bottlenecks across the communications stack
- Monitor network and production issues
- Triage performance issues in distributed applications
- Troubleshoot RDMA workload performance
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | Host Networking | IB | MPI | Machine Learning | NCCL | Network Performance | Network Performance Monitoring | Performance Benchmarking | Performance Monitoring | PyTorch | RDMA | RoCE | TensorFlow | UCX