AI/HPC System Performance Engineer, PhD
Menlo Park, CA
USD 163K-225K (estimate) Entry-level Full Time Found 14d ago
Tasks
- Benchmark performance
- Develop solutions for large scale training systems
- Enhance network fabric and host networking
- Identify and resolve performance issues across stack
- Monitor and troubleshoot network performance
- Optimize communication libraries and scheduling infrastructure
Perks/Benefits
- N/A
Skills/Tech-stack
AI Training | AI Training Workloads | C++ | Distributed Systems | Host Networking | Learning frameworks | MPI | Machine Learning | Machine Learning Frameworks | NCCL | Network Protocols | Performance Tuning | PyTorch | RDMA | System Software | TensorFlow | Training workloads | Troubleshooting | UCX
Education
Bachelor's | Computer Engineering | Electrical Engineering | Master's | PhD in Computer Science
Regions
Countries
States
Cities
Language: en |
Views: 1 |
Clicks: 0
Related jobs
-
Senior Software Engineer USD 119K-267KAWS | Agile Development | Algorithms | Analytics | Data ProcessingSenior-level ContractAnnapolis, MD, US16h ago
-
Sr. Data Engineer USD 108K-158KAWS | Apache Spark | Azure | CI/CD | Data Modeling401k | Dental | Disability | Employee discounts | Health insuranceSenior-level Full TimeNew York-TONAWANDA20h ago
-
AI/ML Engineer (TS/SCI Poly) USD 107K-179KDashboard Development | Data Pipelines | Data Visualization | ELT workflows | ETL/ELTBroad benefits | Inclusive culture | Professional developmentMid-level Full TimeArlington/Rosslyn, Virginia, United States20h ago
-
Software Engineer, Machine Learning USD 219K-240KAlgorithms | Availability | C++ | Code editors | ConsistencyMid-level Full TimeNew York, NY21h ago
-
AI/HPC System Performance Engineer USD 163K-225KAI Training | AI Training Workloads | C++ | Communication libraries | Congestion ControlSenior-level Full TimeMenlo Park, CA21h ago
-
Software Engineer, AI Native USD 173K-247KAI Automation | AI Safety | AI orchestration | AI/ML | AI/ML techniquesSenior-level Full TimeMenlo Park, CA21h ago
-
Senior-level Full TimeMenlo Park, CA | Seattle, WA …21h ago
-
Code review | Data Filtering | Data Generation | Data Pipelines | Distributed SystemsSenior-level Full TimeMenlo Park, CA21h ago
-
Application development | C++ | Data Analysis | Distributed Computing | Large Software SystemsBenefits | Bonus | EquityMid-level Full TimeSunnyvale, CA, USA21h ago
-
AI Agents | Algorithms | Automation | C++ | Data StructuresBenefits | Bonus | EquitySenior-level Full TimeNew York, NY, USA21h ago
-
Senior Software Engineer, AI/ML GenAI, Google Workspace USD 166K-244KC++ | Computer Vision | Data Processing | Debugging | Distributed ComputingBenefits | Bonus | EquitySenior-level Full TimeNew York, NY, USA21h ago
-
Lead Machine Learning Engineer USD 190K-260KAI Coding Assistants | AI coding | AI tools | Apache Spark | Cloud NativeEvents and activities | Flexible PTO | Healthcare coverage | Inclusive environment | Ownership via equitySenior-level Full TimeSeattle, WA1d ago
-
Member of Technical Staff, Inference & RL Systems USD 225K-550KDistributed Systems | GPU | Inference Serving | Memory Management | Model execution401k with matching | Equity | Health insurance | Relocation stipend | Unlimited paid time offSenior-level Full TimeSan Francisco1d ago
-
Software Engineer USD 200K-550KAPI Design | Backend Development | Data Pipelines | Distributed Systems | Frontend workflows401k matching | Equity | Health, dental, vision insurance | Relocation stipend | Unlimited paid time offMid-level Full TimeSan Francisco1d ago
-
API Integration | Customer Engagement | Debugging | Distributed Systems | Event DrivenBonuses | Catered lunch | Equity | Impact from day one | Ownership and autonomySenior-level Full TimeSan Francisco or New York City R1d ago
-
Senior-level Full TimeOakland, CA, United States1d ago
-
A/B | A/B Testing | Airflow | Algorithms | B testingBenefits | Bonus | Employee travel credits | Equity | Inclusive cultureSenior-level Full TimeRemote-USA R1d ago
-
Senior Principal AI Engineer USD 140K-210KAWS | Azure | Collaboration | Communication | Data PreprocessingSenior-level Full TimeChantilly/Herndon, VA1d ago
-
Data Engineer USD 110K-149KAPIs | AWS | Agile methodologies | Azure | CI/CDComprehensive benefits | Supportive cultureSenior-level Full TimeFort Meade, MD1d ago
-
Senior Engineer, Datacenter Server Lifecycle USD 320K-405KAWS | Asset tracking | Failure analysis | Firmware upgrades | Fleet ManagementFlexible hours | Generous vacation | Office collaboration space | Parental leaveSenior-level Full TimeSan Francisco, CA | Seattle, WA1d ago
-
Software Engineer, Compute (8+ YOE) USD 196K-339KAWS | ArgoCD | CI/CD | CRDs | CloudFormationBenefits | Incentive compensation | Stock optionsSenior-level Full TimeSan Francisco, CA; New York, NY; … R1d ago
-
Technical Consultant- Enterprise Data Engineer USD 82K-138KArcGIS | Backup and Recovery | Data Management | Data loading | Database Design401k | Dental | Health benefits | Life insurance | Paid HolidaysMid-level Full TimeVienna, Virginia, United States1d ago
-
System Engineer- Enterprise Data Engineer USD 117K-197KAutomation | Cloud Database | Cloud database solutions | Coordinate systems | Data ArchitectureDental insurance | Health benefits | Life insurance | Paid Holidays | Paid leaveSenior-level Full TimeVienna, Virginia, United States1d ago
-
System Engineer- Enterprise Data Engineer USD 117K-197KAWS RDS | ArcGIS Enterprise | Automation | Azure SQL | Backup and Recovery401k | Dental | Health and welfare benefits | Life insurance | MedicalSenior-level Full TimeSt. Louis, MO - Globe1d ago
-
ABAC | Anomaly Detection | Audit Logging | Cloud Orchestration | Data ModelingBroad benefits | Impactful work | Inclusive culture | Professional developmentSenior-level Full TimeCincinnati, Ohio, United States R1d ago