Supercomputing Engineer (Network)
Tasks
- Analyze performance deviations and optimize network stack configurations
- Create burn in tests for device to device networking and stress testing
- Define system software metrics for high availability and performance
- Design and execute automated qualification tests for RDMA NICs and interconnects
- Design develop implement RDMA networking peering
- Develop tests for host processors NICs TORs and device network interfaces
- Implement and validate peer RDMA for accelerator to accelerator communication
- Modify kernel drivers and user space libraries for zero copy RDMA
- Optimize NIC and switch configurations for throughput congestion control and reliability
- Profile and benchmark inter node RDMA latency and bandwidth
- Root cause firmware driver and hardware issues impacting RDMA performance
- Validate new RDMA features with ODMs and silicon vendors
Perks/Benefits
- Daily lunch dinner
- Housing subsidy
- Medical, dental & vision coverage
- Relocation support
- Unlimited compute budget
- Wellness benefits
Skills/Tech-stack
Arista EOS | Bash | Benchmarking | C# | C++ | CI/CD | Cisco IOS | Docker | EBPF | EBPF tracing | Ftrace | GPUDirect | Go | Infiniband | Juniper Junos | Kernel driver | Kubernetes | Linux | Linux Kernel | Memory registration | NVLink | Perf | Perf profiling | Python | Queue pair | RDMA | RDMA verbs | RoCE | Rust | Server virtualization | Top of Rack | Top of rack switch | Version Control (Git) | Version control | Wireshark | Zero copy
Education
N/A
Regions
Countries
States
Cities
Related jobs
-
Featured Feat. Applied AI Engineer - Bay Area USD 211K-263KArtificial Intelligence | C plus plus | C# | Embeddings | Feature Engineering401k | Comprehensive health and wellness benefits | Learning and development opportunities | Unlimited time offMid-level Full TimeHQ (San Francisco)22d ago
-
Staff Software Engineer, Data Cloud Memorystore USD 207K-301KC++ | Cache | Compute Technologies | Data Structures | Data Structures and AlgorithmsBonus | Equity | Health insurance | Learning and development | Paid time offSenior-level Full TimeKirkland, WA, USA1h ago
-
Senior Software Developer, Embedded Systems/Firmware USD 100K-253KAlgorithms | Android | Artificial Intelligence | C# | C++Senior-level Full TimeKirkland, WA, USA; New York, NY, …1h ago
-
Quantum Error Correction Theorist USD 155K-185KDecoder algorithms | Error correction | Fault Tolerance | Git | Neutral Atom Qubits401k matching | Catered team lunches | Dental insurance | Dependent care benefits | FSAEntry-level Full TimeBerkeley, CA16h ago
-
Staff Algorithm Engineer / Data Scientist USD 150K-180KData Mining | MATLAB | Machine Learning | Python | R401k plan | Dental insurance | Flexible spending account | Flexible work environment | Health savings accountSenior-level Full TimeSan Diego, CA, US1d ago
-
AI Model Deployment | AI model | AI model development | Data-Driven Decision Making | Data-drivenSenior-level Full TimeSan Francisco, California, United States1d ago
-
Optimizely Solutions Architect USD 100K-200K.NET | ASP.NET MVC | ASP.Net Core | Automated testing | Azure DevOpsSenior-level Full TimeUnited States1d ago
-
Quantum Software Engineer USD 120K-150KAWS | Access Control | C++ | CI/CD | Cirq401k match | Employee assistance program | Employer paid medical/dental/vision | Flexible savings account | Health savings accountMid-level Full TimeChicago, Illinois, United States1d ago
-
Data Processing | GRPC | GraphQL | Large Scale Data | Large-scaleDirect product impact | Experimentation | Fast-paced startup culture | Rapid iteration | Remote OKMid-level Full TimeNew York, New York, United States R1d ago
-
Agile | Azure | Azure DevOps | C plus plus | DAQ401k matching | Dental insurance | Employee assistance program | HSA option | Health insuranceSenior-level Full TimeAustin, TX, United States1d ago
-
C++ | Computer hardware | DAQ | Digital I/O | Embedded Systems401k match | Dental insurance | Employee assistance program | HSA | Health insuranceSenior-level Full TimeMarkle, IN, United States1d ago
-
Inference Intern USD 60K-142KC++ | Collective communication | Compilers | Consensus Protocols | Consistency modelsDaily meals | Direct mentorship | Housing support | Paid internshipEntry-level InternshipSan Jose1d ago
-
Supercomputing Engineer (Test) USD 150K-275KBash | Benchmarking | CI/CD | Containerization | Data AnalysisDaily meals | Housing subsidy | Medical, dental & vision coverage | Relocation support | Unlimited compute budgetMid-level Full TimeSan Jose1d ago
-
Data Engineer USD 179K-273KAmazon Kinesis | Apache Airflow | Apache Kafka | Apache Spark | Data ModelingMid-level Full TimeFoster City, CA1d ago
-
Machine Learning Engineer: Perception and Planning USD 184K-275KAutomated testing | Behavior Prediction | C++ | Code review | Computer VisionSenior-level Full TimeOakland, CA1d ago
-
Robotics System Engineer USD 130K-230KAutonomous Systems | C++ | Data Analysis | Metrics pipelines | PythonSenior-level Full TimeOakland, CA1d ago
-
ML Infrastructure Engineer USD 120K-190KAirflow | Amazon SageMaker | Apache Spark | Argo | DatabricksEntry-level Full TimeOakland, CA1d ago
-
Space Operations Engineer (Embedded Software) USD 100K-160KAPIs | ARM | Algorithm Optimization | C# | C++Mid-level Full TimeSan Francisco, CA1d ago
-
Staff Data Engineer USD 192K-250KAWS RDS | Airflow | Amazon Aurora | Amazon Kinesis | Apache KafkaCatered lunches | Co-working travel perk | Commuter benefit | Disability insurance | Equipment allowanceSenior-level Full TimeSan Francisco, CA1d ago
-
Senior Embedded Software Engineer USD 166K-200KAbstraction layer | Automated testing | Bootloader | C# | C++401k | Dental insurance | Free lunch | Health insurance | Paid time offSenior-level Full TimeAlameda HQ1d ago
-
Senior AI Engineer USD 140K-180KAI Foundry | AI Platform | AI Services | Azure AI | Azure AI FoundrySenior-level Full TimeChicago, Illinois1d ago
-
Senior AI Engineer USD 140K-180KAI Foundry | AI Services | Azure AI | Azure AI Foundry | Azure AI ServicesClient facing consulting opportunities | Cross-functional collaboration | Work on enterprise AI deploymentsSenior-level Full TimeAtlanta, GA | Kansas City, MO …1d ago
-
Senior Data Engineer - Cloud Data Platform USD 116K-164KAgile | CI/CD | Cloud platform | Cloud services | Control-MOn-site collaborationSenior-level Full TimeAustin, TX, United States1d ago
-
Senior Data Engineer USD 150K-220KAWS | Anomaly Detection | DBT | Data Observability | Data QualityFully remoteSenior-level Full TimeRemote (U.S. based) R1d ago
-
Mid-level Full TimeKing George, VA, United States1d ago