Supercomputing Engineer (Network)
Tasks
- Analyze performance deviations and optimize network stack configurations
- Create burn in tests for device to device networking and stress testing
- Define system software metrics for high availability and performance
- Design and execute automated qualification tests for RDMA NICs and interconnects
- Design develop implement RDMA networking peering
- Develop tests for host processors NICs TORs and device network interfaces
- Implement and validate peer RDMA for accelerator to accelerator communication
- Modify kernel drivers and user space libraries for zero copy RDMA
- Optimize NIC and switch configurations for throughput congestion control and reliability
- Profile and benchmark inter node RDMA latency and bandwidth
- Root cause firmware driver and hardware issues impacting RDMA performance
- Validate new RDMA features with ODMs and silicon vendors
Perks/Benefits
- Daily lunch dinner
- Housing subsidy
- Medical, dental & vision coverage
- Relocation support
- Unlimited compute budget
- Wellness benefits
Skills/Tech-stack
Arista EOS | Bash | Benchmarking | C# | C++ | CI/CD | Cisco IOS | Docker | EBPF | EBPF tracing | Ftrace | GPUDirect | Go | Infiniband | Juniper Junos | Kernel driver | Kubernetes | Linux | Linux Kernel | Memory registration | NVLink | Perf | Perf profiling | Python | Queue pair | RDMA | RDMA verbs | RoCE | Rust | Server virtualization | Top of Rack | Top of rack switch | Version Control (Git) | Version control | Wireshark | Zero copy
Education
N/A
Regions
Countries
States
Cities
Related jobs
-
AWS Bedrock | Agent systems | Anthropic API | Autogen | Azure401k matching program | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Contract Full TimeRemote, OR, United States R6h ago
-
Senior Data Engineer USD 90K-110KAWS | Agile | Apache NiFi | Data Architecture | Data ModelingAutonomy | Flexible working hours | Global employee assistance programme | Online training videos | Teambuilding eventsSenior-level Full TimeNew York, United States6h ago
-
Data Engineer USD 74K-133KAgile | Apache Airflow | BigQuery | Cloud Composer | Cloud Data401k retirement plan | Dental insurance | Disability insurance | Flexible time off | Health insuranceMid-level Full TimeLisle, IL, United States R8h ago
-
Principal Engineer - Data Platform USD 221K-387KAWS | Airflow | Apache Hive | Apache Iceberg | Apache ImpalaRemote workSenior-level Full TimeSanta Clara, California, United States R10h ago
-
Agile | Automated testing | CI/CD | Cloud Computing | CrewAIDental insurance | Health insurance | Vision insuranceMid-level Full TimeAshburn, VA, United States12h ago
-
AI Machine Learning Skill 2-FFPP-8904 USD 78K-250KC# | Data Governance | Data Modeling | Data pipeline | Java401k plan with company match | Dental insurance | Diverse inclusive workplace | Employee referral programs | Flexible spending accountsMid-level Full TimeHanover, MD13h ago
-
AWS | AWS SageMaker | Azure | Cloud Pak for Data | Cloud infrastructureAccess to national security mission work | Hybrid work | Travel opportunitiesSenior-level Full TimeUSA-VA-Herndon14h ago
-
AI-assisted software development | AWS | Agentic AI | Azure | Cloud ComputingSenior-level Full TimeUSA-VA-Herndon14h ago
-
Analytics Engineer USD 115K-150KAgile | Azure DevOps | CI/CD | DBT | Data GovernanceAdoption Assistance | Dental insurance | Disability insurance | Educational assistance | Flexible spending accountMid-level Full TimeHouston, Texas | Tulsa, Oklahoma | …14h ago
-
AI Engineer USD 180KAgent Orchestration | Cost Management | Data Pipelines | Distributed Systems | LLM401k | Commuter benefits | Dental insurance | Flexible spending | Health insuranceMid-level Full TimeNew York, New York, United States …14h ago
-
Embedded Firmware Engineer USD 70K-76KARM Cortex | ARM Cortex-M | Agile | C# | C++Dental insurance | Educational assistance | Flexible spending account | Health insurance | Health savings accountMid-level Full TimeNeenah, Wisconsin14h ago
-
Data & Analytics Specialist USD 87K-135KAPI Integration | Alteryx | DAX | JavaScript | Power AppsAdoption Assistance | Educational assistance | Flexible spending account | Health savings account | Life insuranceMid-level Full TimeWichita, Kansas14h ago
-
Data Platform & Engineering Specialist USD 100K-130KAWS | Amazon Kinesis | Azure | Azure Event | Azure Event HubsDental insurance | Educational assistance | Flexible spending accounts | Health insurance | Health savings accountsMid-level Full TimeLincoln, Nebraska14h ago
-
Machine Learning Leader - Optical Solutions USD 180K-300KAnomaly Detection | Data analytics | Image Processing | Java | Machine LearningAdoption Assistance | Disability insurance | Educational assistance | Flexible spending account | Health savings accountSenior-level Full TimeFremont, California15h ago
-
Process and Analytics Engineer USD 105K-140KAgile | Anomaly Detection | Asset Framework | HYSYS | HYSYS OnlineDental insurance | Disability insurance | Educational assistance | Flexible spending account | Health insuranceMid-level Full TimeWichita, Kansas15h ago
-
AI Architect USD 134K-237KAI Search | AI Security | API Gateway | API Integration | AWS BedrockAdoption Assistance | Dental insurance | Disability insurance | Educational assistance | Flexible spending accountsSenior-level Full TimeHouston, Texas | Tulsa, Oklahoma | …15h ago
-
Senior Finance Data Engineer / Data Analyst USD 100K-120KDAX | Dashboard Development | Data Modeling | Data Standardization | Data TransformationSenior-level Full TimeAuburn Hills, MI, United States16h ago
-
Software Engineer III, Generative AI USD 147K-211KComputer Vision | Data Processing | Debugging | Language Models | Language ProcessingSenior-level Full TimeKirkland, WA, USA16h ago
-
API Design | Agent systems | Agentic Workflows | Apache Beam | Artificial IntelligenceSenior-level Full TimeSunnyvale, CA, USA; Cambridge, ON, Canada16h ago
-
Staff Software Engineer, AI/ML, YouTube Ads USD 207K-301KA/B | A/B Testing | B testing | Data Structures | Data structures algorithmsSenior-level Full TimeMountain View, CA, USA16h ago
-
Machine Learning Engineer USD 120K-140KAI Pipelines | AI Workbench | AI endpoints | Apache Kafka | Automated testingEntry-level Full TimeDenver, Colorado, United States19h ago
-
Data Analyst - Forecasting and Optimization USD 124K-187KBacktesting | Deep learning | Feature Engineering | Gurobi | HiGHS401k matching | Disability insurance | Health insurance | Life insurance | Medical savings accountMid-level Full TimePhiladelphia, PA, United States22h ago
-
AWS Glue | AWS Lambda | AWS S3 | Access Control | Data GovernanceCareer growth opportunities | Collaborative and inclusive work environment | Diverse and inclusive culture | Flexible work arrangements | Permanent remote working modelSenior-level Full TimeCanada R23h ago
-
Data Modeling | Data analytics | Language Models | Large Language Models | Machine LearningCoaching | Hybrid work model | Mental health counseling | Mentorship | Paid volunteer timeMid-level Full TimeRaleigh, US, North Carolina23h ago
-
Principal AI Architect Engineer USD 118K-195KAWS | AWS Lambda | Amazon Bedrock | Amazon EC2 | Amazon EKSSenior-level Full TimeNew York, United States1d ago