Staff Engineer, Distributed Storage and HPC & AI Infrastructure
Tasks
- Build Kubernetes storage operators and controllers
- Design disaster recovery and backup runbooks
- Design multi petabyte AI storage systems
- Design multi tier caching and smart prefetching
- Enable automated provisioning and self service storage
- Implement NVMe over Fabrics and iSCSI
- Implement monitoring alerting SLOs and automated remediation
- Implement multi tenant isolation and quota enforcement
- Integrate parallel filesystems and object stores
- Mentor teams and contribute to open source
- Optimize RDMA Infiniband and data center networks
- Optimize data paths for GPU workloads
- Perform chaos engineering
- Plan capacity and optimize storage costs
- Tune parallel filesystems for high throughput
- Write documentation and postmortems
Perks/Benefits
Skills/Tech-stack
Ansible | ArgoCD | BeeGFS | CSI | Ceph | Custom Controllers | Disaster Recovery | Distributed Systems | Ext4 | GPFS | GitOps | Go | Grafana | Helm | ISCSI | Infiniband | Kubernetes | LVM | Linux | Linux Storage Stack | Linux storage | Lustre | Minio | NVMe | NVMe-oF | Object storage | Observability | PersistentVolumes | Prometheus | Python | R2) | RAID | RAID configurations | RDMA | S3 | SLO | StatefulSets | Storage operators | Storage stack | TCPIP | Terraform | Thanos | WekaFS | XFS
Education
Regions
Countries
States
Related jobs
-
Sr. Application Software Engineer, Data Analytics USD 160K-225KAngular | C# | CI/CD | Computer Vision | Continuous integrationExtended hours | Travel | Weekend workSenior-level Full TimeBastrop, TX10h ago
-
Senior-level Full TimeOnsite - Austin, TX11h ago
-
AWS Bedrock | Agent systems | Anthropic API | Autogen | Azure401k matching program | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Contract Full TimeRemote, OR, United States R12h ago
-
Senior Data Engineer USD 90K-110KAWS | Agile | Apache NiFi | Data Architecture | Data ModelingAutonomy | Flexible working hours | Global employee assistance programme | Online training videos | Teambuilding eventsSenior-level Full TimeNew York, United States12h ago
-
AI Engineer 1 - Compliance, Quality & Testing USD 80K-150KAI-assisted testing | CI/CD | Code review | Cypress | Defect Tracking401k match | Dental insurance | Holidays | Medical insurance | Paid time offEntry-level Full TimeWashington, DC14h ago
-
Data Engineer USD 74K-133KAgile | Apache Airflow | BigQuery | Cloud Composer | Cloud Data401k retirement plan | Dental insurance | Disability insurance | Flexible time off | Health insuranceMid-level Full TimeLisle, IL, United States R14h ago
-
API Testing | Cypher | Data Quality | DataOps | DevOpsBenefits | Competitive pay | Growth opportunity | Remote work | Travel requiredSenior-level Full TimeReston, VA, United States R16h ago
-
Principal Engineer - Data Platform USD 221K-387KAWS | Airflow | Apache Hive | Apache Iceberg | Apache ImpalaRemote workSenior-level Full TimeSanta Clara, California, United States R16h ago
-
AI Engineer 1 - Platform Integration & AI/Data USD 80K-150KAWS | Agentic Systems | Backend Development | Backend Services | CI/CD401k match | Dental insurance | Holidays | Medical insurance | Paid time offEntry-level Full TimeWashington, DC16h ago
-
IT Infrastructure Engineer II USD 139K-178KAWS Organizations | Access Management | Amazon Web Services | Ansible | Application FirewallHealth and wellness resources | Remote work | Wellness FridaysSenior-level Full TimeRemote - United States R17h ago
-
Agile | Automated testing | CI/CD | Cloud Computing | CrewAIDental insurance | Health insurance | Vision insuranceMid-level Full TimeAshburn, VA, United States18h ago
-
Senior Data Engineer - Databricks USD 180K-248KAWS | Access Control | Amazon Web Services | Apache Spark | Automated testing401k match | Corporate Benefit Program | Discounted pet insurance | Educational resources | Employee Referral Bonus ProgramSenior-level Full TimeUS - Remote R19h ago
-
AI Machine Learning Skill 2-FFPP-8904 USD 78K-250KC# | Data Governance | Data Modeling | Data pipeline | Java401k plan with company match | Dental insurance | Diverse inclusive workplace | Employee referral programs | Flexible spending accountsMid-level Full TimeHanover, MD19h ago
-
AWS | AWS SageMaker | Azure | Cloud Pak for Data | Cloud infrastructureAccess to national security mission work | Hybrid work | Travel opportunitiesSenior-level Full TimeUSA-VA-Herndon20h ago
-
AI-assisted software development | AWS | Agentic AI | Azure | Cloud ComputingSenior-level Full TimeUSA-VA-Herndon20h ago
-
AI Engineer USD 180KAgent Orchestration | Cost Management | Data Pipelines | Distributed Systems | LLM401k | Commuter benefits | Dental insurance | Flexible spending | Health insuranceMid-level Full TimeNew York, New York, United States …20h ago
-
Data & Analytics Specialist USD 87K-135KAPI Integration | Alteryx | DAX | JavaScript | Power AppsAdoption Assistance | Educational assistance | Flexible spending account | Health savings account | Life insuranceMid-level Full TimeWichita, Kansas21h ago
-
Data Platform & Engineering Specialist USD 100K-130KAWS | Amazon Kinesis | Azure | Azure Event | Azure Event HubsDental insurance | Educational assistance | Flexible spending accounts | Health insurance | Health savings accountsMid-level Full TimeLincoln, Nebraska21h ago
-
Machine Learning Leader - Optical Solutions USD 180K-300KAnomaly Detection | Data analytics | Image Processing | Java | Machine LearningAdoption Assistance | Disability insurance | Educational assistance | Flexible spending account | Health savings accountSenior-level Full TimeFremont, California21h ago
-
Process and Analytics Engineer USD 105K-140KAgile | Anomaly Detection | Asset Framework | HYSYS | HYSYS OnlineDental insurance | Disability insurance | Educational assistance | Flexible spending account | Health insuranceMid-level Full TimeWichita, Kansas21h ago
-
AI Architect USD 134K-237KAI Search | AI Security | API Gateway | API Integration | AWS BedrockAdoption Assistance | Dental insurance | Disability insurance | Educational assistance | Flexible spending accountsSenior-level Full TimeHouston, Texas | Tulsa, Oklahoma | …21h ago
-
Senior Finance Data Engineer / Data Analyst USD 100K-120KDAX | Dashboard Development | Data Modeling | Data Standardization | Data TransformationSenior-level Full TimeAuburn Hills, MI, United States22h ago
-
Software Engineer III, Generative AI USD 147K-211KComputer Vision | Data Processing | Debugging | Language Models | Language ProcessingSenior-level Full TimeKirkland, WA, USA22h ago
-
API Design | Agent systems | Agentic Workflows | Apache Beam | Artificial IntelligenceSenior-level Full TimeSunnyvale, CA, USA; Cambridge, ON, Canada22h ago
-
Staff Software Engineer, AI/ML, YouTube Ads USD 207K-301KA/B | A/B Testing | B testing | Data Structures | Data structures algorithmsSenior-level Full TimeMountain View, CA, USA22h ago