Senior HPC Storage Engineer
Santa Clara, CA, United States
USD 184K-356K | Senior-level | Full Time
Tasks
- Automate monitoring and alerting
- Collaborate on infrastructure requirements
- Define build, test, and deployment methodologies
- Design scalable distributed storage services
- Develop infrastructure automation tooling
- Enable self-service resource consumption
- Evaluate distributed file system technologies
- Implement distributed storage services
- Optimize storage performance and cost
- Perform performance analysis and optimizations
- Perform root cause analysis
- Research distributed storage services
- Suggest corrective actions
- Support deep learning workflows on clusters
Perks/Benefits
- N/A
Skills/Tech-stack
Bash | CUDA | CentOS | Ceph | Container Technologies | Deep learning | Distributed file systems | Docker | Enroot | File systems | GPFS | HDD | HPC Storage | High Performance | High-performance networking | Kernel development | Linux | Lustre | NCCL | NVMe | Network Appliance | Performance Tuning | PyTorch | Python | RHEL | SDN | SSD | Software-defined networking | Storage Kernel Development | Storage Performance Tuning | Storage performance | TensorFlow | Ubuntu