Cloud Hardware Development Engineer, Cloud AI/ML/storage server teams
Tasks
- Collaborate with ODM and manufacturing partners
- Debug complex system failures
- Decompose complex server problems into deliverable tasks
- Design and implement system level solutions at scale
- Design predictive failure detection using telemetry and logs
- Develop design verification plans
- Develop functional specifications
- Develop test procedures
- Drive qualification and readiness milestones
- Identify and resolve technical risks early
- Implement zero touch operations automation
- Lead technical solutions for server and rack system architecture
- Own end to end NPI lifecycle for server platforms
- Own fleet health monitoring
- Partner with data center operations to close loop on field failures
- Perform root cause analysis across firmware kernel drivers thermal power and physical layers
Perks/Benefits
- N/A
Skills/Tech-stack
AWS | Artificial Intelligence | Automation | Cause analysis | Data center | Data center operations | Design verification | Device Drivers | Firmware | Functional Specifications | GPU | Kernel | Log Correlation | Machine Learning | New product introduction | Power Systems | Predictive Maintenance | Product introduction | Root Cause Analysis | Root cause | Sensors | Server Design | Server qualification | System Reliability | Telemetry | Test procedures | Thermal Management | X86 | Zero Touch | Zero Touch Operations
Education
Related jobs
-
Databricks Forward Deployed Engineer - GPS USD 97K-162KAPIs | AWS | Agent Bricks | Apache Airflow | Apache SparkMentorship | Professional development | Travel 50 PercentMid-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …22h ago
-
OpenAI Forward Deployed Engineer - GPS USD 97K-163KAPI Integration | AWS | Agents SDK | Airflow | Assistants APIProfessional development | Security clearance support | Travel opportunitiesMid-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …22h ago
-
Lead Snowflake Forward Deployed Engineer - GPS USD 102K-174KAPI Integration | AWS | Arctic Embed | CI/CD | Cortex AILeadership opportunities | Mentorship | Professional development | Travel opportunities | US government security clearance supportSenior-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …22h ago
-
Senior Staff Software Engineer, AI/ML, Security USD 262K-365KAdversarial Machine Learning | Cloud | Data Privacy | Data Processing | Data StructuresSenior-level Full TimeKirkland, WA, USA; Seattle, WA, USA1d ago
-
Applied AI Staff Software Engineer, Looker USD 207K-300KData Processing | Data Visualization | Debugging | Distributed Systems | Fine TuningSenior-level Full TimeKirkland, WA, USA1d ago
-
Staff Software Engineer, Embedded Systems/Firmware USD 207K-300KAPI Design | Bare Metal | C# | C++ | DMASenior-level Full TimeSunnyvale, CA, USA1d ago
-
Staff Software Engineer, AI/ML, Google Public Sector USD 207K-300KAccelerator optimization | C++ | Cloud Object Storage | Deep learning | Distributed SystemsSenior-level Full TimeReston, VA, USA; Washington D.C., DC, …1d ago
-
Senior-level Full TimeOnsite - Austin, TX1d ago
-
Machine Learning Engineer, Geometry Team USD 175K-215K3D Geometry | C++ | Computer Vision | Distributed Training | Fine Tuning401k match | Baby bonding leave | Dental insurance | Disability insurance | Health insuranceSenior-level Full TimeKirkland, Washington, United States1d ago
-
Robotics System Engineer USD 70K-300KAutomated testing | Automated testing pipelines | Autonomous Vehicles | Autonomy | ControlsSenior-level Full TimeIrvine, CA1d ago
-
Forward Deployed Data Scientist Expert USD 198K-420KA/B | A/B Testing | Airflow | Anomaly Detection | Apache KafkaContinuous learning | Flexible working models | Great benefits | Health and wellbeing | Inclusive cultureSenior-level Full TimePalo Alto, CA, US, 943041d ago
-
Machine Learning Engineer - Reinforcement Learning USD 150K-250KData Processing | Deep learning | Distributed Training | Evaluation metrics | Generative ModelsDental insurance | Family leave | Free food and snacks | Health insurance | Life insuranceSenior-level Full TimeFremont, California, United States1d ago
-
Forward Deployed Application/ ML Engineering Expert USD 198K-420KAgentic AI | Cloud Native | Data Pipelines | Distributed Systems | Language ModelsContinuous learning | Flexible working models | Health and well-being benefitsSenior-level Full TimePalo Alto, CA, US, 943041d ago
-
Data Science Engineer USD 121K-154KArtificial Intelligence | Data Visualization | Entity Resolution | Fine Tuning | Generative AI401k | Education reimbursement program | Flexible schedule | Hybrid work | Relocation assistanceEntry-level Full TimeLivermore, CA, United States R1d ago
-
Associate Consultant, Generative AI USD 80K-110KAWS | Azure | CI/CD | Cloud platform | DockerEmployee assistance program | Health insurance | Paid time off | Parental leave | Wellness programsMid-level Full TimeNew York, NY, United States1d ago
-
Sr. Machine Learning Solutions Architect USD 156K-216KAWS | Amazon EMR | Amazon Redshift | Apache Spark | AzureAutonomy to Deliver Results | Continuous improvement culture | Professional mentorship | Remote-first cultureSenior-level Full TimeUS-Remote R1d ago
-
Software Engineer, ML Performance Optimization USD 185K-260KC++ | CUDA | Distributed Training | GPU | Model CompressionMid-level Full TimeFoster City, CA1d ago
-
Senior AI Engineer, Forward Deployed USD 191K-253KAPI | AWS | Amazon S3 | Asynchronous programming | BAAOccasional on site customer engagement | Remote work flexibilitySenior-level Full TimeUnited States R1d ago
-
Helix AI Engineer, Backend Infrastructure USD 150K-400KAWS | Alerting | Azure | C plus plus | Cloud platformSenior-level Full TimeSan Jose, CA1d ago
-
Senior Analytics & Activation Engineer USD 147K-221KAPIs | AWS | Amazon Web Services | Apache Spark | CI/CD401k matching | Dental insurance | Employee discounts | Medical insurance | Paid time offSenior-level Full TimeUnited States, Los Angeles, CA1d ago
-
Senior Staff data science engineer USD 141K-206KAzure ML | Batch inference | CI/CD | Data Pipelines | DatabricksDisability insurance | Employee assistance program | Flexible spending account | Health savings account | Life insuranceSenior-level Full TimeMilpitas, CA, United States1d ago
-
Data Scientist USD 100K-165KAWS | Anomaly Detection | Apache Kafka | Apache Spark | Artificial IntelligenceMid-level Full TimeTampa, Florida1d ago
-
Data Engineer USD 60K-180KAWS Glue | AWS Lambda | AWS Step Functions | Amazon Redshift | Amazon S3Onsite work | TS/SCI clearance requiredSenior-level Full TimeSpringfield, VA - TS/SCI clearance required1d ago
-
Lead Machine Learning Engineer I, Lifetime Value USD 164K-205KAWS | AWS SageMaker | Azure | Debugging | DeploymentMentorship | Remote friendly work locationSenior-level Full TimeRemote (United States) R1d ago
-
AI Compliance and Systems Engineer USD 119K-258KAI RMF | AI Risk | AI Risk Assessment | API Integration | AWSEmployee assistance program | Hybrid work schedule | Medical dental and vision insurance provided | Onsite gym | Paid time offSenior-level Full TimeState College, Pennsylvania, United States1d ago