Cloud Hardware Development Engineer, Cloud AI/ML/storage server teams
USD 136K-212K | Mid-level | Full Time
Tasks
- Collaborate with ODM and manufacturing partners
- Debug complex system failures
- Decompose complex server problems into deliverable tasks
- Design and implement system-level solutions at scale
- Design predictive failure detection using telemetry and logs
- Develop design verification plans
- Develop functional specifications
- Develop test procedures
- Drive qualification and readiness milestones
- Identify and resolve technical risks early
- Implement zero-touch operations automation
- Lead technical solutions for server and rack system architecture
- Own the end-to-end NPI lifecycle for server platforms
- Own fleet health monitoring
- Partner with data center operations to close the loop on field failures
- Perform root cause analysis across firmware, kernel, drivers, thermal, power, and physical layers
Perks/Benefits
- N/A
Skills/Tech-stack
AWS | Artificial Intelligence | Automation | Data center | Data center operations | Design verification | Device Drivers | Firmware | Functional Specifications | GPU | Kernel | Log Correlation | Machine Learning | New product introduction | Power Systems | Predictive Maintenance | Root Cause Analysis | Sensors | Server Design | Server qualification | System Reliability | Telemetry | Test procedures | Thermal Management | X86 | Zero Touch Operations