Principal Site Reliability Engineer (Intelligent Automation)

South San Francisco, United States

USD 162K-302K Senior-level Full Time

@ G...

Apply Save

Found 1mo ago

Tasks

Automate deployment of ML pipelines and HPC clusters
Build resilient highly available architectures for ML and HPC
Design and implement infrastructure as code solutions
Develop automation scripts and workflows for infrastructure management
Ensure security governance and regulatory compliance
Implement disaster recovery and business continuity
Lead AIOps incident management
Mentor and train engineers in IaC and HPC
Monitor and optimize cloud usage and costs
Partner with cross functional teams to align solutions to business goals
Provide technical leadership to engineers
Provision and manage cloud infrastructure for ML and HPC workloads
Run chaos engineering experiments
Set up monitoring logging and alerting

Perks/Benefits

Skills/Tech-stack

Apply Save

Language: en Views:

1 Clicks:

0 Saves: 0

Related jobs

Software Engineer, ML Data Infrastructure USD 160K-220K

Algorithms | Cloud platform | Data Pipelines | Data Processing | Data Storage

Access to compute resources | Dental insurance | Health insurance | In-office meals | Learning and mentorship

Entry-level Full Time

Toronto

4h ago
Computer Vision AI & ML Engineer USD 100K-300K

3D Geometry | 3D Scene | 3D Scene Understanding | Computer Vision | Data Augmentation

Entry-level Full Time

San Mateo, CA

8h ago
Autonomy Algorithm Engineer, Planning & Prediction USD 140K-225K

Agent modeling | C++ | GPU Inference | HD Mapping | Imitation Learning

Mid-level Full Time

Houston, TX or San Francisco Bay …

9h ago
Algorithm Engineer, Autonomy Planning & Prediction USD 150K-200K

C++ | CI/CD | Code review | Data Engineering | Debugging

Mid-level Full Time

Houston, TX or San Francisco Bay …

9h ago
SDE II, Experience Analytics (Agent Experiences & Data Products) USD 143K-194K

APIs | AWS | Amazon Web Services | Application Programming | Application Programming Interfaces

Career growth | Flexible work arrangements | Knowledge sharing | Mentorship

Mid-level Full Time

Seattle, Washington, USA

10h ago
Business Intelligence Engineer, Rapid & Rural Logistics (R2L) Science & AI USD 99K-185K

Amazon Redshift | Data Modeling | Data Pipelines | ETL | Experimentation

Mid-level Full Time

Bellevue, Washington, USA

10h ago
Senior AI Engineer USD 216K-277K

AWS | Agent systems | Autonomous Agents | Azure | By Design

Career development | Home workspace stipend | MacBook provided | Medical, dental, and vision insurance | Mental health benefits

Senior-level Full Time

Austin, TX R

10h ago
AI Engineer at Lumion USD 148K-200K

API Design | Data Modeling | Debugging | Guardrails | Language Models

401k | Health dental vision reimbursement | MacBook Pro | Relocation support | Tech stipend

Mid-level Full Time

South Jordan, UT, US, CA, US

11h ago
Enterprise AI Platform Engineer USD 130K-150K

API Gateway | AWS | AWS IAM | Audit Logging | Azure

401k | Dental insurance | Flexible PTO | Growth opportunities | Medical insurance

Senior-level Full Time

Austin, TX

11h ago
Software Engineer, Storage USD 140K-265K

Alerting | Autoscaling | C++ | Cloud Native | Databases

401k contribution | Dental insurance | Education stipend | Healthy lunches | Home office improvement stipend

Mid-level Full Time

Mountain View, CA

11h ago
Senior Data Engineer USD 135K-165K

Amazon Kinesis | Amazon Redshift | Apache Airflow | Apache Iceberg | Apache Kafka

401k | Healthcare benefits | Paid time off

Senior-level Full Time

Virtual, United States R

12h ago
Software Engineer II, Machine Learning USD 145K-175K

AWS | Airflow | CI/CD | Data Preparation | Experimentation

401k employer match | Commuter subsidy | Concierge medical membership | Dental insurance | Fitness membership subsidy

Mid-level Full Time

Palo Alto, California

12h ago
AI Architect - Business Applications USD 138K-225K

A/B | A/B Testing | API | B testing | Experimentation

Hybrid work option

Senior-level Full Time

Bellevue, WA, United States

12h ago
Trial Principal, Data & Analytics, Roivant Health USD 130K-220K

Analysis Plan | Bayesian analysis | Claims data | Clinical Trial Data | Clinical Trial Data Analysis

Comprehensive benefits package | Equity participation

Senior-level Full Time

Roivant Sciences, Inc., 1 Pennsylvania Plaza, …

12h ago
Staff Data Engineer, Finance USD 170K-230K

AWS | Asset-based lending | DBT | Data Ingestion | Data Modeling

401k matching | Dental insurance | Employee assistance program | Employee recognition | Employee referral program

Senior-level Full Time

Seattle, Washington, United States

13h ago
Software Engineer, Data Foundation USD 183K-240K

Application Architecture | Distributed Systems | Java | JavaScript | SQL

401k matching | Dental insurance | Enhanced parental leave | Holiday pay | Medical insurance

Senior-level Full Time

San Francisco, US (Hybrid) R

14h ago
Junior Data Engineer USD 64K-90K

Databricks | ELT | ETL | Git | Jira

Active Secret security clearance

Entry-level Full Time

Arlington, VA

14h ago
Staff Data Engineer USD 180K-200K

AWS | Airbyte | Airflow | Apache Iceberg | Automated testing

401k match | Dental insurance | Disability insurance | Employee assistance program | Life insurance

Senior-level Full Time

Waltham, MA

14h ago
Staff Data Engineer USD 180K-200K

API Development | AWS | Airbyte | Airflow | Amazon Kinesis

401k match | Dental insurance | Disability insurance | Employee assistance program | Health and wellness resources

Senior-level Full Time

North Bethesda, MD

14h ago
Principal Computational Engineer, 3D CAD & Geometric Intelligence USD 200K-230K

AWS | B-Rep | C++ | CAD interoperability | CMake

Conference participation | Design review leadership | Mentorship | Technical leadership opportunities

Senior-level Full Time

Waltham, MA

14h ago
Principal Computational Engineer, 3D CAD & Geometric Intelligence USD 200K-230K

ACIS | AWS | B-Rep | C++ | CAE

Senior-level Full Time

North Bethesda, MD

14h ago
Senior Software Engineer, Embedded Systems USD 170K-210K

C# | C++ | CI/CD | Docker | Embedded Systems

401k | Dental insurance | Disability insurance | Employee stock options | Health insurance

Senior-level Full Time

Glen Cove, NY

14h ago
Senior Embedded Firmware Engineer USD 120K-153K

ADC | Bare Metal | C plus plus | C# | Control Algorithms

Ownership equity | Work-life balance

Senior-level Full Time

Wilmington, MA

14h ago
Data Engineer USD 170K-220K

Apache Airflow | Apache Beam | Apache Flink | Apache Spark | BigQuery

Comprehensive benefits

Senior-level Full Time

San Francisco or Denver

14h ago
Principal AI Engineer, AI Solutions USD 172K-216K

API Design | AWS | Agent Frameworks | Automation | Data Pipelines

Career Development Programs | Commuting cost coverage | Daily free lunch | Employee resource groups | Equity grants

Senior-level Full Time

Boston, Massachusetts, United States R

14h ago

Principal Site Reliability Engineer (Intelligent Automation)

Tasks

Perks/Benefits

Skills/Tech-stack

Education

Roles

Regions

Countries

States

Cities

Related jobs