Senior Site Reliability Engineer - Observability
San Francisco Office (Fremont St)
USD 240K-401K | Senior-level | Full Time
Tasks
- Automate deployment and operation of observability systems
- Deploy observability platforms for logging, metrics, and distributed tracing
- Develop platform software to improve product reliability
- Lead other engineering teams on monitoring solutions
- Set up monitoring for AI HPC cluster infrastructure
Perks/Benefits
- 401k match
- Commuter stipend
- Dental insurance
- Flexible paid time off
- Health insurance
- Vision insurance
- Wellness stipend
Skills/Tech-stack
Ansible | Dashboard Design | DevOps | Distributed tracing | Go | Kubernetes | Linux | Logging | Metrics | Monitoring | Network Monitoring | OTel Instrumentation | Observability | OpenTelemetry | OpenTelemetry Collector | PromQL | Prometheus | Querying | System Administration | Terraform
Education
N/A