Site Reliability Architect
A USD 155K-190K (estimate) Senior-level Full Time
Tasks
- Analyze upstream downstream dependencies
- Build Prometheus and Grafana dashboards
- Configure ELK and EFK pipelines
- Define SLIs, SLOs, and error budgets
- Design unified observability dashboards
- Detect anomalies and predict incidents with AIOps
- Enrich and manipulate JSON telemetry
- Implement Dynatrace metrics traces logs and Davis AI
- Implement alerting with static and dynamic thresholds
- Integrate OpenTelemetry
- Monitor Kafka and streaming platform signals
- Monitor and troubleshoot distributed microservices
- Perform root cause analysis
- Recommend runbooks with LLMs
- Reduce alert noise with alert correlation
- Suggest auto remediation actions with GenAI
- Summarize incidents with GenAI
Perks/Benefits
- N/A
Skills/Tech-stack
AI machine learning | AIOps | AWS | Alert Correlation | Alerting | Anomaly Detection | Azure | Cause analysis | Cloud platform | Davis AI | Dependency analysis | Distributed Systems | Dynamic Thresholds | Dynatrace | EFK Stack | ELK Stack | Error budget | GenAI | Google Cloud | Google Cloud Platform | Grafana | Incident Management | Infrastructure as Code | JSON | Kafka | Language Models | Large Language Models | Machine Learning | Microservices | Noise Reduction | Observability | OpenTelemetry | Prometheus | Reliability Engineering | Root Cause Analysis | Root cause | SLI | SLO | Series analysis | Site Reliability | Site Reliability Engineering | Static Thresholds | Streaming Platforms | Telemetry enrichment | Terraform | Time Series | Time Series Analysis | Unified Observability | “as-code”
Education
N/A
Related jobs
-
Partner Engineering GenAI - US USD 140K-203KAPI Integration | Agent Orchestration | Artificial Intelligence | Bias Mitigation | C++Senior-level Full TimeMenlo Park, CA | Seattle, WA …2h ago
-
Computer Science Research - US - IC5 USD 166K-244KData Pipelines | Deep learning | Experimentation | Generative Models | Image-to-videoKnowledge sharing | Mentoring | Open source contributionsMid-level Full TimeBellevue, WA | Menlo Park, CA2h ago
-
API Design | Agentic Workflows | C plus plus | C# | Computer VisionSenior-level Full TimeRedmond, WA2h ago
-
Machine Learning Solutions Engineer, Google Cloud USD 153K-222KApache Beam | C++ | ELT | ETL | Generative AISenior-level Full TimeChicago, IL, USA; Atlanta, GA, USA3h ago
-
Software Engineer III, AI/ML GenAI, Google Ads USD 147K-211KC++ | Data Processing | Data Storage | Debugging | Distributed ComputingSenior-level Full TimeMountain View, CA, USA3h ago
-
Software Engineer, AI/ML, Platforms and Devices USD 147K-211KAndroid | C plus plus | Data Processing | Debugging | Distributed SystemsMid-level Full TimeMountain View, CA, USA3h ago
-
Software Engineer, Agentic AI Infrastructure USD 147K-211KC++ | Compute Technologies | Data Structures | Data Structures and Algorithms | Distributed SystemsMid-level Full TimeNew York, NY, USA3h ago
-
Staff Software Engineer, YouTube Ads, AI/ML USD 207K-300KAlgorithms | Data Processing | Data Structures | Debugging | Distributed ComputingEmployee discounts | Health insurance | Paid time off | Professional development | Retirement plansSenior-level Full TimeMountain View, CA, USA3h ago
-
Fullstack - Data Platform (Autonomy) USD 170K-210KAWS | Amazon RDS | Database Indexing | Database Query | Database Query OptimizationBonuses | Equity compensation | Medical/Dental/Vision insurance | Overtime pay | Paid time offSenior-level Full TimeSouth San Francisco, California, USA10h ago
-
3D Perception Engineer - Autonomy (Droid) USD 180K-265K3D Geometry | Aerial survey | Autonomy | CNN | Camera CalibrationBonus pay | Dental insurance | Equity compensation | Medical insurance | Paid time offMid-level Full TimeSouth San Francisco, California, USA10h ago
-
Autonomy Perception Engineer - CV / 3D Reconstruction USD 180K-265K3D Reconstruction | Camera Calibration | Computer Vision | Convolutional Neural Networks | Data AnnotationDental insurance | Equity compensation | Medical insurance | Paid time off | Vision insuranceMid-level Full TimeSouth San Francisco, California, USA10h ago
-
Senior Software Engineer (Cloud & Data Platforms) USD 140K-165KAPI Design | AWS CloudWatch | AWS Copilot | AWS Lambda | Amazon DynamoDB401k matching | Disability insurance | Health insurance | Life insurance | Medical savings accountSenior-level Full TimePhiladelphia, PA, United States11h ago
-
Senior Applied AI Engineer USD 182K-207KAPIs | Causal Inference | Data Pipelines | Data Storage | Distributed SystemsOnsite workSenior-level Full TimeSan Francisco HQ13h ago
-
Machine Learning Engineer (Active Secret Clearance) USD 175K-205KAgile | Algorithms | Asynchronous programming | CI/CD | Data Structures401k plan | FSA | HSA | Medical/Dental/Vision insurance | Paid disability insuranceMid-level Full TimeSchofield Barracks, Hawaii, United States13h ago
-
Data Architect USD 110K-174KAWS Redshift | Amazon Web Services | Azure Synapse | Azure Synapse Analytics | Cloud ComputingTravel opportunitiesSenior-level Full TimeTallahassee, United States13h ago
-
Senior Machine Learning Engineer, AI Personalization USD 194K-343KAWS | Agentic Engineering | Automated testing | Code generation | Data ExperimentationFlexible time off | Medical insurance | Modern family planning | Remote work | Retirement savings plansSenior-level Full TimeBay Area, CA, United States of …15h ago
-
Senior-level Full TimeChicago, Illinois, USA R15h ago
-
Mid-level Full TimeUnited States15h ago
-
Agentic AI | Information Retrieval | LLM Evaluation | Language Models | Language ProcessingFlexible work environment | Health benefits | Remote work optionsSenior-level Full TimeMountain View, CALIFORNIA, United States16h ago
-
Golang | Information Retrieval | Language Processing | Log Analysis | Machine LearningFlexible remote workSenior-level Full TimeMountain View, CALIFORNIA, United States16h ago
-
MLOps / AI Platform Engineer Subject Matter Expert USD 120K-160KAI Foundry | Azure AI | Azure AI Foundry | Azure Cloud | Azure Machine LearningFlexible schedule | Remote workSenior-level Full TimeU.S. Remote R16h ago
-
Sr. Embedded Detection Analyst USD 140K-207KAI tools | Alert Correlation | Cause analysis | Data Analysis | Detection engineeringSenior-level Full TimeRemote - USA R16h ago
-
Tech Lead, ML Engineer - AV Product engineering USD 175K-264KAction models | C++ | CUDA | Closed Loop | Closed Loop EvaluationHybrid work policy | Mentorship opportunities | On-site collaboration | Work from home flexibilitySenior-level Full TimeSunnyvale16h ago
-
Data Engineer USD 115K-120KAzure | Data Performance Optimization | Data Quality | Data Security | Data StorageEntry-level Full TimeRedmond, WA17h ago
-
Entry-level Full TimeRedmond, WA17h ago