aijobs.net

Senior Lead Site Reliability Engineer - Manager-AI/ML and Data Platforms

Jersey City, NJ, United States

USD 186K-215K (estimate) | Senior-level | Full Time

Found 18h ago
Skills/Tech-stack

AWS | Alerting | Black box monitoring | Black-box | CI/CD | Data Lake | Data Pipelines | Databricks | Datadog | Disaster Recovery | Disaster Recovery Planning | Distributed Systems | Docker | Dynatrace | Error Budgets | Grafana | Infrastructure as Code | Kubernetes | Monitoring | Observability | Prometheus | Python | Recovery Planning | Reliability Engineering | Resiliency | SLA | SLA management | SLI | SLI SLO SLA Management | SLI/SLO | SLI/SLO/SLA | SLO | SLO/SLA Management | Site Reliability | Site Reliability Engineering | Spark | Splunk | System design | Telemetry | Telemetry Collection | Terraform | White Box Monitoring | White-box | “as-code”

Education

N/A

Roles

Engineer | Manager | Reliability Engineer | SRE | SRE Manager | Site Reliability Engineer

Regions

North America

Countries

United States

States

New Jersey, US

Cities

Jersey City, New Jersey, US

