Senior Site Reliability Engineer, Data Infrastructure
Tasks
- Define and manage SLIs, SLOs, and SLAs
- Design and operate active-active, geo-replicated systems
- Design and operate highly available multi-region systems
- Harden security posture and evolve DevSecOps practices
- Implement automation, observability, and resilience
- Implement metrics, logging, and tracing
- Lead incident response and drive postmortems
- Manage traffic routing and failover strategies
- Own the reliability and performance of the Kubernetes-based data platform
- Scale infrastructure and improve deployment pipelines
Perks/Benefits
- 401k match
- Flexible PTO
- Medical, dental, and vision insurance
- Paid parental leave
- Tuition reimbursement
Skills/Tech-stack
Active/Active | Argo CD | CI/CD | Capacity Planning | Distributed Systems | Error budget | Failover | GDPR | Geo-replication | GitHub Actions | Grafana | HIPAA | Helm | Incident Response | Infrastructure as Code | Kubernetes | Network policies | OpenShift | OpenTelemetry | Postmortem | Prometheus | Pulumi | Resource Optimization | SLA | SLI | SLO | SOC 2 | SOX | Secrets management | Terraform | Traffic Routing | Vulnerability scanning | “as-code”
Education
N/A