Data Reliability Engineer
Tasks
- Conduct root cause analysis and implement durable fixes
- Create and maintain runbooks, SOPs, and operational documentation
- Define and improve data SLAs, including freshness, latency, and completeness
- Design monitoring, alerting, and observability for data systems
- Develop automation tooling to reduce operational toil
- Diagnose data pipeline failures, delays, and data quality issues
- Own production data pipeline reliability and stability
- Perform incident response, including triage, mitigation, and resolution
- Perform off-hours production support when required
- Support disaster recovery planning, including backup validation and recovery workflows
Perks/Benefits
- 401k matching
- Dental insurance
- FMLA leave
- Life insurance
- Medical insurance
- Paid long-term disability
- Paid short-term disability
- Paid parental leave
- Paid time off
- Paid volunteer time
- Tuition reimbursement
- Vision insurance
Skills/Tech-stack
AWS | Alerting | Amazon DynamoDB | Amazon EMR | Amazon Kinesis | Amazon Redshift | Amazon S3 | Automation | Backup validation | Disaster Recovery | Kafka | Monitoring | Observability | Python | Root Cause Analysis | SQL | Spark | Streaming
Education
N/A