Site Reliability Engineer II
USD 130K-140K Senior-level Full Time
Tasks
- Automate monitoring and incident response
- Build and maintain CI/CD pipelines
- Collaborate with cross-functional teams
- Deploy and maintain SaaS platform
- Design and deploy AI/ML infrastructure
- Develop operations automation tools
- Drive disaster recovery process
- Enforce policies and audit production systems
- Estimate engineering effort and plan rollout changes
- Implement security controls
- Integrate MLOps tools
- Manage GPU resources
- Participate in on-call rotation
- Perform patching and configuration management
- Perform root cause analysis and blameless post-mortems
- Provide deployment and operations support
- Scale infrastructure to meet demand
Perks/Benefits
- Health benefits
- Life insurance
- On-call compensation
- Paid time off
- Parental leave
- Retirement benefits
Skills/Tech-stack
AKS | Amazon Web Services | Ansible | Argo CD | Auto-remediation | CI/CD | CIS | Cloud platform | CloudFormation | Docker | EKS | Elasticsearch | FIPS 140-3 | FIPS-140 | GPU provisioning | GitHub | GitHub Actions | GitOps | Go | Google Cloud Platform | Grafana | Helm | Incident Response | Infrastructure as Code | JSON | Java | Jenkins | Kubeflow | Kubernetes | Kubernetes Upgrades | Linux | MLflow | Machine Learning | Machine Learning Operations | Microsoft Azure | MongoDB | MySQL | NVIDIA Triton | Post-mortems | PostgreSQL | Prometheus | Python | REST | Root Cause Analysis | SELinux | STIG | Solr | Terraform | Vector Database