Site Reliability Engineer II
USD 130K-140K Senior-level Full Time
Tasks
- Automate monitoring and incident response
- Build and maintain CI/CD pipelines
- Collaborate with cross-functional teams
- Deploy and maintain SaaS platform
- Design and deploy AI/ML infrastructure
- Develop operations automation tools
- Drive disaster recovery process
- Enforce policies and audit production systems
- Estimate engineering effort plan rollout changes
- Implement security controls
- Integrate MLOps tools
- Manage GPU resources
- Participate in on-call rotation
- Perform patching and configuration management
- Perform root cause analysis and blameless post-mortems
- Provide deployment and operations support
- Scale infrastructure to meet demand
Perks/Benefits
- Health benefits
- Life insurance
- On-call compensation
- Paid time off
- Parental leave
- Retirement benefits
Skills/Tech-stack
AKS | Amazon Web Services | Ansible | Argo CD | ArgoCD | Auto-remediation | CI/CD | CIS | Cause analysis | Cloud platform | CloudFormation | Docker | EKS | Elasticsearch | FIPS 140-3 | FIPS-140 | GPU provisioning | GitHub | GitHub Actions | GitOps | Go | Google Cloud | Google Cloud Platform | Grafana | Helm | Incident Response | Infrastructure as Code | JSON | Java | Jenkins | Kubeflow | Kubernetes | Kubernetes Upgrades | Learning operations | Linux | MLflow | Machine Learning | Machine Learning Operations | Microsoft Azure | MongoDB | MySQL | NVIDIA Triton | Post-mortems | PostgreSQL | Prometheus | Python | REST | Root Cause Analysis | Root cause | SELinux | STIG | Solr | Terraform | Vector Database | Web Services | “as-code”
Education
Related jobs
-
Availability Management | Database Administration | Database Integrity | Incident Management | On-CallRemote workMid-level Full TimeNew York, United States of America R6h ago
-
Agile | Algorithms | CI/CD | Data Engineering | Data Structures401k plan | Accident insurance | Dependent care FSA plan | HSA and FSA | Hospital indemnity insuranceSenior-level Full TimeAustin, Texas or Remote R17h ago
-
Pre-Sales Data Scientist USD 70K-90KAPIs | AUC | Credibility Testing | Cross-validation | Data AnalysisCompany sponsored volunteering days | Discounted private health insurance | Extra paid time off | Fully remote within continental United States | Generous parental leaveMid-level Full TimeCarlsbad, CA, United States R17h ago
-
Senior Data Engineer USD 162K-242KCloud Platforms | Data Lakes | Data Modeling | Data Pipelines | Data Transformation401k match | Company holidays | Dental insurance | Life insurance | Long-term disabilitySenior-level Full TimeUSA - MA - Cambridge, United … R20h ago
-
Senior Applied ML Engineer USD 180K-200KAWS | Amazon Web Services | Computer Vision | Distributed Systems | Docker401k match | Disability Leave | Family building benefits | Health benefits | Life insuranceSenior-level Full Time06083 GameChanger, United States R20h ago
-
Freelance Creative Technologist, Applied AI USD 150K-190KAPI | Agentic Workflows | ComfyUI | ControlNet | Embeddings401k match | Dental | Healthcare | Paid Holidays | Paid time offMid-level FreelanceUnited States R20h ago
-
Senior AI/ML Engineer USD 192K-240KAWS | Agentic Orchestration | Automated Labeling | Automated Labeling Pipelines | AzureHybrid work scheduleSenior-level Full TimeRedwood City, CA (Hybrid); San Francisco, … R22h ago
-
Senior Software Engineer, Data Platform USD 146K-230KAPI Gateway | API Versioning | AWS Lambda | AWS RDS | AWS S3401k match | Catered lunch | Commuter benefit | Dental insurance | Fully paid parental leaveSenior-level Full TimeNew York, New York, United States R23h ago
-
Senior Forward Deployed AI Engineer USD 160K-226KData Pipelines | DevOps | Distributed Systems | Edge Computing | JAX401k match | Continuing education support | Function health subscription | Health & wellness stipend | Health, dental, vision benefitsSenior-level Full TimeAustin, TX R23h ago
-
Machine Learning Engineer – Search & Retrieval Systems USD 225K-280KA/B | A/B Testing | ANN indexing | B testing | CTR401k plan | Company holidays | Equity stock options | Flexible PTO | Fully remote United StatesSenior-level Full TimeRemote - USA R1d ago
-
Data Engineer USD 150K-200KAWS | Apollo | ClickHouse | Data Pipelines | Data Warehousing401k | Dental insurance | Equity compensation | FSA | HSAMid-level Full TimeRemote - US R1d ago
-
Data Scientist III - AI & Machine Learning USD 126K-149KAI Pipelines | Airflow | Apache Spark | Bayesian Inference | Bias Variance401k matching | Equity options grant | Flexible time off | Get Out Get Active funds | Health benefitsSenior-level Full TimeMissoula, Montana, United States R1d ago
-
AI Engineer, Marketing USD 112K-160KA/B | A/B Testing | AI-powered analytics | Agentic Workflows | Artificial IntelligenceCommuter benefits | Employee assistance program | Equity | Health savings account | Home office reimbursementMid-level Full TimeArlington, VA R1d ago
-
Senior Staff Data Engineer USD 127K-158KAI machine learning | API Integration | AWS | Amazon ECR | Amazon S311 company-paid holidays | 401k match | Employee assistance program | Employee ownership program | Life and AD and D insuranceSenior-level Full TimeRemote United States R1d ago
-
Data Engineer USD 120K-160KAWS | Airflow | DBT | Data Warehousing | ELT401k plan | Commuter benefits | Employee Meals Snacks | FSA | Flexible time offSenior-level Full TimeLos Angeles; New York; Remote; San … R1d ago
-
Agile | Backend Development | Code review | Data Science | DebuggingFully remote | Mentorship | Paid apprenticeshipEntry-level Apprenticeship ContractRemote (United States) R1d ago
-
AI Software Engineer - Minneapolis USD 80K-120KAI coding | AI coding assistant | Agents | Automation | C#Collaborative environment | Comprehensive benefits package | Employee ownership | Flexible workplace | Innovative cultureEntry-level Full TimeSt. Louis Park, Minnesota, United States R1d ago
-
Machine Learning Engineer USD 130K-194KAI machine learning | AWS AI | AWS AI Machine Learning | Amazon DynamoDB | Amazon EC2Professional development | Work from homeMid-level Full TimeRemote, NY, US R1d ago
-
Open Source Program Developer USD 153K-187KC# | Documentation | Go | Java | JavaScriptBuddy program | Community guilds | Employee stock purchase plan | Inclusion talks | Mental health benefitsMid-level Full TimeDistrict of Columbia, USA, Remote; Massachusetts, … R1d ago
-
Senior Optimization Engineer USD 140K-180KAPIs | Asynchronous orchestration | Auditability | CP-SAT | Combinatorial OptimizationRemote workSenior-level Full TimeSchenectady, New York, United States, Remote R1d ago
-
Data Scientist, Analytics (Technical Leadership) USD 160K-190KAI Workflow Optimization | AI workflow | Agent Orchestration | Bias Mitigation | Causal InferenceCareer development | World class analytics communitySenior-level Full TimeRemote, US | Bellevue, WA | … R1d ago
-
Machine Learning Engineer/Data Scientist USD 130K-150KAWS | Azure | Data Preprocessing | Deep learning | Feature EngineeringMentorship | Personal development budget | Remote workMid-level Full TimeChicago, IL R1d ago
-
Machine Learning Engineer/Data Scientist USD 130K-150KAWS | Azure | Google Cloud | Matplotlib | NumPyAccess to mentors | Daily meals in office | Diverse team | Personal development time | Remote work weeksMid-level Full TimeChicago, IL R1d ago
-
Data Engineer USD 89K-141KAWS Glue | AWS Lambda | Access Control | Amazon Kinesis | Amazon QuickSightMid-level Full TimeUnited States R1d ago
-
Senior / Staff ML Optimization Engineer USD 141K-249KBazel | C++ | CPU Profiling | CUDA | Distributed TrainingAnnual performance bonus | Catered meals | Equity awards | Flexible hours | Health and wellness benefitsSenior-level Full TimeRemote US & Canada R1d ago