Software: Operations & Reliability Lead
Tasks
- Automate operational tasks
- Build monitoring dashboards
- Conduct security audits
- Coordinate NOC and SOC support
- Enforce configuration baselines
- Implement automated alerts
- Implement cost optimization
- Integrate AI Ops capabilities
- Maintain operational readiness
- Manage backup and recovery workflows
- Manage incident response
- Perform performance testing
- Perform root cause analysis
- Provision cloud and on premise resources
- Run penetration testing
- Test disaster recovery plans
Perks/Benefits
- N/A
Skills/Tech-stack
AI Ops | Automated Alerts | Azure Monitor | Backup and Recovery | CI/CD | Cause analysis | Cloud Platforms | Configuration Management | Cost Optimization | Datadog | Disaster Recovery | Incident Response | Infrastructure as Code | Monitoring | NOC Support | New Relic | Penetration Testing | Performance Testing | Prometheus | Resilience Engineering | Resource Governance | Root Cause Analysis | Root cause | SOC support | Security Compliance | Security auditing | Threat detection | “as-code”
Education
Related jobs
-
AI Solutions Engineer (PUERTO RICO) USD 77K-163KAI | AIOps | AWS | Ansible | Azure401k plan | Dental insurance | Disability coverage | Employee assistance program | Employee scholarship programMid-level Full TimeUS-PR-SANTA ISABEL-B1 ~ Felicia Industrial Park …26d ago