Site Reliability Engineering (English required)

Mexico City, Mexico City, Mexico - Remote

DaCodes

Empower your company's future with DaCodes' world-class software solutions and expert team. Scale efficiently, innovate, and transform your business today.

View all jobs at DaCodes

Apply now Apply later

Join DaCodes!

We are a high-impact software and digital transformation firm.

For over 10 years, we have developed technology-driven and innovative solutions thanks to our team of 220+ talented #DaCoders, including developers, architects, UX/UI designers, PMs, QA testers, and more. Our team collaborates on projects with clients across LATAM and the United States, delivering outstanding results.

At DaCodes, you will have the opportunity to grow professionally, work on a variety of projects across different industries, and optimize and maintain highly available, scalable infrastructure for our clients.

Our DaCoders play a crucial role in the success of our company and our clients. You will have the chance to work with disruptive startups and global brands while contributing your expertise to impactful projects.

Sounds interesting?

We are looking for talented professionals to join our team—let’s work together!

Requirements

Site Reliability Engineer (SRE)

Role Overview

We are looking for a Site Reliability Engineer (SRE) who thrives in solving operational and development challenges using cutting-edge technologies and methodologies. The ideal candidate will have a strong understanding of cloud infrastructure, automation, container orchestration (Kubernetes), and CI/CD pipelines. This role involves working with diverse teams to ensure system reliability, automation, and optimized performance in production environments.

Key Responsibilities

Automate infrastructure management using tools such as Terraform, Ansible, and CloudFormation.
Develop and manage CI/CD pipelines using tools like Jenkins.
Architect and maintain scalable systems in data centers and cloud environments.
Manage containerized environments, with hands-on experience in Kubernetes and ECS.
Automate routine tasks, optimize deployments, and ensure reliability of production systems.
Collaborate with cross-functional teams to improve performance, reliability, and scalability.
Analyze and debug issues, ensuring timely resolutions and minimal downtime.
Monitor applications, systems, and databases using tools like Prometheus, Grafana, and Elasticsearch.
Troubleshoot network issues and automate network configurations with pipeline tools.
Participate in technical discussions, bringing real-world solutions and contributing to architectural decisions.

Required Qualifications

🔹 5+ years of experience in Site Reliability Engineering or similar roles.
🔹 Proficiency in cloud computing platforms like AWS, with advanced expertise in network infrastructure (load balancers, subnets, gateways, NAT, etc.).
🔹 Strong experience with container orchestration tools like Kubernetes, ECS, and Docker.
🔹 Advanced skills with CI/CD tools (Jenkins, ArgoCD, Terraform, CloudFormation).
🔹 Experience with monitoring tools such as Prometheus, Grafana, and Elasticsearch.
🔹 Proficient in scripting and development languages (Go, Python, Ruby, Bash).
🔹 Experience with system and application debugging, and ensuring high availability.
🔹 Strong problem-solving and troubleshooting abilities in cloud and on-prem environments.
🔹 In-depth understanding of networking (IPv4, IPv6, BGP, etc.).
🔹 Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
🔹 Excellent communication and interpersonal skills to collaborate effectively with teams.

Nice-to-Have Skills (Preferred)

AWS Certified Solutions Architect or SysOps Administrator.
✅ Familiarity with Agile software development methodologies, such as Scrum or Kanban.
Experience with application monitoring and alerting systems.
Familiarity with Machine Learning applications for infrastructure optimization.

Benefits

🚀 Work with global brands and disruptive startups.
🏡 Remote work / Home office.
📍 If a hybrid or on-site model is required, you will be informed from the first session.
Work schedule aligned with the assigned project/team.
📅 Monday to Friday schedule.
⚖️ Legal benefits (Applicable for Mexico).
🎉 Day off on your birthday.
🏥 Private health insurance (Applicable for Mexico).
🛡️ Life insurance (Applicable for Mexico).
🌎 Multicultural teams.
🎓 Access to courses and certifications.
📢 Meetups with industry experts and top universities.
📡 Virtual networking events and interest groups.
📢 English classes.
🏆 Opportunities within our different business lines.
🏅 Proudly certified as a Great Place to Work.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0

Tags: Agile Ansible AWS CI/CD CloudFormation Computer Science Docker ECS Elasticsearch Engineering Grafana Jenkins Kanban Kubernetes Machine Learning Pipelines Python Ruby Scrum Terraform UX

Perks/benefits: Career development Health care Team events

Regions: Remote/Anywhere North America
Country: Mexico

More jobs like this