Senior Site Reliability Engineer

Toronto - Remote

Apply now Apply later

About the Role

Are you passionate about ensuring the seamless operation of large-scale, distributed, and robust systems? Do you thrive on optimizing performance, increasing reliability, and automating tasks to create more efficient processes? Are you hungry for learning? If so, we would want to chat to you!

As a Senior Site Reliability Engineer (SRE) / DevOps Engineer at our organization, you'll play a pivotal role in combining software and systems engineering to build, maintain, and enhance our mission-critical services. You'll be responsible for guaranteeing the reliability and uptime of both internal and external systems, all while driving continuous improvement at a rapid pace.

Minimum Requirements

  • 5 years of experience as a Site Reliability Engineer or DevOps Engineer working with software and infrastructure.

  • A bachelor’s degree in Computer Science, a related technical field, or equivalent practical experience.

  • Proficiency with one or more of the following: Python, Javascript, Ruby, Groovy, PHP, or Bash.

  • A deep, hands-on understanding of AWS, GCP, or (preferably) Azure.

  • Bonus: Experience with high availability systems, troubleshooting and debugging production code, application deployment, data pipelines, distributed systems, Snowflake and/or relational databases are all considered to be assets. 

Role Responsibilities

  • Collaborate with a diverse team of software engineers, engaging in iterative processes and effective task planning to drive our projects forward.

  • Take ownership of the end-to-end availability and performance of our services, proactively identifying potential issues, and implementing automation to prevent the recurrence of problems.

  • Participate in an on-call rotation, ensuring our systems remain stable and responsive even during off-hours.

  • Foster collaboration with other engineering teams, promoting the reuse of existing frameworks and gaining insights into their operation.

  • Lead the development, implementation, and achievement of service-level objectives that are instrumental in maintaining product reliability.

  • Collaborate with software engineering teams to design, implement, and maintain CI/CD pipelines, enabling rapid and reliable software releases.

  • Automate and optimize our infrastructure provisioning, configuration, and management processes using Infrastructure as Code (IaC) principles with tools like Terragrunt and Ansible, ensuring repeatability, security, and version-controlled infrastructure.

  • Implement and manage containerization and orchestration technologies to enhance scalability and resource utilization.

  • Maintain and enhance version control systems and repositories for codebase management.

  • Steer and drive the SRE / DevOps roadmap with your team while actively engaging in negotiation and strategic planning to ensure its successful execution.

  • Stay current with industry trends, emerging technologies, and best practices in SRE, DevOps, and automation.

  • Lead and contribute to blameless postmortems following incidents, driving a culture of learning and accountability by documenting root causes and ensuring follow-through on corrective actions.

  • Design, implement, and maintain comprehensive observability and monitoring solutions (metrics, logging, tracing) to ensure deep visibility into service health and to enable rapid root cause analysis and alerting.

About Vantage

Vantage plays in a $250BN addressable market in North America that is seeing significant disruption. Retailers are transforming their digital marketing practices to drive customer acquisition and are looking for new profit centers in retail media networks.

Vantage is uniquely positioned in this space, having established a technology platform that is custom-built for retail media. We offer the only turnkey platform for integrated retail media networks. We significantly outperform online media benchmarks by leveraging automation, machine learning, and AI. Ours is the market-leading platform and we have real traction with some of the biggest names in retail.

We’re a fully remote team rooted in Toronto, made up of diverse, creative, and fun individuals. While we enjoy the flexibility of remote work, we come together in person once a month to foster meaningful, face-to-face collaboration.

At Vantage, we believe our commitment to diversity and inclusion makes us stronger, more thoughtful, and more innovative. We also believe people thrive when they have the freedom to shape their own work and life environments—and when they're surrounded by brilliant teammates, tackling ambitious challenges together.

In addition to your compensation, enjoy the rewards of an organization that puts our heart into building a team. Vantage offers a full range of medical, dental, and vision benefits. All employees are also owners, as everyone is enrolled into the Vantage Employee Stock Option Plan. Vantage provides strong maternity / parental leave benefits in all jurisdictions.

Vantage also offers numerous well-being programs, education assistance, development courses, and discount programs with participating partners. As for time off, Vantage employees enjoy generous vacations, as well as paid holidays throughout the calendar year.

Vantage and Diversity

Vantage Analytics is fueled by the diversity of our talented employees. We are an equal opportunity employer and embrace ALL individuals and what makes them unique. We believe our employees should be happy and healthy, with peace of mind and a sense of fulfillment. We encourage all individuals to apply for positions that fit their passions. 

We promote equality and strive to provide all current and prospective employees with support and opportunities. Reasonable accommodations are available to job applicants on request and throughout the application process.

We thank all applicants in advance for their interest in this position; however, only those selected for an interview will be contacted.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0

Tags: Ansible AWS Azure CI/CD Computer Science Data pipelines DevOps Distributed Systems Engineering GCP JavaScript Machine Learning PHP Pipelines Python RDBMS Ruby Security Snowflake

Perks/benefits: Career development Equity / stock options Flex vacation Health care Medical leave Parental leave Salary bonus Team events

Regions: Remote/Anywhere North America
Country: Canada

More jobs like this