Engineering Manager - DevOps

Columbus, OH

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Loop

Loop is the returns management software that helps ecommerce brands save time and money, retain more revenue, and drive customer loyalty. Book a demo today.

View all jobs at Loop

Apply now Apply later

About the Engineering Organization:The Engineering Team at Loop is a balance of agility, consistency, and performance. These are the pillars that allow the team to constantly and consistently deliver value that matters to customers. That customer intimacy is what allows our engineering teams to be the best in our space, and bring the best ideas to the market.
About the Role:We’re looking for a DevOps Engineering Manager who can lead a high-performing DevOps group while staying technically close to the work. Your north star is self-service: giving every engineer the power to ship, observe, and recover their services without opening a ticket. You’ll own the evolution of our AWS-based platform, designing multi-region architectures, advanced deployment patterns (blue/green, canary, feature flags), and resilient machine learning pipelines, so Loop can keep doubling traffic and transaction volume without doubling headcount. In partnership with Staff and Directors, you’ll help set the vision, mentor the team, and roll up your sleeves to write Terraform, tune Kubernetes, and debug prod incidents when it counts.
Our Blended Work Environment: At Loop, we’re intentional about the way we work so that we can do our best work. We call this our Blended Working Environment. We work from our HQ in Columbus, OH, or one of our Hub or Secluded locations, and are distributed throughout the United States, select Canadian provinces, and the United Kingdom. For this position, we’re looking for someone to join us in Columbus, OH; Chicago, IL; Austin, TX; Los Angeles, CA; or fully remote.
Our Tech Stack: AWS Cloud (Kubernetes, Serverless architecture, Redis, Aurora, DynamoDB, and identity management), Docker, MLFlow, Gitlab, Airflow, PHP/Laravel, Linux, Terraform, Datadog, Snowflake, dbt

What You’ll Do:

  • Drive self-service and automation at Loop by designing golden-path workflows so product teams can provision infrastructure, integrate monitoring, and release safely on their own.
  • Lead the strategy and execution for scaling to multi-region by implementing active-active/active-standby architectures, cross-region data replication, and global traffic management.
  • Champion the evolution of deployment patterns, including blue/green, canary, feature-flag, and immutable-infra releases that minimize risk and Mean Time To Recovery (MTTR).
  • Implement Site Reliability Objectives (SLOs), error budgets, chaos testing, and auto-remediation playbooks to raise the reliability bar, and own the infrastructure on-call rotation culture.
  • Mentor and develop a diverse team of DevOps engineers, DBAs, and MLOps Engineers with a wide variety of technical skillsets, cultivating the next wave of engineering leaders.
  • Partner hand-in-hand with Product Engineering Teams, our Data Team, and other stakeholders to align roadmaps and unlock velocity.
  • Contribute hands-on by writing Terraform modules, optimizing Helm charts, reviewing merge requests, helping support the team with reactive work, and joining high-severity incident calls when needed.

Your experience:

  • 4+ years of proven DevOps leadership managing DevOps or Site Reliability Engineering (SRE) teams, along with 7+ years in hands-on platform or infrastructure roles.
  • You have a strong self-service track record, having delivered internal platforms or portals that empowered hundreds of engineers to ship autonomously.
  • You bring large-scale AWS expertise, demonstrated by designing and operating multi-region, high-throughput systems that support over $100 million in annual Gross Merchandise Value (GMV).
  • An expert in advanced CI/CD pipelines and Infrastructure as Code (IaC), proficient with tools like GitLab CI (or similar), Terraform, and Kubernetes, and are comfortable introducing progressive delivery, policy-as-code, and secrets management at scale.
  • Deep understanding of metrics, tracing, logging, and alerting, reflecting an observability mindset, with experience using Datadog or comparable stacks.
  • Familiar with cloud security best practices, least privilege access, and regulated-data environments such as PCI, SOC 2, and GDPR, demonstrating security and compliance awareness.
  • You exhibit empathetic leadership, with demonstrated success fostering psychological safety, inclusion, and continuous feedback while driving accountability and high performance.
#LI-ST1
Loop Story
In a perfect world, Loop wouldn't exist. If we had our way, we'd live in a world where we're mindful about how we consume, we love every product we own, and we share values with the brands who create them. In reality, commerce isn't perfect and often breaks. Loop creates second chances.
We're starting by revolutionizing the post-purchase experience. We've taken one of the most fragile commerce interactions - returns - and turned it into something consumers actually love, and that deepens our connection to brands and products.
We take connection seriously on the inside, too. We're building a work experience that allows you to Be A Human First and prioritizes empathy and wellbeing. We view Loop as a special place in your career to shape the future of an industry and become a better person while doing it. You can grow faster here in a shorter amount of time - we'll give you space and trust you to fill it.
Learn more about us here: https://loopreturns.com/careers.
You can review our privacy notice here.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0

Tags: Airflow Architecture AWS CI/CD dbt DevOps Docker DynamoDB Engineering GitLab Helm Kubernetes Linux Machine Learning MLFlow MLOps PHP Pipelines Privacy Security Snowflake Terraform Testing

Regions: Remote/Anywhere North America
Country: United States

More jobs like this