Senior Machine Learning Ops Engineer

Remote - EU, US & Canada

Overstory

We help electric utilities optimize resources, mitigate vegetation risk, and future-proof their operations and maintenance programs for the growing climate and market challenges of our time. This intelligence helps ops teams direct...

View all jobs at Overstory

Apply now Apply later

The climate crisis is the defining challenge of our time—but it’s also the greatest opportunity for innovation, and a challenge we’re proud to take on. At Overstory, we’re harnessing cutting-edge technology to enable a resilient electrical grid that keeps communities thriving as our world changes.

The grid is the backbone of life as we know it. It powers hospitals, keeps food fresh, and ensures communities stay connected. But extreme weather, aging infrastructure, and growing wildfire risks are putting this critical system under pressure. All of this combined makes the electric utility industry the greatest opportunity for tackling climate change. 

One of the leading causes of catastrophic wildfires and power outages? Trees and brush coming into contact with power lines. 

That’s where we help. At Overstory, we use AI and advanced satellite imagery to pinpoint and prioritize vegetation risks before they materialize. By giving utilities critical analysis on those risks, we’re helping prevent outages, reduce wildfire risks, and accelerate the transition to a safer, more resilient grid.

Our team spans the Americas and Europe, and we work with utility partners across the Americas and beyond. We’re outdoor enthusiasts, musicians, artists, athletes, parents, and adventurers—15 nationalities strong and growing. What unites us is a passion for solving complex problems, a commitment to climate action, and the belief that technology should be a force for good.

Join us to help us build a more resilient world together.

The role

As a Senior Machine Learning Ops Engineer at Overstory, you will design and build the foundations of our machine learning operations, ensuring our models are reliable, maintainable, and deliver real value to customers. You’ll help architect end-to-end systems for experiment tracking, data management, and scalable deployment. As one of our first dedicated MLOps hires, you’ll have significant ownership and influence over our technical direction, balancing best practices with pragmatic delivery to help our teams move fast while maintaining trust and reliability in production. You’ll also collaborate closely with data engineers, data scientists, and machine learning engineers, as well as future MLOps colleagues.

What you’ll do

In collaboration with your data and ML colleagues, you will design, build, and maintain processes and systems such as:

  • automated pipelines for training, testing, and deploying ML models
  • experiment tracking systems for performance metrics, data and model versioning, and documentation
  • processes and systems for the full model lifecycle, including registries, release and rollback strategies, and scalable model serving
  • monitoring and alerting for prediction quality, system health, and cost optimization

You will also influence the direction of data and ML within Overstory by:

  • advocating for a balance between MLOps best practices and quick slices of value
  • aligning technical solutions with customer needs in collaborating with both engineering and product
  • ensuring our MLOps systems support regulatory, privacy, and security requirements

About you

  • You love working in a remote-first, fast-moving environment where collaboration and adaptability are essential.
  • 8+ years of experience with designing and building production-grade ML pipelines and systems – but don’t filter yourself out if you feel you’re a strong candidate with 5+ years.
  • Strong knowledge of experiment tracking, model deployment strategies, data versioning, and monitoring.
  • Experience with ML infrastructure tools (e.g. MLflow, Kubeflow, Airflow, feature stores, model registries).
  • Familiarity with GCP and VertexAI preferred, but not required.
  • Strong communication skills and ability to align technical solutions with business goals.
  • Comfortable making architectural decisions and balancing best practices with practical trade-offs.

Nice-to-haves

  • Experience in remote-first or globally distributed teams.
  • Background in geospatial or spatio-temporal data processing.
  • Prior work on real-time prediction systems or active-learning loops.
  • Knowledge of regulatory, privacy, or security considerations in ML.
  • Experience optimizing cloud infrastructure costs for ML workloads.
  • Familiarity with Overstory’s mission domains (e.g. satellite imagery, forestry, utilities)

What you get

  • To be part of truly mission-driven work that reduces wildfires, protects earth’s natural resources and helps solve our climate crisis.
  • Flexible working environment with a lot of autonomy. We build our work days around our lives, not the other way around.
  • Other benefits like a remote working budget, an educational budget and time to develop new skills.
  • To be surrounded by an excellent, vibrant, smart team who have each other's back and believe in a culture of openness, tolerance and respect.
  • Equity and a competitive salary.

About our team

We are a group of 85 people from all over the world. Fifteen nationalities are represented in our team. We work remotely from nine different countries and we are looking for candidates that are also living and working in one of these countries: United States, the Netherlands, United Kingdom, Ireland, Estonia, Portugal, France, Sweden, Denmark, Switzerland, and Canada. We meet up once a year in-person for our unforgettable team gathering event. We also offer the option to occasionally meet up for in-person collaboration.

Diversity & Inclusion

We place enormous value on diversity and inclusion and strive to continually bring in people of all genders, races, creeds, ethnicities, abilities and backgrounds. We believe that the best ideas emerge when people with different perspectives and approaches work together on a problem.

We’re always looking to diversify our team further, but we’re proud of the fact that four out of the nine people on our leadership team are female, 46% of the overall team are female and 20% of the team are people of color. Our team speaks fifteen languages: English, Dutch, French, Spanish, German, Italian, Portuguese, Russian, Luxembourgish, Lithuanian, Bulgarian, Cantonese, Estonian, Danish and Korean.

Our values

Tackling the climate crisis is our greatest mission.

We act with urgency.

Our curiosity fuels our growth.

We recognize that change is constant, and we find joy and power in exploration.

We’re rooted in diversity.

Just as ecosystems need biodiversity to thrive, our resiliency comes from our differences.

We care for each other.

We love the power of machines but we nurture each other as humans.

Trust is fundamental.

We assume the best in everyone, and we share ideas openly so that we have a positive impact.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0

Tags: Airflow Data management Engineering GCP Kubeflow Machine Learning MLFlow ML infrastructure ML models MLOps Model deployment Pipelines Privacy Security Testing Vertex AI

Perks/benefits: Career development Competitive pay Equity / stock options Flex hours Health care

Regions: Remote/Anywhere North America
Countries: Canada United States

More jobs like this