Infrastructure/DevOps Engineer

Remote - USA

Abnormal Security

Advanced email protection to prevent credential phishing, business email compromise, account takeover, and more.

View all jobs at Abnormal Security

Apply now Apply later

About the Role

We’re looking for an Infrastructure/DevOps Engineer to join our IT team supporting AI platforms at Abnormal Security. In this high-impact role, you will enable our AI software engineers to move fast by building and maintaining reliable, scalable, and secure infrastructure. You will partner closely with IT, security, and AI/ML engineering teams to ensure that our foundational systems support experimentation, deployment, and monitoring of advanced AI tools and solutions.

This role is ideal for someone who thrives at the intersection of systems engineering and AI enablement, and who loves solving complex operational challenges to unlock innovation across the company.  This is a fully Remote role for candidates living in the United States, or Canada.

Who you are

A skilled Infrastructure/DevOps Engineer with a solid background in cloud infrastructure and platform engineering. You’re someone who loves building scalable systems that help other engineers move faster and work more efficiently. You thrive in collaborative, cross-functional environments and have a customer-first mindset—your customers are internal teams, and your goal is to remove blockers and boost productivity. You’re passionate about automation, self-service tools, and building reliable systems. You value security, observability, and operational excellence, and you communicate clearly through both documentation and collaboration. You stay up to date with trends in DevOps and AI infrastructure, and you're driven by impact, ownership, and continuous improvement—always moving fast without sacrificing quality.

What you will do

  • Architect and manage infrastructure that supports AI/ML pipelines, tools, and data platforms.
  • Implement and maintain containerization (e.g., Docker) and orchestration (e.g., Kubernetes) environments.
  • Develop CI/CD systems that integrate with ML workflows and ensure reproducible AI experiments.
  • Collaborate with security and compliance teams to ensure infrastructure meets data protection standards.
  • Automate provisioning and deployment using IaC tools like Terraform or Pulumi.
  • Monitor and troubleshoot infrastructure issues with tools like Prometheus, Grafana, and ELK stack.
  • Partner with AI and software engineers to optimize platform performance and resource utilization.
  • Maintain clear, accessible documentation to scale platform knowledge across the org.

Must Haves

  • 4+ years of experience in DevOps, SRE, or Infrastructure Engineering roles.
  • Proficiency with cloud providers (AWS preferred), Kubernetes, and Docker.
  • Experience with infrastructure as code tools (Terraform, Ansible, or Pulumi).
  • Strong scripting skills in Python, Bash, or similar.
  • Familiarity with CI/CD systems such as GitHub Actions, Jenkins, or CircleCI.
  • Understanding of networking, security, and identity management in cloud environments.
  • Experience supporting ML workloads and GPU-based infrastructure.
  • Ability to troubleshoot complex system issues in a distributed environment.
  • Comfortable working across functional teams and communicating with technical and non-technical stakeholders.

Nice to Have

  • Familiarity with MLOps tools like MLflow, Kubeflow, or SageMaker.
  • Experience with AI platform infrastructure (e.g., model serving, feature stores).
  • Knowledge of logging and monitoring frameworks (e.g., Fluentd, Loki).
  • Background in supporting data platforms like Snowflake, Databricks, or Hadoop.
  • AWS Certified
  • Experience working in high-growth startups or tech companies.

#LI-MA1


At Abnormal AI, certain roles are eligible for a bonus, restricted stock units (RSUs), and benefits. Individual compensation packages are based on factors unique to each candidate, including their skills, experience, qualifications and other job-related reasons. We know that benefits are also an important piece of your total compensation package. Learn more about our Compensation and Equity Philosophy on our Benefits & Perks page.

Base salary range:$114,800—$135,000 USD
Apply now Apply later
Job stats:  2  0  0
Category: Engineering Jobs

Tags: Ansible AWS CI/CD Databricks DevOps Docker ELK Engineering GitHub GPU Grafana Hadoop Jenkins Kubeflow Kubernetes Machine Learning MLFlow ML infrastructure MLOps Pipelines Python SageMaker Security Snowflake Terraform

Perks/benefits: Equity / stock options Salary bonus Startup environment

Regions: Remote/Anywhere North America
Country: United States

More jobs like this