Senior AI/ML Engineer

US - Remote

ValidMind

ValidMind is the most efficient solution for governance & risk management of AI and LLMs. Automate model documentation and validation for all model types.

View all jobs at ValidMind

Apply now Apply later

Senior AI/ML Engineer

Location: Remote (US)

About ValidMind

ValidMind empowers financial services organizations to bring more trust and transparency to the world’s AI/ML/LLM models. With the rapid evolution of AI, increased regulatory scrutiny, and lack of fit-for-purpose tooling, financial services’ Model Risk Management (MRM) and AI Governance functions are under enormous pressure to ensure compliance. We are passionate about helping these organizations seamlessly and confidently test, validate, and document their business’ AI models while ensuring compliance with domestic and international AI and model risk regulations.

With its platform growing in scope and adoption, ValidMind is seeking an experienced and motivated Lead ML Engineer to help design, build, and scale our machine learning validation tools. In this role, you will play a key part in developing and maintaining the core frameworks that support a global community of model developers and validators.

Senior AI/ML Engineer

As a Senior AI & Machine Learning Engineer at ValidMind, you will be a hands-on contributor to our Python‑based platform for LLM‑powered model validation, documentation, and risk management. You will implement and optimize robust, scalable frameworks that operationalize both traditional machine‑learning and generative‑AI solutions for model governance and compliance. This role partners closely with product, data‑science, and platform engineering teams to deliver reliable, high‑quality AI‑driven capabilities for our customers.

What You’ll Do & Your Impact:

  • Design, build, and maintain production‑grade AI/Agentic components for ValidMind’s products (e.g., Model Risk framework, Risk Assessment engine, AI‑powered documentation tools).
  • Contribute to the evolution of our core Python and client libraries, integrating LLMs and agentic workflows with a focus on scalability, observability, and performance.
  • Implement best practices in prompt engineering, LLM orchestration, and automated evaluation to ensure model quality, safety, and transparency.
  • Collaborate with cross‑functional teams to translate regulatory and compliance requirements into resilient technical solutions.
  • Own end‑to‑end features, from technical design and experimentation through code review, testing, deployment, and monitoring, while following CI/CD and infrastructure‑as‑code standards.
  • Author clear, user‑focused technical documentation and offer technical guidance to peers in Python, AI/ML, and LLM topics.
  • Stay current with advancements in machine learning, generative AI, model governance, and the Python ecosystem, proactively proposing improvements to the platform.

Who You Are & What Makes You Qualified:

  • 5+ years of professional experience developing production software in Python, with a strong background in data science and machine learning.
  • Demonstrated expertise building modular, maintainable code for AI/ML systems, including experience with LLMs/generative‑AI models in production.
  • Proficiency with key ML frameworks (e.g., scikit‑learn, PyTorch, XGBoost), AI/LLM frameworks (e.g., OpenAI, LangChain), and data‑processing libraries (e.g., Pandas, NumPy).
  • Hands‑on experience operationalizing LLMs: prompt design, orchestration pipelines, evaluation, and monitoring.
  • Strong problem‑solving skills and the ability to communicate complex technical concepts to both technical and non‑technical stakeholders.

Nice-to-Haves:

  • Domain experience in financial services or similar regulated industries.
  • Familiarity with regulatory or compliance considerations in machine learning and AI (model risk management, documentation, or related areas).
  • Knowledge of LLM safety, prompt‑engineering best practices, and quantitative evaluation techniques.
  • Experience developing SDKs or open‑source libraries for AI/LLM applications.
  • Familiarity with cloud platforms (e.g., AWS) and tools for deploying AI/LLM workloads at scale (Docker, Kubernetes, Terraform).
  • Background in cross‑platform library development or multi‑language bindings.
  • Contributions to open‑source AI/ML, LLM, or data‑science communities.

 

What does ValidMind offer?

  • A total compensation package, including base salary, equity incentive, paid time off, and benefits
  • An inclusive, remote-first work culture with regular get-togethers around the world
  • Work-life balance with a flexible work schedule
  • Great career development prospects

At ValidMind, we create the most efficient solution for organizations to automate testing, documentation, and risk management for AI and statistical models. Working here means being at the forefront of AI risk management, but it’s also more personal than that: we promote an inclusive culture where we value your ideas and creativity. We want you to have a sense of ownership over your work, to build mutual trust with your peers, and to feel supported in everything you do. As a VC-backed company in the early stages of growth, there is ample room to grow.

Salary: $183,000-$229,000 USD base salary, depending on experience and location (paid in local currency)

Apply now Apply later
Job stats:  1  0  0

Tags: AI governance AWS CI/CD Docker Engineering Generative AI Kubernetes LangChain LLMs Machine Learning NumPy OpenAI Pandas Pipelines Prompt engineering Python PyTorch Scikit-learn Statistics Terraform Testing XGBoost

Perks/benefits: Career development Equity / stock options Flex hours Flex vacation Startup environment

Regions: Remote/Anywhere North America
Country: United States

More jobs like this