Staff Software Engineer (ML Platform)
New York, NY
EvolutionIQ
EvolutionIQ is the leading insurance claims guidance platform, providing faster recoveries and smarter decisions for carriersAbout Us: EvolutionIQ’s mission is to improve the lives of injured and disabled workers and enable them to return to the workforce, saving billions of dollars in avoidable costs and lost productivity to the US and global economies and make insurance more affordable for everyone. We are currently experiencing massive growth and to accomplish our goals, we are hiring world-class talent who want to help build and scale internally, and transform the insurance space. We’re backed by First Round Capital, FirstMark Capital, Foundation Capital, Brewer Lane Ventures, and have been named as Inc.’s top places to work! Our headquarters is in NYC and we are also remote friendly.
At EvolutionIQ, we are bringing together world-class technical talent who want to invent, solve, and create in an entirely new technology category. For our experts in machine learning, data science, applications, and technology integration, cracking the insurance industry’s previously ‘impossible’ big data problem with deep learning AI is our version of summiting K2 or Everest.
We are seeking a highly skilled and motivated Staff Software Engineer - ML Platform to lead the architecture, deployment, and scaling of our machine learning (ML) and artificial intelligence (AI) infrastructure. This role combines deep technical expertise with strategic impact to drive innovation in the ML pipeline, optimize end-to-end workflows, and ensure robust deployment in production environments. As a Staff ML Platform Engineer, you will collaborate closely with machine learning engineers and senior leadership to streamline experimentation, improve model observability, and enhance overall system performance. You’ll play a pivotal role in setting the MLOps standards across the organization.
In this Role, You Will:
- Design, build, and launch scalable ML and data processing systems supporting multi-machine data processing (e.g., MapReduce), GPU/TPU model training, and automated model monitoring systems on cloud platforms.
- Automate model lifecycle management, including training, evaluation, and deployment, to enable fast, safe, and consistent updates across environments.
- Introduce modern, scalable frameworks for model monitoring, feature engineering, hyperparameter tuning, and continuous re-training, ensuring robust model performance over time.
- Lead the deployment of models through REST and gRPC APIs, enabling smooth integration with application frontends and real-time user interaction.
- Continuously research, evaluate, and implement the latest MLOps tools, frameworks, and platforms to improve efficiency, scalability, and reliability of ML operations.
- Implement and manage monitoring systems to track model and data performance, proactively identifying and mitigating issues using tools like Prometheus and Grafana.
- Apply best practices in secure data handling and model integrity within ML workflows, ensuring regulatory and security compliance norms.
- Share MLOps knowledge and improvements in ML engineering workflows through internal training sessions and presentations.
- Support the next phase of ML pipeline development, focusing on building, maintaining, and monitoring pipelines to handle 10-100x increases in data volume.
We are looking for someone who:
- Exudes our ambitious, collaborative, and empathetic values
- Hold yourself to high standards, ensuring high quality and passion for the field in which you operate
- Remain agile and move between rapid prototyping and stable production development
- Write design documents, perform code reviews, and maintain state of the art engineering practices
- Open to giving and receiving critical feedback
- Believes in the mission of the company, cares about fundamental fairness
- Enthusiasm for team work and pair work
- Kind, empathetic, polite, and professional
- Utilizes data and metrics to drive decision-making, ensuring that the ML platform is optimized for performance, reliability, and scalability.
- Proactively identifies and addresses bottlenecks in the ML pipeline, leveraging their expertise to develop and implement innovative solutions that enhance productivity and performance.
Requirements:
Technical skills needed:
- 8+ years of software development experience with a focus on platform development with AI/ML applications of scale
- Experience in providing technical leadership to ML Infra / ML Platform teams.
- Experience in shipping products at scale.
- Expertise in clean and efficient coding with a focus on Python.
- Experience with orchestration frameworks such as Dagster/Airflow
- Expertise in one or more Cloud platforms (GCP preferred but not required)
- Bachelor’s Degree or higher in Computer Science, Mathematics, or related field
Additional skills needed:
- Excellent document writing skills (additional to presenting results through Jupyter notebooks)
- Extreme creativity and resourcefulness, appetite to solve previously unsolved problems
Work-life, Culture & Perks:
- Compensation: The range is $230-250K with flexibility depending on a candidate’s background and experience plus meaningful equity (stock options).
- Well-Being: Full medical, dental, vision, short- & long-term disability, 401k matching. 100% of the employee contribution up to 3% and 50% of the next 2%
- Work/Life Balance: For this role we are hoping this person can work out of the NYC office regularly with much of our leadership with flexibility. We also have a flexible vacation/PTO policy.
- Home & Family: 100% paid parental leave (4 months for primary caregivers and 3 months for secondary caregivers), sick days, paid time off. For new parents returning to work we offer a flexible schedule. We also offer sleep training to help you and your family navigate life schedules with a newborn
- Office Life: Catered lunches, happy hours, and pet-friendly office space. $500 for your in home office setup and $200/year for upgrades every year after your initial setup
- Growth & Training: $1,000/year for each employee for professional development, as well as upskilling opportunities internally
- Sponsorship: We are open to sponsoring candidates currently in the U.S. who need to transfer their active H1-B visa
EvolutionIQ appreciates your interest in our company as a place of employment. EvolutionIQ is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Airflow APIs Architecture Big Data Computer Science Dagster Deep Learning Engineering Feature engineering GCP GPU Grafana Jupyter Machine Learning Mathematics MLOps Model training Pipelines Prototyping Python Research Security
Perks/benefits: 401(k) matching Career development Equity / stock options Flex hours Flex vacation Health care Home office stipend Insurance Medical leave Parental leave Pet friendly Startup environment Team events
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.