Member of Technical Staff - Machine Learning Engineer, Inference (PyTorch)

Boston

Full Time Senior-level / Expert USD 147K - 274K * ^est.

Liquid AI

We build capable and efficient general-purpose AI systems at every scale. Liquid Foundation Models (LFMs) are a new generation of generative AI models that achieve state-of-the-art performance at every scale, while maintaining a smaller memory...

Posted 4 weeks ago

Liquid AI, an MIT spin-off, is a foundation model company headquartered in Boston, Massachusetts. Our mission is to build capable and efficient general-purpose AI systems at every scale.
Our goal at Liquid is to build the most capable AI systems to solve problems at every scale, so that users can build, access, and control their AI solutions. This ensures that AI is integrated meaningfully, reliably, and efficiently at every enterprise. Long term, Liquid will create and deploy frontier-AI-powered solutions that are available to everyone.
We are hiring an ML Engineer (Inference) to build and optimize the end-to-end serving stack for Liquid AI’s foundation models. You will develop the pipeline between a trained model checkpoint and a production-grade, low-latency API. This is a highly technical role operating on the frontier of AI inference research and production.

Desired Experience

PyTorch
Python
Model-serving frameworks (e.g. TensorRT, vLLM, SGLang)

You're A Great Fit If

You have experience building large-scale production stacks for model serving.
You have a solid understanding of ragged batching, dynamic load balancing, KV-cache management, and other multi-tenant serving techniques.
You have experience applying quantization strategies (e.g., FP8, INT4) while safeguarding model accuracy.
You have deployed models in both single-GPU and multi-GPU environments and can diagnose performance issues across the stack.

What You'll Actually Do

Optimize and productionize the end-to-end pipeline for GPU model inference around Liquid Foundation Models (LFMs).
Facilitate the development of next-generation Liquid Foundation Models through the lens of GPU inference.
Profile and harden the stack for different batching and serving requirements.
Build and scale pipelines for test-time compute.

What You'll Gain

Hands-on experience with state-of-the-art technology at a leading AI company.
Deeper expertise in machine learning systems and efficient large model inference.
Opportunity to scale pipelines that directly influence user latency and experience with Liquid's models.
A collaborative, fast-paced environment where your work directly shapes our products and the next generation of LFMs.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Categories: Engineering Jobs Leadership Jobs Machine Learning Jobs

Tags: APIs GPU Machine Learning Model inference Pipelines Python PyTorch Research TensorRT vLLM

Region: North America

Country: United States
