ML Engineer (LLM)
Berlin
- Remote-first
Synthflow AI
Create custom AI phone call agents effortlessly with Synthflow. No coding or tech skills needed, just your data and ideas for powerful automation.
Synthflow AI is a no-code platform for deploying voice AI agents that automate phone calls across contact center operations and business process outsourcing (BPO) at scale. We help mid-market and enterprise companies manage routine calls to save teams time and resources.
Our agents have already delivered measurable impact:
Over 5 million hours of contact center operations saved
35% more calls answered compared to non-AI operators
45 million calls handled with 99.9% uptime
Backed by Accel, Atlantic Labs, and Singular and trusted by over 1,000 customers, our growth leads an industry shift toward sophisticated and accessible conversational AI.
The Role
We are looking for a hands-on ML Engineer who lives at the intersection of TTS, STT and large language models. You will design and ship new low-latency voice capabilities, working closely with product, research and infrastructure teams to push the boundaries of natural, multilingual conversation.
What You'll Do
Architect & implement real-time speech pipelines (ASR → LLM → TTS) that meet stringent latency and quality targets.
Evaluate and fine-tune state-of-the-art ASR, LLM and TTS models, both commercial and open-source, and integrate the best performers into production.
Optimise inference through quantisation, distillation, hardware-aware graph compilation and reinforcement-learning-based tuning.
Expose scalable APIs & micro-services with Python/FastAPI, gRPC or WebSocket streaming, backed by robust observability and autoscaling.
Own deployment across cloud and on-prem environments, collaborating on containerisation (Docker), orchestration (Kubernetes) and CI/CD workflows.
Stay ahead of the curve by tracking research, running experiments and sharing learnings with the broader team.
What we're looking for
Python Engineering: 5+ years writing production-grade, well-tested Python; deep familiarity with async, typing and performance profiling
Speech / Audio: Hands-on experience building real-time ASR, TTS, voice chat or streaming audio products
LLM Tooling: Fine-tuning, prompt design, evaluation, retrieval-augmented generation; familiarity with frameworks such as Openpipe/ART, LangChain, LlamaIndex or similar
Systems & MLOps: Containerisation, GPU scheduling, observability, DevOps on GCP or AWS; infrastructure-as-code principles
API Design: Building and maintaining high-throughput REST/gRPC/FastAPI services; securing and monitoring them in production
Bonus Points
Model compression expertise (quantisation, pruning, ONNX/TensorRT)
Knowledge of audio and acoustics
Experience with reinforcement-learning-from-human-feedback (RLHF) or direct preference optimisation
Contributions to open-source ML/speech projects (share your GitHub!)
Familiarity with GPU inference servers (Triton, KServe) or distributed compute frameworks (Ray)
Founded in Berlin in 2023 by serial entrepreneurs Albert Astabatsyan, Hakob Astabatsyan, and Sassun Mirzakhan-Saky, Synthflow AI democratizes access to advanced voice AI with a no-code platform that lets enterprises easily create, deploy and scale natural-sounding, cost-effective voice agents tailored to their business needs.
Tags: APIs ASR AWS CI/CD Conversational AI DevOps Docker Engineering FastAPI GCP GitHub GPU KServe Kubernetes LangChain LLMs Machine Learning MLOps ONNX Pipelines Python Research RLHF Streaming TensorRT
Perks/benefits: Career development