Principal Software Engineer - AI Platforms & Communications
Weave - Headquarters (Lehi, UT)
Weave
Weave brings together a world-class phone system and a suite of communication tools, so it's easy to automate more tasks, keep schedules full, get paid faster, collect more reviews, and much more.
Weave is searching for a Principal Engineer who can set technical vision, drive cross‑team execution, and mentor engineers while solving hard, high‑scale problems. You will architect and evolve the AI platform that powers our voice, text, and email products, bringing together LLMs, retrieval‑augmented generation (RAG), vector search, and multi‑tenant data pipelines to help millions of providers serve hundreds of millions of patients every day.
This position is fully remote within the US, with the option to work from our Lehi, UT headquarters.
This position reports to: SVP of Engineering
What you’ll own
Set architecture & strategy for multi‑tenant, AI‑powered communications services (voice, SMS, email).
Design and build RAG pipelines, vector‑store–backed retrieval layers, and fine‑tuning workflows that deliver low‑latency, context‑aware experiences.
Lead end‑to‑end platform initiatives: data ingestion, event processing, model hosting, continuous evaluation, and cost/latency optimization.
Champion best practices for LLM safety, prompt management, experiment tracking, observability, and auto‑scaling.
Influence roadmaps across Product, Design, and multiple engineering squads; unblock teams with reusable SDKs, templates, and reference architectures.
Mentor and level up senior and staff engineers; foster a culture of ownership, experimentation, and inclusive collaboration.
What you’ll need to accomplish the job
10+ years building distributed, customer‑facing systems; 3+ years designing ML or LLM production workloads at scale.
Proven success with LLMs, RAG architectures, vector databases (e.g., Pinecone, Weaviate, pgvector), embedding models, and fine‑tuning techniques.
Deep knowledge of multi‑tenant SaaS patterns, data isolation, and usage‑based cost modeling.
Fluency in at least one of Go, Python, or Java (Go/Python preferred) plus modern cloud stacks (GCP or AWS).
Experience integrating or operating communications stacks—SIP/WebRTC for voice, SMS gateways, and transactional email services.
Expertise in Kubernetes, IaC/GitOps, observability, and CI/CD for high‑compliance environments (HIPAA/PCI).
Track record of shipping large, cross‑org initiatives on time and mentoring others to do the same.
What will make us love you
Prior work with conversation intelligence, speech‑to‑text, or generative voice synthesis.
Hands‑on experience with Dagster/Kubeflow, Triton, or similar model‑ops tooling.
Contributions to open‑source AI/ML or communications projects.
Weave is an equal opportunity employer that is committed to fostering an inclusive workplace where all individuals are valued and supported. We welcome anyone who is hungry to learn, problem-solve and progress regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other applicable legally protected characteristics. If you have a disability or special need that requires accommodation, please let us know.
All official correspondence will occur through Weave-branded email. We will never ask you to share bank account information, cash a check from us, or purchase software or equipment as part of your interview or hiring process.
Tags: Architecture AWS CI/CD Dagster Data pipelines Engineering GCP Java Kubeflow Kubernetes LLMs Machine Learning Pinecone Pipelines Python RAG Weaviate