LLM Backend Engineer
Mumbai, India
Healf
Healf connects you to the world’s best wellbeing brands, offering tools and rituals to empower your wellbeing.
Build the Future of Wellbeing
Do Your Life’s Best Work
If modern wellbeing were redesigned from scratch, it wouldn’t live in a GP’s office or a cluttered supplement aisle. It would be digital-first, beautifully curated, and powered by data that actually helps you feel your best.
That’s what we’re building at Healf—an ecommerce platform at the intersection of personalised health and curated wellbeing. We connect customers with the world’s most effective products across EAT MOVE MIND and SLEEP, and we’re just getting started.
We combine culture-shaping storytelling, cutting-edge health tech (like our new blood testing platform, Healf Zone), and a best-in-class product experience to help people build rituals that work.
Backed by investors behind Soho House, Alo Yoga, Cult Beauty, and Innocent, we’re scaling fast—and redefining how the world shops and lives well.
The Role
We’re looking for a backend engineer with deep Python expertise and a curiosity for pushing the boundaries of LLM-powered applications. In this role, you’ll design and deploy scalable FastAPI services, integrate cutting-edge LLM workflows (think embeddings, RAG pipelines, prompt engineering), and architect secure AWS infrastructure. You’ll work closely with product teams to bring experimental AI features into production—fast, clean, and cost-aware. If you thrive in high-ownership environments and enjoy building systems that blend backend engineering with the future of AI, we’d love to meet you.
📍 Location: Remote — Open to candidates based in Pakistan or India
Key Responsibilities
Build high‑throughput, async APIs and micro‑services in Python using FastAPI.
Integrate large‑language‑model workflows (prompt engineering, embeddings, RAG pipelines) with providers like OpenAI, Anthropic, or custom models.
Design scalable, secure AWS infrastructure (ECS/EKS, Lambda, ElastiCache, S3, DynamoDB) with infrastructure‑as‑code.
Implement observability for AI and API services—structured logging, tracing, latency/cost dashboards, guardrails.
Automate blue‑green or canary deployments through containerised CI/CD pipelines (GitHub Actions, ArgoCD).
Collaborate with product teams to turn experimental AI-related proofs‑of‑concept into production‑ready features.
Qualifications
5+ years of backend Python experience, including 1+ year shipping LLM‑powered applications.
Mastery of async Python patterns (asyncio, uvicorn, pydantic v2) and strong typing discipline.
Hands‑on experience architecting cloud‑native, horizontally scalable systems on AWS.
Familiarity with vector databases/search (PGVector, ChromaDB, Pinecone) and streaming token handling.
Solid grounding in DevOps practices—IaC, automated testing, security scanning, performance profiling.
Nice to Have
Experience with LangChain, Agents-SDK, etc.
An understanding of machine learning and, ideally, some experience training models.
Why Join Healf?
- Do your life’s best work – Build something that matters, with a team that moves fast and aims high
- Surround yourself with A+ talent – You’ll work with high-performers who care deeply and raise the standard every day
- Be a builder – This isn’t a cog-in-the-machine role. You’ll help shape our voice, culture, and growth
- Wellbeing is the lifestyle – From office yoga to Healf Zone insights, everything we do is rooted in our pillars: EAT MOVE MIND SLEEP
Tags: Anthropic APIs AWS CI/CD DevOps DynamoDB E-commerce ECS Engineering FastAPI GitHub Lambda LangChain LLMs Machine Learning OpenAI Pinecone Pipelines Prompt engineering Python RAG Security Streaming Testing
Perks/benefits: Career development Yoga