Python Developer - LLM/AI Agents
Vietnam
Unlike traditional outsourcing, our approach goes beyond executing tasks—we advise, challenge, and co-create solutions that enable businesses to scale effectively. Whether it’s expanding into new markets, streamlining operations, or driving digital transformation, we are committed to delivering impact that lasts.
Why NFQ?
🔹 Fairness – We treat our colleagues, clients, shareholders, and the environment with integrity and respect.
🔹 Entrepreneurship – We think like entrepreneurs, taking smart risks and turning ideas into real impact.
🔹 Excellence – We go beyond expectations to deliver results that truly make a difference.
We’re hiring a Python Developer to help us build intelligent, autonomous AI agents using Large Language Models (LLMs). You'll design and orchestrate complex agent behaviors using LangChain, while operating models across cloud, local, and edge environments. This role bridges cutting-edge LLM research with real-world deployment — from prompt engineering and fine-tuning to monitoring and observability.
You’ll work at the frontier of LLMs, agent architecture, and real-world deployment.
In this role, you will
- Architect and implement modular LLM-driven agents that handle reasoning, planning, and tool usage.
- Operate and integrate LLMs across environments — from cloud-hosted models (e.g., OpenAI, Anthropic, Azure, Bedrock) to local/edge deployments (e.g., llama.cpp, vLLM, Ollama).
- Fine-tune and customize open-source LLMs using LoRA, QLoRA, PEFT, and other efficient tuning strategies.
- Monitor, evaluate, and debug agent performance using LangSmith, OpenInference, Promptfoo, or custom observability tools.
- Build retrieval-augmented generation (RAG) pipelines using vector stores like FAISS, Chroma or Pinecone.
- Orchestrate queries
- Collaborate with ML engineers, DevOps, and product teams to move prototypes into stable, production-ready systems.
What you will bring
- Strong experience in Python 3.x, with a focus on modular, well-tested code.
- Hands-on experience with LLMs in the cloud (OpenAI, Azure OpenAI, Claude, Bedrock, Hugging Face Inference Endpoints, etc.).
- Experience operating LLMs locally/on-device (e.g., using Ollama, llama.cpp, vLLM, GGUF/GPTQ models).
- Solid understanding of LLM agent frameworks (LangChain, CrewAI, AutoGPT) and prompt chaining.
- Practical experience fine-tuning models (e.g., using Hugging Face Transformers, PEFT, LoRA/QLoRA).
- Proficiency in using observability and evaluation tools (e.g., LangSmith, Promptfoo, TruLens) to trace, log, and improve performance.
- Experience with embeddings, vector databases, and contextual memory systems.
Nice to Have:
- Experience deploying LLMs to Kubernetes, serverless platforms, or edge devices (e.g., Jetson, mobile, Raspberry Pi).
- Familiarity with quantization and optimization for low-latency inference.
- Experience building or contributing to multi-agent systems or long-context memory chains.
- Experience with model monitoring, alerting, and auto-evaluation pipelines in production.
Why you will love working here
- 🏆 Join Vietnam’s Best IT Company – Recognized by ITViec for 7 consecutive years, including 2 successive years as the Winner. Work with some of the best minds in the industry and be part of a company that’s redefining how businesses scale through technology.
- 🌍 Career Growth & Leadership Development – Work closely with NFQ’s leadership team, gain mentorship from experienced executives, and have direct exposure to high-level strategic decisions. Your growth is limitless: as long as you’re ready to step up, opportunities will always be there for you.
- 💰 Competitive Compensation – We believe great talent deserves great rewards. Expect an attractive salary, performance-based bonuses, and a benefits package that reflects your impact. We value talent over salary budgets—exceptional contributions deserve exceptional rewards.
- ✨ And Many More Benefits to Explore! Most importantly, a healthy work-life balance and an environment where you can thrive, professionally and personally. Including:
  - A laptop is provided.
  - Community Tech activities.
  - A fun & dynamic environment and freedom to be creative.
  - Modern office with a flexible relaxing zone.
  - 13th-month salary pro-rata (based on business situation/performance).
  - Performance reviews twice a year.
  - Extra Premium Healthcare & Annual Health-check.
  - 15 days of annual leave.
Working time: Monday – Friday (9 AM – 6 PM)
Locations: NFQ welcomes you to our offices in Vietnam. Choose the location that suits you best and join us on a journey of innovation and excellence.
- Ho Chi Minh office: Unit No.PF-08 on Podium Floor, Sapphire Tower 2, 92 Nguyen Huu Canh street, Ward 22, Binh Thanh District.
- Da Nang office: 23rd Floor, G8 Golden Tower, 65 Hai Phong, Thach Thang Ward, Hai Chau District.
- Ha Noi office: 10th Floor, Hancorp Plaza, 72 Tran Dang Ninh Street, Dich Vong Ward, Cau Giay District.
- Can Tho office: No. 47, 30/4 Street, An Lac Ward, Ninh Kieu District.
Tags: Anthropic Architecture Azure Claude Consulting DevOps Engineering FAISS Kubernetes LangChain LLaMA LLMs LoRA Machine Learning OpenAI Open Source Pinecone Pipelines Prompt engineering Python RAG Research Transformers vLLM
Perks/benefits: Career development Competitive pay Flex hours Gear Health care Startup environment