Technical Program Manager

USA - Remote, United States

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Apply now Apply later

While technology is the heart of our business, a global and diverse culture is the heart of our success. We love our people and we take pride in catering them to a culture built on transparency, diversity, integrity, learning and growth.


If working in an environment that encourages you to innovate and excel, not just in professional but personal life, interests you- you would enjoy your career with Quantiphi!

About Quantiphi:

Quantiphi is an award-winning Applied AI and Big Data software and services company, driven by a deep desire to solve transformational problems at the heart of businesses. Our signature approach combines groundbreaking machine learning research with disciplined cloud and data-engineering practices to create breakthrough impact at unprecedented speed.

Company Highlights:

Quantiphi has seen 2.5x growth YoY since its inception in 2013, we don’t just innovate - we lead. Headquartered in Boston, with 4,000+ Quantiphi professionals across the globe. As an Elite/Premier Partner for Google Cloud, AWS, NVIDIA, Snowflake, and others, we’ve been recognized with:

  • 17x Google Cloud Partner of the Year awards in the last 8 years.
  • 3x AWS AI/ML award wins.
  • 3x NVIDIA Partner of the Year titles.
  • 2x Snowflake Partner of the Year awards.
  • We have also garnered top analyst recognitions from Gartner, ISG, and Everest Group.
  • We offer first-in-class industry solutions across Healthcare, Financial Services, Consumer Goods, Manufacturing, and more, powered by cutting-edge Generative AI and Agentic AI accelerators.
  • We have been certified as a Great Place to Work for the third year in a row- 2021, 2022, 2023.

Be part of a trailblazing team that’s shaping the future of AI, ML, and cloud innovation. Your next big opportunity starts here!

Work Location: Bedminster, NJ or Dallas, TX

Responsibilities:

  • Lead AI/ML program execution, ensuring timely delivery of scalable, production-grade RAG/LLM/Agentic solutions.
  • Define program roadmaps through PI planning sessions, milestones, and deliverables for AI-driven initiatives across multiple teams.
  • Manage LLM infrastructure, GPU optimization, AI inferencing pipelines, and large-scale model deployment strategies.
  • Oversee the implementation of RAG, Agentic Workflows, multi-agent LLM systems, and Retrieval-augmented QA pipelines.
  • Managing client engagement and delivery per terms of the contract expectations.
  • Manage project delivery, team and ensure positive customer relations.
  • Drive project margins optimization using Gen AI based tools, accelerators.
  • Collaborate with our diverse and global teams to deliver committed results to our clients.
  • Lead AI-driven engagements, ensuring alignment with business goals, technical feasibility, and governance frameworks.
  • Develop and execute strategic roadmaps for LLM-based solutions, including RAG (Retrieval-Augmented Generation), Agentic RAG, and Agent-driven workflows.
  • Manage cross-functional teams, including ML engineers, data scientists, software developers, and consultants to deliver AI solutions.
  • Collaborate with stakeholders to define technical architecture, infrastructure requirements, and optimization techniques.
  • Implement scalable AI agent architectures, ensuring integration with LangChain, NVIDIA NeMo, and Triton Inference Server.
  • Track project performance, set KPIs, and provide executive-level reporting on outcomes and ROI.
  • Guide AI model evaluation, MLOps pipeline integration, and fine-tuning strategies for scalable AI solutions.
  • Support AI compliance strategies, ensuring alignment with data privacy, security, and responsible AI practices.

Skill Set Required:

  • More than 8 years of program management experience.
  • Strong leadership and multi-stakeholder management skills.
  • Multi-Workstream Project Management ensuring customer success & account growth.
  • Maintaining positive work environment & ensure career growth of the team members.
  • Tight Delivery execution and reporting to senior management at client organization and at Quantiphi.
  • Mentoring team members for career progression & upskilling to drive better solution outcomes.
  • Team leading experience and ability & experience to work as project lead.
  • Excellent Communication, presentation & storytelling skills.
  • Must have experience with Cloud GCP or AWS or Azure (LLM hosting, GPU-based inference, cost optimization).
  • Experience managing large-scale AI projects leveraging LLMs (e.g., Llama, GPT, Claude, Mistral).
  • Strong expertise in RAG, Agentic RAG, AI Agents, Vector DBs (e.g., FAISS, Pinecone, Weaviate, ChromaDB).
  • Knowledge of LLM-based fine-tuning techniques, Low-Rank Adaptation (LoRA), Quantization (AWQ, GPTQ, FP8, INT4).
  • Familiarity with Multi-GPU parallelization, model pruning, and knowledge distillation.
  • Understanding of Governance frameworks (e.g., AI Ethics, Explainability, Risk Mitigation).
  • Proficiency in NVIDIA NeMo, Triton Inference Server, and LangChain for agentic workflows.

What is in it for you:

  • Be part of a team and company that has won NVIDIA's AI Services Partner of the Year three times in a row with an unparalleled track record of building production AI applications on DGX and Cloud GPUs.
  • Strong peer learning which will accelerate your learning curve across Applied AI, GPU Computing and other softer aspects such as technical communication.
  • Exposure to working with highly experienced AI leaders at Fortune 500 companies and innovative market disruptors looking to transform their business with Generative AI.
  • Access to state-of-the-art GPU infrastructure on the cloud and on-premise.
  • Be part of the fastest-growing AI-first digital transformation and engineering company in the world

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0
Category: Leadership Jobs

Tags: Architecture AWS Azure Big Data Claude Engineering Excel FAISS GCP Generative AI Google Cloud GPT GPU KPIs LangChain LLaMA LLMs LoRA Machine Learning MLOps Model deployment Pinecone Pipelines Privacy RAG Research Responsible AI Security Snowflake Weaviate

Perks/benefits: Career development

Regions: Remote/Anywhere North America
Country: United States

More jobs like this