Generative AI Engineer

Gurugram, India

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Inizio Advisory

Inizio unlocks the value of your healthcare innovation by connecting best-in-class strategic, analytic and creative capabilities.

View all jobs at Inizio Advisory

Apply now Apply later

 

Role Overview

We are looking for highly skilled with 4 to 5 years experienced Generative AI Engineer to design and deploy enterprise-grade GenAI systems. This role blends platform architecture, LLM integration, and operationalization—ideal for engineers with strong hands-on experience in large language models, RAG pipelines, and AI orchestration.

Responsibilities

  • Platform Leadership: Architect GenAI platforms powering copilots, document AI, multi-agent systems, and RAG pipelines.
  • LLM Expertise: Build/fine-tune GPT, Claude, Gemini, LLaMA 2/3, Mistral; deep in RLHF, transformer internals, and multi-modal integration.
  • RAG Systems: Develop scalable pipelines with embeddings, hybrid retrieval, prompt orchestration, and vector DBs (Pinecone, FAISS, pgvector).
  • Orchestration & Hosting: Lead LLM hosting, LangChain/LangGraph/AutoGen orchestration, AWS SageMaker/Bedrock integration.
  • Responsible AI: Implement guardrails for PII redaction, moderation, lineage, and access aligned with enterprise security standards.
  • LLMOps/MLOps: Deploy CI/CD pipelines, automate tuning/rollout, handle drift, rollback, and incidents with KPI dashboards.
  • Cost Optimization: Reduce TCO via dynamic routing, GPU autoscaling, context compression, and chargeback tooling.
  • Agentic AI: Build autonomous, critic-supervised agents using MCP, A2A, LGPL patterns.
  • Evaluation: Use LangSmith, BLEU, ROUGE, BERTScore, HIL to track hallucination, toxicity, latency, and sustainability.

Skills Required

  • 4–5 years in AI/ML (2+ in GenAI)
  • Strong Python, PySpark, Scala; APIs via FastAPI, GraphQL, gRPC
  • Proficiency with MLflow, Kubeflow, Airflow, Prompt flow
  • Experience with LLMs, vector DBs, prompt engineering, MLOps
  • Solid foundation in applied mathematics & statistics

Nice to Have

  • Open-source contributions, AI publications
  • Hands-on with cloud-native GenAI deployment
  • Deep interest in ethical AI and AI safety

2 Days WFO Mandatory

 

Don't meet every job requirement? That's okay! Our company is dedicated to building a diverse, inclusive, and authentic workplace. If you're excited about this role, but your experience doesn't perfectly fit every qualification, we encourage you to apply anyway. You may be just the right person for this role or others.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0

Tags: Airflow APIs Architecture AWS CI/CD Claude Engineering FAISS FastAPI Gemini Generative AI GPT GPU GraphQL Kubeflow LangChain LLaMA LLaMA2 LLMOps LLMs Machine Learning Mathematics MLFlow MLOps Open Source Pinecone Pipelines Prompt engineering PySpark Python RAG Responsible AI RLHF SageMaker Scala Security Statistics

Region: Asia/Pacific
Country: India

More jobs like this