Generative AI Engineer

Mumbai, Maharashtra, India

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Sia

Sia is a new kind of management consulting group. We were born digital, and our work is augmented by data science, enhanced by creativity and driven by responsibility.

View all jobs at Sia

Apply now Apply later

Company Description

Sia is a next-generation, global management consulting group. Founded in 1999, we were born digital. Today our strategy and management capabilities are augmented by data science, enhanced by creativity and driven by responsibility. We’re optimists for change and we help clients initiate, navigate and benefit from transformation. We believe optimism is a force multiplier, helping clients to mitigate downside and maximize opportunity. With expertise across a broad range of sectors and services, our consultants serve clients worldwide. Our expertise delivers results. Our optimism transforms outcomes. 

Heka.ai is the independent brand of Sia Partners dedicated to AI solutions. We host many AI-powered SaaS solutions that can be combined with consulting services or used independently, to provide our customers with solutions at scale.  

Job Description

We are seeking a skilled Generative AI Engineer to join our team where you will harness model capabilities to implement cutting-edge algorithms and solutions across myriad industries.   

You will serve as a pivotal link between Data Scientists, ML and Platform Engineers to unleash the potential of Generative AI technology by implementing business-centric solutions. You will help customers find the appropriate level of refinement among semantic search, RAG, agents, and ultimately fine-tuning to reach their value delivery threshold in the most cost-effective way.  

Beyond crafting prompts, you will be responsible for designing and building robust and scalable products starting with benchmarks of candidate FMs through targeted requests, rapidly iterating prototypes, and validating product ideas. Your expertise in orchestrating the entire AI workflow will ensure the seamless integration of advanced models' capabilities into applications, optimizing performance, security, compliance, scalability, and efficiency. You will competently navigate between prompts, chains, and agents while mastering the underlying infrastructure challenges. 

We invest in your success through comprehensive training, combining internal programs with resources from our technology partners.  

Join us if you are passionate about pushing the boundaries of AI technology and making a significant impact in enabling our customers to create GenAI-powered applications with confidence and a fast time to market. 

Key Responsibilities 

You are part of a cross-functional consulting team that drives the adoption of Generative AI in every imaginable sector, working step-by-step with customers to understand business requirements to design then build bespoke GenAI solutions. 

  • Build applications powered by LLMs (OpenAI, Claude, Mistral, etc.) using LangChain, LlamaIndex, and related GenAI frameworks. 
  • Implement RAG pipelines with vector DBs (Pinecone, FAISS, pgvector, ChromaDB) for grounding LLM responses with internal knowledge 
  • Develop multimodal AI solutions (text, audio, image) and build autonomous agents where relevant. 
  • Drive MLOps excellence: CI/CD (ML pipelines), drift detection, canary releases, retraining schedules. 
  • Design robust and reusable prompt templates using CoT, ReAct, Graph-of-Thought, and Agent flows. 
  • Continuously improve model reliability, relevance, and UX by tuning prompt flows 
  • Deploy GenAI models on AWS/GCP/Azure using services like SageMaker, Bedrock, Vertex AI 
  • Ensure performance observability, security guardrails, and compliance (GDPR, Responsible AI) 
  • Work with DevOps teams to integrate GenAI solutions into microservices and APIs (FastAPI/Flask) 
  • Benchmark open-source and commercial LLMs for use-case fit and cost-performance tradeoffs 
  • Evaluate fine-tuning strategies (PEFT, LoRA, RLHF) where applicable for proprietary use cases 
  • Support solution architects and cross-functional teams in delivering PoCs and enterprise-grade rollouts 
  • Document frameworks, best practices, risks, and learnings for future scaling 

Qualifications

Qualifications :

  • Education: Bachelor’s/master's degree in computer science, AI , or a related field. 
  • Experience: 5+ years of experience in NLP/ML/AI with at least 3 year hands-on in GenAI. 

Skills

  • Strong coding skills in Python with frameworks like PyTorch, Hugging Face, LangChain, and LlamaIndex. 
  • Proven experience with cloud-based AI services (AWS/GCP/Azure) and APIs (OpenAI, Anthropic, Hugging Face). 
  • Experience with vector databases: Qdrant, pgvector, Pinecone, FAISS, Milvus, or Weaviate. 
  • Familiarity with prompt engineering, transformer architectures, and embedding techniques. 
  • Excellent communication skills, with the ability to convey complex technical concepts to both highly technical and also non-technical stakeholders. 
  • Sharp problem-solving skills. 
  • Ability to collaborate with diverse teams. 

Additional Information

What We Offer 

  • Opportunity to lead cutting-edge AI projects in a global consulting environment. 

  • Leadership development programs and training sessions at our global centers. 

  • A dynamic and collaborative team environment with diverse projects. 

Position based in Mumbai (hybrid)

Sia is an equal opportunity employer. All aspects of employment, including hiring, promotion, remuneration, or discipline, are based solely on performance, competence, conduct, or business needs. 

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  2  0  0

Tags: Anthropic APIs Architecture AWS Azure CI/CD Claude Computer Science Consulting DevOps Engineering FAISS FastAPI Flask GCP Generative AI LangChain LLMs LoRA Machine Learning Microservices MLOps NLP OpenAI Open Source Pinecone Pipelines Prompt engineering Python PyTorch RAG React Responsible AI RLHF SageMaker Security UX Vertex AI Weaviate

Region: Asia/Pacific
Country: India

More jobs like this