AI Data Scientist - Voice & Multimodal AI

Bengaluru, Karnataka, India

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Apply now Apply later

About Apna:

Apna is India’s largest jobs and professional networking platform for frontline workers. We’re building the infrastructure to power hiring, skill-building, and career growth for 300 million+ working Indians. As we expand our AI-first platform across voice, text, and multimodal workflows — we’re looking for a bold and curious AI Data Scientist who wants to shape the future of applied Gen AI.

Requirement: 1

Location: Bengaluru (Work from Office - Domlur)

Team: AI & Machine Learning

Experience: 3–5 years

Requirements

What You'll do:

  • Fine-tune and deploy LLMs, TTS, STT, and voice models for use in real-time conversations with millions of users.
  • Convert unstructured, messy real-world audio/text data into clean, high-quality datasets for training and evaluation.
  • Build inference pipelines optimized for low-latency, high-accuracy voice agents and multimodal interfaces.
  • Work closely with infra and product teams to ship production-grade GenAI models with observability, fallback, and monitoring.
  • Experiment with GANs, diffusion models, audio generation, and multimodal fusion to power next-gen AI agents.
  • Own the full model lifecycle — from research and training to deployment, testing, and iteration.

What we're Looking for:

  • 3–5 years of hands-on experience in AI / ML roles, ideally in startups or product-driven teams.
  • Strong grasp of LLM fine-tuning, instruction tuning, or pretraining techniques.
  • Familiarity with TTS/STT systems, Whisper, Tacotron, VITS, or commercial tools like ElevenLabs.
  • Experience with multimodal architectures, generative audio, GANs, or diffusion-based models.
  • Ability to work with real-world messy data, design training pipelines, and debug model failure modes.
  • Fluency in frameworks like PyTorch, HuggingFace, TensorFlow, and ecosystem tools (ONNX, Triton, LangChain, etc.).
  • Passion for building high-impact AI features that ship to real customers.

Benefits

Why Join Us:

  • Work at the cutting edge of LLMs, voice AI, and generative models — and ship real products, not just prototypes.
  • Directly impact millions of users by powering AI agents that help with hiring, learning, and career growth.
  • Collaborate with a world-class team of AI engineers, researchers, and product minds who move fast and ship boldly.
  • Freedom to explore: Own experiments, propose architecture, or contribute to foundational model training.
  • Startup speed, enterprise scale — best of both worlds. Rapid iteration and direct customer feedback.
  • Multilingual India - first problems that push the boundaries of speech, reasoning, and personalization.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0

Tags: Architecture Diffusion models GANs Generative AI Generative modeling HuggingFace LangChain LLMs Machine Learning Model training ONNX Pipelines PyTorch Research TensorFlow Testing

Perks/benefits: Career development Startup environment

Region: Asia/Pacific
Country: India

More jobs like this