Data Scientist
Mumbai - Worli, IN
Mahindra Group
A technology and innovation-led global federation of companies that provides a wide range of products, services, and possibilities, enabling people to Rise.
Responsibilities
- Design and implement scalable, cloud-based data solutions using AWS, Azure, or GCP (mandatory).
- Build, optimize, and maintain ETL/ELT pipelines for efficient data processing.
- Manage and design cloud-based data storage systems such as data warehouses and data lakes.
- Ensure data quality, security, and compliance in cloud environments.
- Implement real-time and batch data processing frameworks.
- Analyze large datasets to uncover trends, patterns, and actionable insights.
- Develop predictive, prescriptive, and descriptive models using advanced machine learning techniques.
- Build, train, and deploy Generative AI models (e.g., transformers, GANs, diffusion models) for tasks like text generation, image synthesis, and more.
- Implement GenAI pipelines using libraries and frameworks such as Hugging Face, OpenAI, or PyTorch.
- Continuously monitor, optimize, and maintain GenAI models in production.
- Create data-driven solutions for business problems through experimentation and analysis.
Skills and Qualifications
Technical Skills
- Cloud Expertise (Mandatory): Proficiency in at least one cloud platform (AWS, GCP, or Azure).
- Programming: Advanced Python skills with experience in libraries such as Pandas, NumPy, Scikit-learn, TensorFlow, PyTorch, or Hugging Face.
- Data Engineering: Expertise in SQL and NoSQL databases, data modeling, and big data tools like Spark.
- GenAI:
  - Deep understanding of transformer architectures (e.g., GPT, BERT, T5).
  - Experience with training and fine-tuning large language models (LLMs).
  - Proficiency in tools like Hugging Face Transformers, LangChain, and OpenAI APIs.
- DevOps for AI/ML: Knowledge of CI/CD for machine learning and GenAI pipelines, model versioning, and monitoring.
Analytical Skills
- Strong knowledge of statistics, probability, and machine learning algorithms.
- Ability to design experiments and implement hypothesis-driven approaches.
- Proficiency in handling unstructured, structured, and semi-structured data.
Soft Skills
- Strong problem-solving and critical-thinking abilities.
- Excellent communication and collaboration skills with a stakeholder-focused approach.
- Ability to adapt to rapidly evolving technology trends and business needs.
Experience
- Mandatory: 3-7 years of experience in roles involving cloud platforms, data science, Python development, and GenAI models.
- Proven track record of designing and deploying scalable cloud-based data solutions and delivering impactful AI/ML and GenAI models.
This role is ideal for professionals passionate about leveraging GenAI models and cloud technologies to drive innovation and deliver transformative business solutions.