Data Scientist/Machine Learning Engineer

Bengaluru, India

Hitachi Vantara

Driving digital innovation with advanced storage solutions, partnerships and eco-friendly storage technologies.

View all jobs at Hitachi Vantara

Apply now Apply later

Our Company

We’re Hitachi Vantara, the data foundation trusted by the world’s innovators. Our resilient, high-performance data infrastructure means that customers – from banks to theme parks ­– can focus on achieving the incredible with data.   

If you’ve seen the Las Vegas Sphere, you’ve seen just one example of how we empower businesses to automate, optimize, innovate – and wow their customers. Right now, we’re laying the foundation for our next wave of growth.  We’re looking for people who love being part of a diverse, global team – and who get excited about making a real-world impact with data.

The Role

We are looking for a skilled Data Scientist/ML Engineer with a good background in statistical and machine learning methods, with a specialized focus on natural language processing (NLP). The ideal candidate needs to have good knowledge of model training, retraining and optimization and will proactively collaborate in productionising end-to-end machine learning solutions at scale.

 

What you’ll bring

Understand business and product needs and use classical ML methods or advanced AI techniques to solve them at scale
Design, train, and fine-tune machine learning models for information extraction (PII extraction), and evaluate model performance using relevant metrics and iteratively improve the models.
• Communicate and collaborate with engineering/cross-functional teams to implement a feedback mechanism to optimise the models by training, tuning and evaluating them on a timely basis
What you will need:
• Bachelors Degree in Engineering or equivalent, with 5-8 years of experience in solving industry problems using statistical, traditional ML and AI methods with proven experience in developing machine learning or NLP models, particularly for information extraction tasks
• Enthusiastic learner and skilled in both theory and practice of basic statistical methods such as regression, clustering, general ML algorithms such as SVM, tree-based methods, neural networks etc. for solving supervised and unsupervised problems.
Experience working on advanced NLP methods like feature extraction, tagging and entity recognition and classification to identify sensitive information that comply with relevant data privacy regulations from structured and unstructured data sources.
• Understanding of data privacy regulations and best practices.
• Good Understanding of evaluation metrics specific to NLP tasks.
Skills in working with LLMs through prompt engineering, fine-tuning pre-trained models on specific datasets for targeted applications, etc would be good to have
• Work closely with data engineers, software developers, and other stakeholders to integrate the solutions into existing systems with systemic feedback and continuous training and optimization.

 

Technical Skills:
Proficiency in programming languages such as Python
• Experience with machine learning frameworks (e.g., Sklearn, TensorFlow, PyTorch).
• Experience with NLP libraries and frameworks (e.g., spaCy, NLTK, Hugging Face Transformers).
• Good to have: Familiarity with libraries and frameworks commonly used for LLMs (e.g., Hugging Face Transformers, LlamaIndex, LangChain etc).

  About us   We’re a global team of innovators. Together, we harness engineering excellence and passion for insight to co-create meaningful solutions to complex challenges. We turn organizations into data-driven leaders that can a make positive impact on their industries and society. If you believe that innovation can inspire the future, this is the place to fulfil your purpose and achieve your potential.     #LI-SP7

Championing diversity, equity, and inclusion   

Diversity, equity, and inclusion (DEI) are integral to our culture and identity. Diverse thinking, a commitment to allyship, and a culture of empowerment help us achieve powerful results. We want you to be you, with all the ideas, lived experience, and fresh perspective that brings. We support your uniqueness and encourage people from all backgrounds to apply and realize their full potential as part of our team.   

How we look after you  

We help take care of your today and tomorrow with industry-leading benefits, support, and services that look after your holistic health and wellbeing. We’re also champions of life balance and offer flexible arrangements that work for you (role and location dependent). We’re always looking for new ways of working that bring out our best, which leads to unexpected ideas. So here, you’ll experience a sense of belonging, and discover autonomy, freedom, and ownership as you work alongside talented people you enjoy sharing knowledge with.   

We’re proud to say we’re an equal opportunity employer and welcome all applicants for employment without attention to race, colour, religion, sex, sexual orientation, gender identity, national origin, veteran, age, disability status or any other protected characteristic.Should you need reasonable accommodations during the recruitment process, please let us know so that we can do our best to set you up for success.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  2  0  0

Tags: Classification Clustering Engineering LangChain LLMs Machine Learning ML models Model training NLP NLTK Privacy Prompt engineering Python PyTorch Scikit-learn spaCy Statistics TensorFlow Transformers Unstructured data

Perks/benefits: Career development Equity / stock options Flex hours Health care Startup environment

Region: Asia/Pacific
Country: India

More jobs like this