Senior Data Scientist I
India - (Home based)
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Elsevier
Elsevier is a global information analytics company that helps institutions and professionals progress science, advance healthcare and improve performanceSenior Data Scientist I
Are you interested in working with data and analytics to solve problems?
Are you interested in bringing your Gen AI, Machine Learning and NLP expertise to projects?
About our Team:
Data Science Health Content Operations team works with a focus on Generative AI, Machine Learning, Natural Language Processing, and Statistical techniques. It helps in building state of the art applications for the health sciences domain.
About the Role:
As a Senior Data Scientist, you will play a pivotal role in the development and deployment of cutting-edge Generative AI models and solutions. You will be responsible for building, testing, and maintaining our Generative AI, Retrieval Augmented Generation (RAG) and Natural Language Processing (NLP) solutions. This includes evaluating their performance and implementing guardrails to ensure ethical and responsible use of AI technologies. You will engage in the entire life cycle of data science projects, including design, implementation, evaluation, productionisation and ongoing enhancement. A key focus of your work will be on the customization and optimization of existing RAG pipelines to support applications that involve content ingestion, machine translation, and contextualized information retrieval. Experience with end-to-end model deployment, including leveraging AI agents, Model Context Protocol (MCP) for effective context management, and cloud platforms such as AWS (including AWS Bedrock), Azure, or similar services, is a strong plus. Your deliverables will include efficient, production-ready Python code, with experience in Java considered an asset. You will collaborate closely with Subject Matter Experts (SMEs) and the technology team to deploy and operationalize our data science pipelines.
This role requires a strong foundation in Natural Language Processing (NLP), Machine Learning, Transformer models and Generative AI, as well as proficiency in Python.
Responsibilities:
Collect data, perform data analysis, develop models, define quality metrics, and conduct quality assessments of models, along with regular presentations to stakeholders.
Create production-ready Python packages for each component of data science pipelines (e.g., pre-processing, model inference, evaluation) and coordinate their deployment with the technology team.
Design, develop, and deploy Generative AI models and solutions that meet specific business needs.
Expertise in Retrieval Augmented Generation (RAG) optimization and customization of existing RAG pipelines to meet specific project needs.
Proficiency in large-scale data ingestion, preprocessing, and transformation of multilingual content to ensure high-quality inputs for downstream models.
Build AI Agentic models with RAG pipeline.
Fine-tune large language models (LLMs) and transformer models to enhance accuracy and relevance.
Implement guardrails and evaluation mechanisms to ensure responsible and ethical AI usage.
Conduct rigorous testing and evaluation of AI models to ensure high performance and reliability.
Integrate data science components and ensure end-to-end quality assessment.
Maintain the robustness of data science pipelines against model drift and ensure consistent output quality.
Establish a reporting process for pipeline performance and develop automatic re-training strategies for existing pipelines.
Work collaboratively with cross-functional teams to integrate AI solutions into existing products and services.
Mentor junior data scientists and contribute to the knowledge-sharing culture within the team.
Stay up-to-date with the latest advancements in AI, machine learning, and NLP technologies.
Requirements:
Master’s or Ph.D. in Computer Science, Data Science, Artificial Intelligence, or a related field.
7+ years of relevant applied experience in data science, with a focus on Generative AI, NLP, and machine learning.
Proficiency in Python for data analysis, model development, and deployment.
Strong experience with transformer models and fine-tuning techniques for large language models (LLMs).
Proficiency in Generative AI technologies, including utilizing LLMs via API access, LLM evaluation tools, and prompt engineering.
Knowledge of various RAG pipelines and their practical implementation.
Experience building Agentic RAG systems is strong requirement.
Experience with AI agent management frameworks such as LangChain, AutoGen, Haystack, MCP, or similar tools.
Experience with advanced algorithms in deep learning, neural networks, reinforcement learning, and transfer learning.
Familiarity with traditional machine learning algorithms such as random forests, SVM, logistic regression, and Bayesian modelling for model building, validation, and testing.
Understanding of AI ethics, guardrail implementation, and evaluation metrics.
Familiarity with cloud platforms (e.g., Bedrock, AWS, Azure) for model deployment and the creation of production-ready pipelines.
Proficiency in data visualization tools and techniques.
Experience with version control systems (e.g., GitLab or GitHub), Jira, and working in an Agile environment.
Proficient in using *nix systems, open-source software, Jupyter Notebook, libraries, and cloud computing.
Excellent problem-solving and analytical skills, with strong attention to detail.
Strong communication skills and the ability to work effectively in a team-oriented environment.
Work in a Way That Works for You:
We promote a healthy work/life balance and provide various well-being initiatives, shared parental leave, study assistance, and sabbaticals to help you meet both your immediate responsibilities and long-term goals.
Working for You:
We offer comprehensive benefits to support your health and well-being, including:
Health insurance for you and your family.
Enhanced health insurance options at competitive rates.
Group life insurance for financial security.
Group accident insurance for protection against accidental death and permanent disability.
Flexible working arrangements for work-life balance.
Employee assistance programs for personal and work-related support.
Medical screenings prioritize your well-being.
Modern family benefits, including maternity, paternity, and adoption support.
Long-service awards to recognize dedication.
New baby gifts to celebrate parenthood.
Subsidized meals at specific locations.
Various paid time-off options, including casual leave, sick leave, privilege leave, compassionate leave, special sick leave, and public holidays.
Free transportation for home-office-home travel in select locations.
About Business:
We are a global leader in information and analytics, helping researchers and healthcare professionals advance science and improve health outcomes. Building on our publishing heritage, we combine quality information and vast data sets with analytics to support visionary science, research, health education, interactive learning, and exceptional healthcare and clinical practice. At Elsevier, your work contributes to addressing the world's grand challenges and creating a more sustainable future. We harness innovative technologies to support science and healthcare, partnering for a better world
-----------------------------------------------------------------------
We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120.
Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.
Please read our Candidate Privacy Policy.
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.
USA Job Seekers:
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile APIs AWS Azure Banking Bayesian Computer Science Data analysis Data visualization Deep Learning Engineering Generative AI GitHub GitLab Haystack Java Jira Jupyter LangChain LLMs Machine Learning ML models Model deployment Model inference NLP Open Source Pipelines Privacy Prompt engineering Python RAG Reinforcement Learning Research Security Statistics Testing
Perks/benefits: Career development Flex hours Flex vacation Health care Insurance Medical leave Parental leave
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.