Data Scientist III
India-Chennai (Ascendas Tech park)
Elsevier
Elsevier is a global information analytics company that helps institutions and professionals progress science, advance healthcare and improve performanceData Scientist III
Are you interested in working with data and analytics to solve problems?
Are you interested in bringing your GenAI, Machine Learning and NLP expertise to projects?
About our Team:
Data Science Health Content Operations team works with a focus on Machine Learning, NLP, and Statistical techniques. It helps in building state of the art applications for the health sciences domain.
About the Role:
You will be responsible for building, testing, and maintaining our GenAI, RAG and NLP solutions. You will work throughout the whole life cycle of data science projects: design, implementation, evaluation, product ionization and beyond. You will deliver efficient and production-ready Python code. You will collaborate with SME on Evaluation and technology team to deploy and productionize our data science pipelines.
Responsibilities:
Collecting Data, data analysis, model development, defining quality metrics, quality assessment of models and regular presentations to stakeholders.
Creating production-ready Python packages for each component of data science pipelines (such as pre-processing, model inference, evaluation) and their deployment together with the technology team
Developing GenAI pipelines, building search applications using RAG.
Evaluating GenAI applications.
Integrating of data science components and end-to-end quality assessment
Keeping our data science pipelines robust against model drift and ensuring continuous output quality.
Developing tools and strategies for maintenance such as automatic model re-training.
Establishing the reporting process of the performance of the pipeline, and automatic re-training strategy for the existing pipelines
Requirements:
3+ years of relevant applied experience or PhD/MSc/MTech in the field of computer science, data science, artificial intelligence
Experience in some relevant implementation platforms for ML/NLP tasks – proficiency in Python, SQL, R, Java
Experience in model building, validation and testing using ML algorithms such as random forest, SVM, Logistic Regression, Bayesian modelling etc
Experience with GitLab or Github, Jira and working in Agile environment
Experience using *nix systems, open-source software, jupyter notebook hubs, libraries and cloud computing is required
Experience using latest algorithms in deep learning, neural networks, reinforcement learning, transfer learning
Experience with GenAI technologies. ie., utilizing LLMs via API access, building RAG systems using LangChain or Llamaindex, LLM Evaluation tools and Prompt Engineering.
About Us:
A global leader in information and analytics, we help researchers and healthcare professionals’ advance science and improve health outcomes for the benefit of society. Building on our publishing heritage, we combine quality information and vast data sets with analytics to support visionary science and research, health education and interactive learning, as well as exceptional healthcare and clinical practice. At Elsevier, your work contributes to the world’s grand challenges and a more sustainable future. We harness innovative technologies to support science and healthcare to partner for a better world.
Join Us:
Purposeful Work When you work with us, your work matters. You are part of an organization that nurtures your curiosity to stimulate innovation for the communities that we serve.
Growing Every DayLike the communities we serve, you are on a constant path of discovery to shape your career and personal development.
Colleagues Who CareYou will be part of the Elsevier family. We will support your well-being and provide the flexibility you need to thrive at work and home.
Together, we create possibilities.
-----------------------------------------------------------------------
Elsevier is an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form: https://forms.office.com/r/eVgFxjLmAK , or please contact 1-855-833-5120.
Please read our Candidate Privacy Policy.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile APIs Bayesian Computer Science Data analysis Deep Learning Engineering Generative AI GitHub GitLab Java Jira Jupyter LangChain LLMs Machine Learning ML models Model inference NLP Open Source PhD Pipelines Privacy Prompt engineering Python R RAG Reinforcement Learning Research SQL Statistics Testing
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.