Senior Data Analyst – Data Infrastructure Development & Maintenance
Canada - Montreal (St. Laurent)
RELX
Make better decisions, get better results and be more productive with RELX's analytics and decision toolsSenior Data Analyst – Data Infrastructure Development and Maintenance
Would you like to be part of a global innovative team using the latest technologies to empower advanced analytical capabilities?
Does developing scalable solutions and delivering impactful insights for research intelligence excite you?
About the Team
As a leading international provider of high quality bibliometric and research evaluation services, the Analytical and Data Services (ADS) team helps position Elsevier as a trusted partner of leading institutions, funders, and governments it serves.
About the Role
Join our global research intelligence team as a Senior Data Analyst, playing a pivotal role in the development, optimization, and maintenance of our advanced data infrastructure. Our custom-built Python library enhances data production, discovery, and analysis, streamlining processes and replacing legacy pipelines. You'll be at the heart of maintaining and enhancing this critical infrastructure, ensuring seamless data workflows and empowering advanced analytical capabilities.
You’ll be instrumental in enabling our research analysts to efficiently discover, analyze, and produce high-quality insights that empower our global research intelligence capabilities.
Responsibilities
Leading the development, optimization, and maintenance of our in-house Python-based data library, ensuring robustness, scalability, and ease of use.
Collaborating with data scientists and analysts to implement new data pipelines, optimize existing processes, and support analytical workflows.
Continuously monitoring and improve data workflows, leveraging PySpark, DuckDB, and SQL databases.
Evaluating and incorporate new methodologies and analytical approaches into the data infrastructure.
Ensuring comprehensive documentation and adherence to established development conventions and versioning practices (Semantic Versioning, Conventional Commits).
Conducting thorough testing and validation using established caching mechanisms to maintain data accuracy and performance.
Actively participating in code reviews, fostering best practices in software development and data management across the team.
Requirements
Have proven experience developing and maintaining data infrastructure, pipelines, or analytical libraries.
Possess strong proficiency in Python, with demonstrated experience in PySpark, DuckDB, SQL databases, and ETL workflows, or a clear ability to pivot quickly to these technologies.
Have prior experience with code version control (Git)
Experience in academic or scientific publication domains helpful
Familiarity with Large Language Models (LLMs) and APIs preferred
Experience with AWS services (EC2, S3, Lambda) helpful
Familiarity with developing within Databricks environments.
Work in a way that works for you:
We promote a healthy work/life balance across the organization. We offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals.
Working for you:
We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer:
- Health plan benefits
- Employee Assistance Program
- Retirement Benefits
- Various Leave Programs
- Educational Assistance
- Disability, Life and Accidental Death Insurance
- Paid Vacation
- Up to two days of paid leave each to participate in Employee Resource Groups and to volunteer with charities and causes that matter to you
About the business:
A global leader in information and analytics, we help researchers and healthcare professionals advance science and improve health outcomes for the benefit of society. Building on our publishing heritage, we combine quality information and vast data sets with analytics to support visionary science and research, health education and interactive learning, as well as exceptional healthcare and clinical practice. At Elsevier, your work contributes to the world's grand challenges and a more sustainable future. We harness innovative technologies to support science and healthcare to partner for a better world.
-----------------------------------------------------------------------
We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120.
Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.
Please read our Candidate Privacy Policy.
USA Job Seekers:
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs AWS Banking Databricks Data management Data pipelines EC2 ETL Git Lambda LLMs Pipelines Privacy PySpark Python Research SQL Testing
Perks/benefits: Career development Health care Insurance Parental leave
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.