Summer Intern/Data Engineering
Cambridge MA/Onsite, US
Full Time Internship Entry-level / Junior USD 42K - 80K
GSK
At GSK, we unite science, technology and talent to get ahead of disease togetherWhy GSK?
Uniting science, technology and talent to get ahead of disease together.
GSK is a global biopharma company with a special purpose – to unite science, technology and talent to get ahead of disease together – so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns – as an organisation where people can thrive. We prevent and treat disease with vaccines, specialty and general medicines. We focus on the science of the immune system and the use of new platform and data technologies, investing in four core therapeutic areas (infectious diseases, HIV, respiratory/ immunology and oncology).
Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it’s also about making GSK a place where people can thrive. We want GSK to be a place where people feel inspired, encouraged and challenged to be the best they can be. A place where they can be themselves – feeling welcome, valued, and included. Where they can keep growing and look after their wellbeing. So, if you share our ambition, join us at this exciting moment in our journey to get Ahead Together.
Department Description
Onyx Data Engineering Team harnesses the power of data to drive innovation and support GSK’s strategic goals. By building robust, scalable data solutions, we empower all teams to make informed decisions, ultimately enhancing patient outcomes and advancing our mission.
Our Data Engineering interns will serve as technical contributors, helping teams translate well-defined specifications into functioning components—such as pipelines, services, APIs, or functions. They will follow best practices for software development and data engineering, including code quality, documentation, DevOps, and testing.
At GSK, we have a leading portfolio of vaccines, respiratory and specialty medicines as well as R&D based on immune system and genetics science. GSK’s ambition and purpose are to unite science, talent and technology to get ahead of disease together – all with the clear ambition of delivering human health impact; stronger and more sustainable shareholder returns; and as a new GSK where outstanding people thrive.This internship will support the Onyx Data Engineering Team to deliver tech innovation that will support our scientists to drive scientific breakthroughs that will change lives all over the world.
Something to get you excited about Onyx:
What is Onyx? Hear directly from Shobie, GSK’s Chief Digital and Technology Officer: Welcome to GSK Onyx - YouTube
Why Onyx? Hear from Nick, VP of Onyx Research Data Platform and Kim,SVP and Head of AI/ML: Why Onyx? - YouTube
Job Description
- Create modular code and services using modern data engineering tools (Python, Spark, Kafka) and orchestration platforms (Google Workflow, Airflow).
- Develop well-engineered solutions with automated test suites and comprehensive documentation.
- Maintain consistent logging and data lineage by enforcing platform abstractions.
- Adhere to QMS (Quality Management System) frameworks and CI/CD best practices for reliable deployments.
- Troubleshoot and resolve issues in existing tools, services, and pipelines.
Minimum Qualifications
- Pursuing a Bachelor’s or Master’s in Computer Science or related disciplines.
- Proficiency in at least one programming language (Python, Java, or Scala).
- Basic knowledge of SQL and relational databases.
- Familiarity with data structures, algorithms, and simple ETL concepts.
- Must be able to work full-time (35-40 hours/week) throughout the 12-week internship (May/June - August 2025).
- Must have an active student status and/or within 12 months post-graduation from a BS or MS degree program. Post-doctoral candidates are not eligible.
Preferred Qualifications
- Exposure to modern software development tools/ways of working (e.g., git/GitHub, DevOps)
- Prior hands-on project or internship in data engineering or software development.
- Exposure to common tools for data engineering (e.g. Spark, Kafka)
- Proficiency with Microsoft Word, PowerPoint, Excel and Outlook
- Self-starters that take initiative and think quickly on their feet
- Effective communication and teamwork skills.
- Motivation for an exciting summer opportunity with potential for future full-time employment
Eligibility Requirements
- Must successfully pass a drug screen and background check prior to assignment target start date.
- If your skillsets are a match for this role, you will be contacted by our recruitment team with next steps to complete our internal World of GSK Assessment.
- Please note, you must receive a passing score to move forward in the interview process. Once your assessment is complete, a recruiter will review your results and be in touch with next steps.
Benefits
- While GSK embraces a flexible work environment, we do require certain positions to be onsite. Candidates who are hired for an on-site role or hybrid role, and reside outside of 50-miles from their assigned work location, are eligible for relocation stipend. This is a one-time payment to help offset housing & relocation expenses. Please refer to the position details for the requirements of each position.
- GSK Interns and Co-ops are offered a competitive hourly pay rate and benefits. Please note, benefits eligibility determined the month following date of hire.
This job posting is for a temporary role as an employee of Atrium on assignment at GSK. The individual selected for this role will be offered the role as an employee of Atrium; compensation, medical benefits, fringe benefits and other terms and conditions of employment shall be presented by Atrium upon offer. The pay rate range provided is a reasonable estimate of the anticipated compensation range for this job at the time of posting. The actual pay rate will be based on several factors, including skills, competencies, experience, educational degree obtained, location and/or being pursued and other job-related factors permitted by law.
In addition, this role will be eligible for overtime pay, in accordance with federal and state requirement.
Pay Rate Range: $21/hr to $40/hr
Interested in learning more? Register now on our digital learning platform (GSK Get Ahead - Connectr) where you can access interview and assessment hints and tips, speak to a mentor and learn more about life at GSK.
Tags: Airflow APIs CI/CD Computer Science DevOps Engineering ETL Excel Git GitHub Java Kafka Machine Learning Pipelines Python R R&D RDBMS Research Scala Spark SQL Testing
Perks/benefits: Career development Competitive pay Flex hours Health care Relocation support
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.