Data Engineer (SE5)
Petaling Jaya, Malaysia
Roche
As a pioneer in healthcare, we have been committed to improving lives since the company was founded in 1896 in Basel, Switzerland. Today, Roche creates innovative medicines and diagnostic tests that help millions of patients globally.At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.
The Position
Data Engineering & Infrastructure
Assist in building and maintaining data pipelines and basic ETL/ELT processes for structured and unstructured data.
Support implementation of data models in Snowflake under supervision.
Learn and apply DBT for data transformation and modeling tasks.
Support the management of AWS-based data infrastructure (e.g., S3, Lambda, Glue).
Help monitor data pipeline performance and troubleshoot basic issues.
Participate in system planning and documentation for data processes.
System & IT Support for Data Projects
Support data teams by preparing and validating datasets for analysis.
Assist in testing and deploying applications used in data engineering and science workflows.
Learn to work with operating systems, databases, and utilities software to improve data handling.
Contribute to discussions on business problems that can be solved with automated data workflows.
Process & System Optimization
Assist in identifying and implementing improvements to data flow and infrastructure.
Learn and apply data governance and security best practices in a regulated environment.
Help with automation tasks using orchestration tools such as Airflow or Prefect.
Participate in the evaluation and testing of new data tools and technologies.
Requirements
Must-Have:
Bachelor’s degree in Computer Science, Data Engineering, or a related field.
Up to 2 years of experience in data engineering, internships, or relevant academic projects.
Basic proficiency in Python for data processing and scripting.
Exposure to AWS services (S3, Lambda, Glue) through coursework or hands-on learning.
Familiarity with SQL for querying and basic data analysis.
Willingness to learn tools such as Snowflake, dbt, and orchestration platforms.
Good analytical thinking and communication skills.
Awareness or interest in data security, privacy, and compliance practices.
A healthier future drives us to innovate. Together, more than 100’000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.
Let’s build a healthier future, together.
Roche is an Equal Opportunity Employer.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS Computer Science Data analysis Data governance Data pipelines dbt ELT Engineering ETL Lambda Pipelines Privacy Python Security Snowflake SQL Testing Unstructured data
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.