Associate Data Engineer
San Francisco
Full Time Mid-level / Intermediate USD 82K - 110K
GSK
At GSK, we unite science, technology and talent to get ahead of disease togetherThe Onyx Research Data Tech organization is GSK’s Research data ecosystem which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data and insights when they need it to give them a better starting point for and accelerate medical discovery. Ultimately, this helps us get ahead of disease in more predictive and powerful ways.
Onyx is a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward:
- Building a next-generation, metadata- and automation-driven data experience for GSK’s scientists, engineers, and decision-makers, increasing productivity and reducing time spent on “data mechanics”
- Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent
- Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real-time
Data Engineering is responsible for the design, delivery, support, and maintenance of industrialized automated end to end data services and pipelines. They apply standardized data models and mapping to ensure data is accessible for end users in end-to-end user tools through use of APIs. They define and embed best practices and ensure compliance with Quality Management practices and alignment to automated data governance. They also acquire and process internal and external, structure and unstructured data in line with Product requirements.
The Associate Data Engineer is a technical contributor who can take a well-defined specification for a function, pipeline, service, or other sort of component, and a technical approach to building it, and deliver it at a high level with guidance. They are aware of, and adhere to, best practice for software development in general (and data engineering in particular), including code quality, documentation, DevOps practices, and testing. They ensure robustness of our services and serve as an escalation point in the operation of existing services, pipelines, and workflows.
An Associate Data Engineer should have awareness of the most common tools (languages, libraries, etc) in the data space, such as Spark, Kafka, Storm, etc. They should be constantly seeking feedback and guidance to further develop their technical skills and expertise, and should take feedback well from all sources in the name of development.
Key responsibilities- Builds modular code / libraries / services / etc using modern data engineering tools (Python/Spark, Kafka, Storm, …) and orchestration tools (e.g. Google Workflow, Airflow Composer)
- Produces well-engineered software with guidance, including appropriate automated test suites and technical documentation
- Ensure consistent application of platform abstractions to ensure quality and consistency with respect to logging and lineage
- Adhere to QMS framework and CI/CD best practices
- Provide L3 support to existing tools / services / pipelines
- Contribute to knowledge management activities and follow best practices
We are looking for professionals with these required skills to achieve our goals:
- Bachelor’s degree in Data Engineering, Computer Science, Software Engineering or related field.
- Software engineering experience
- Experience with writing data processing pipelines
- Experience with choosing appropriate data structures for scale and access patterns
- Exposure to automated testing techniques
- Exposure to database concepts and SQL and grasp of relational and analytical database management theory and practice
- Knowledge and use of at least one common programming language (e.g., Python, Scala, Java), including toolchains for documentation and testing
- Exposure to modern software development tools / ways of working (e.g. git/GitHub, DevOps tools, …)
- Exposure to common tools for data engineering (e.g. Spark, Kafka, Storm, …)
- Nice to have experience in Big Data and NoSQL
- Nice to have basic experience in cloud environments (AWS, Azure, GCP...)
#GSKOnyx, #LI-GSK and #GSKTech1 #earlycareers
The annual base salary for new hires in this position ranges from $82,025 to $110,975 taking into account a number of factors including work location within the US market, the candidate’s skills, experience, education level and the market rate for the role. In addition, this position offers an annual bonus and eligibility to participate in our share based long term incentive program which is dependent on the level of the role. Available benefits include health care and other insurance benefits (for employee and family), retirement benefits, paid holidays, vacation, and paid caregiver/parental and medical leave.Please visit GSK US Benefits Summary to learn more about the comprehensive benefits program GSK offers US employees.
Why GSK?
Uniting science, technology and talent to get ahead of disease together.
GSK is a global biopharma company with a special purpose – to unite science, technology and talent to get ahead of disease together – so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns – as an organisation where people can thrive. We prevent and treat disease with vaccines, specialty and general medicines. We focus on the science of the immune system and the use of new platform and data technologies, investing in four core therapeutic areas (infectious diseases, HIV, respiratory/ immunology and oncology).
Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it’s also about making GSK a place where people can thrive. We want GSK to be a place where people feel inspired, encouraged and challenged to be the best they can be. A place where they can be themselves – feeling welcome, valued, and included. Where they can keep growing and look after their wellbeing. So, if you share our ambition, join us at this exciting moment in our journey to get Ahead Together.
If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-877-694-7547 (US Toll Free) or +1 801 567 5155 (outside US).
GSK is an Equal Opportunity Employer and, in the US, we adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class.
Important notice to Employment businesses/ Agencies
GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.
Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit GSK’s Transparency Reporting For the Record site.
Tags: Airflow APIs AWS Azure Big Data CI/CD Computer Science Data analysis Data governance DevOps Engineering GCP Git GitHub Java Kafka Machine Learning NoSQL Pipelines Python Research Scala Spark SQL Testing Unstructured data
Perks/benefits: Career development Health care Insurance Medical leave Parental leave Salary bonus
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.