Data Engineer Programmer – NCCT CTMC
Winston-Salem, NC, United States
Atrium Health
Atrium Health provides healthcare, hope and healing at more than 1,400 care locations and 40 hospitals across NC, SC, GA and AL.JOB SUMMARY
The National Center for Clinical Trials (NCCT) is designed to serve as an innovative platform to revolutionize and catalyze the conduct of clinical trials−greatly accelerating the translation of scientific findings into improvements in the prevention, diagnosis, and treatment of disease for our communities and patients. The NCCT will offer core services for patient recruitment and enrollment, trial administration and follow-up, and to gather real world data and evidence.
The Clinical Trial Methods Center (CTMC) has been established within the Wake Forest University School of Medicine to provide the necessary tools and expertise that the NCCT will access and apply to deliver many of its core services.
The Data Engineer Programmer will be part of a team that provides informatics expertise including integrating and normalizing data from disparate sources, creating data ELT pipelines, and API development; and will report to the Informatics Lead Programmer who oversees the team of application and data programmers in the CTMC. This position will also work with Investigators, Evaluation staff, Industry Sponsors, and NCCT Leadership to identify and evaluate new data sources, automate data ingestion, and create data management processes and tools. Data infrastructure enables informational insight in support of clinical trial startup, from site and study feasibility, through patient recruitment and data collection, to follow-up outcomes and process assessment.
ESSENTIAL FUNCTIONS
- Attend project and departmental meetings and contribute to the project design concerning data management needs
- Collaborate with faculty, team leads, and stakeholders to anticipate, define, and satisfy data needs
- Builds custom ingestion pipelines to incorporate data from novel sources
- Ensures that there is visibility on the status of automated data tasks to catch mistakes before they become problems
- Collaborate with other members of the data team to improve performance and stability of transformation tasks
- Participate in design conversations for improving the architecture of our data infrastructure
- Supports team in identifying and implementing data integration and quality control strategies to improve data quality and availability.
- Prepares research data for ingestion and conversion to a unified data standard using ETL and automation tools.
- Assists the team by maintaining the database environment by creating views/queries, documentation, and data pipelines.
- Owns data integrity, availability, documentation, and efficient access to data.
- Identifies opportunities for process improvement in the end-to-end data development and delivery lifecycle.
- Incorporates automation wherever possible to improve access to data and analyses.
- Performs other related duties as needed
EDUCATION/EXPERIENCE
Bachelor's Degree in an applicable field with 3 years of experience working on full-cycle data analytics and visualization projects; or an equivalent combination of education and experience.
SKILLS/QUALIFICATIONS
- Strong initiative and proven ability to work independently
- Requires moderate skill set and proficiency in discipline. Conducts work assignments of increasing complexity, under moderate supervision with some latitude for independent judgment.
- Experience with data replication tools or services such as Meltano, Airbyte, Fivetran, or Stitch
- Experience with orchestration tools such as Airflow, Luigi, Prefect, or Dagster
- Experience using scalable and distributed compute, storage, and networking resources such as those provided by Azure, especially in the context of the Snowflake data stack
- Experience with code versioning systems such as Git
- Knowledge of common file formats for analytic data workloads like Parquet, ORC, or Avro
- Knowledge of high-performance table formats such as Apache Iceberg or Delta Lake
- Additional consideration given for experience with tools, languages, data processing frameworks, and databases such as R, Python, SQL, MongoDB, Redis, Hadoop, Spark, Hive, Scala, BigTable, Cassandra, Presto, Strom.
- Experience with healthcare and/or biomedical research operations and systems a plus
- Ability to communicate on a professional level with customers and staff
- Superior problem-solving skills
- Familiarity with the clinical trial lifecycle a plus
Position is located in Winston-Salem, NC - May be eligible for remote employment
Wake Forest University School of Medicine (WFUSM) is a U.S. News and World Report top 50 ranked medical school, integrated with a world-class health system, Atrium Health. WFUSM, the academic core of Atrium Health Enterprise, is a recognized leader in experiential medical education and groundbreaking research that includes Wake Forest Innovations, a commercialization enterprise focused on advancing health care through new medical technologies and biomedical discovery. WFUSM, has over $300M in annual, extramural funding that drives a cutting-edge Academic Learning Health System by integrating innovative research with excellent patient care across our enterprise.
Atrium Health Wake Forest Baptist is based in Winston-Salem, North Carolina and is part of Advocate Health, which is headquartered in Charlotte, North Carolina, and is the fifth-largest nonprofit health system in the United States, created from the combination of Atrium Health and Advocate Aurora Health. AHWFB is an 885-bed tertiary-care hospital in Winston-Salem – that includes Brenner Children’s Hospital, five community hospitals, more than 300 primary and specialty care locations and more than 2,700 physicians. Our highly integrated academic and clinical environment is deeply committed to improving health, elevating hope, and advancing healing – for all.
It should be noted that while you are applying on the Wake Forest University School of Medicine Career Site, you will receive communications from the Atrium Health Recruitment Team. Please know that this is an expected process. Thanks in advance for your flexibility.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow API Development APIs Architecture Avro Azure Bigtable Cassandra Dagster Data Analytics Data management Data pipelines Data quality ELT ETL FiveTran Git Hadoop MongoDB Nonprofit Parquet Pipelines Python R Research Scala Snowflake Spark SQL
Perks/benefits: Career development Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.