Data Engineer
Ohio
Full Time Mid-level / Intermediate USD 103K - 143K
CDC Foundation
The CDC Foundation is a global nonprofit, managing public health programs that impact chronic and infectious diseases and emergency threats like COVID-19.
The CDC Foundation helps the Centers for Disease Control and Prevention (CDC) save and improve lives by unleashing the power of collaboration between CDC, philanthropies, corporations, organizations and individuals to protect the health, safety and security of America and the world. The CDC Foundation is the go-to nonprofit authorized by Congress to mobilize philanthropic partners and private-sector resources to support CDC’s critical health protection mission. Since 1995, the CDC Foundation has raised over $1.9 billion and launched more than 1,300 programs impacting a variety of health threats from chronic disease conditions including cardiovascular disease and cancer, to infectious diseases like rotavirus and HIV, to emergency responses, including COVID-19 and Ebola. The CDC Foundation managed hundreds of programs in the United States and in more than 90 countries last year. Visit www.cdcfoundation.org for more information.
Job Highlights
- Location: Remote, must be based in the United States
- Salary Range: $103,500-$143,500 per year, plus benefits. Individual salary offers will be based on experience and qualifications unique to each candidate.
- Position Type: Grant funded, limited-term opportunity
- Position End Date: June 30, 2025
Overview
The Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization. This role is aligned to the Workforce Acceleration Initiative (WAI). WAI is a federally funded CDC Foundation program with the goal of helping the nation’s public health agencies by providing them with the technology and data experts they need to accelerate their information system improvements.
Working within the Cleveland Department of Public Health (CDPH), the Data Engineer is responsible for enabling data integration and data preparation pipelines for downstream analytics on behalf of the Office of Epidemiology and Population Health (OEPH). This role requires business intuition and the ability to use a variety of technical and soft skills to collaborate across departments. The Data Engineer will be hired by the CDC Foundation and assigned to the Epidemiologist responsible for informatics in OEPH. The Data Engineer will also coordinate with the Office of Urban Analytics & Innovation (Urban AI) at the City of Cleveland to align on data infrastructure requirements and enterprise best practices. This position is eligible for a fully remote work arrangement for U.S.-based candidates.
All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, color, religion, sex, national origin, age, mental or physical disabilities, veteran status, and all other characteristics protected by law.
We comply with all applicable laws including E.O. 11246 and the Vietnam Era Readjustment Assistance Act of 1974 governing employment practices and do not discriminate on the basis of any unlawful criteria in accordance with 41 C.F.R. §§ 60-300.5(a)(12) and 60-741.5(a)(7). As a federal government contractor, we take affirmative action on behalf of protected veterans.
The CDC Foundation is a smoke-free environment. Relocation expenses are not included.
Responsibilities
- Utilize software engineering methods and tools on a common data analytic platform to integrate, process and prepare multiple sources of data for downstream public health surveillance analyses.
- Collaborate with the Data Analyst and Epidemiologists to understand data requirements, and develop and maintain data pipelines that automate data transformation tasks.
- Perform data linkages between public health surveillance data and geospatial data assets.
- Document data transformation processes and maintain comprehensive records for reproducibility.
- Test data and/or applications to validate data accuracy and quality.
- Track projects from conceptualization to completion, including helping to create project roadmaps, project plans, and requirements documentation.
- Create and manage the systems and pipelines that enable efficient and reliable flow of data, including ingestion, processing, and storage.
- Collect data from various sources, transforming and cleaning it to ensure accuracy and consistency. Load data into storage systems or data warehouses.
- Optimize data pipelines, infrastructure, and workflows for performance and scalability.
- Monitor data pipelines and systems for performance issues, errors, and anomalies, and implement solutions to address them.
- Implement security measures to protect sensitive information.
- Collaborate with data scientists, analysts, and other partners to understand their data needs and requirements, and to ensure that the data infrastructure supports the organization's goals and objectives.
- Collaborate with cross-functional teams to understand data requirements and design scalable solutions that meet business needs.
- Implement and maintain ETL processes to ensure the accuracy, completeness, and consistency of data.
- Design and manage data storage systems, including relational databases, NoSQL databases, and data warehouses.
- Stay current on industry trends, best practices, and emerging technologies in data engineering, and incorporate them into the organization's data infrastructure.
- Provide technical guidance to other staff.
- Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings.
Qualifications
- Bachelor's degree in computer science or information systems, or equivalent experience
- Demonstrated ability in complex data management and data preparation, including but not limited to data storage, data standardization, and data operations, for data warehousing efforts
- Experience working with data integration frameworks
- Experience working with cloud services & infrastructure (Microsoft Azure Databricks preferred)
- Experience designing, writing, and delivering code (e.g., Java, Python, R) in a team environment, using source code control, unit testing, and other software engineering practices
- Ability to thrive in a project-based, team environment
Preferred Skills
- Spatial data experience, e.g. geopandas or ArcGIS