Data Engineer
Maine
Full Time Mid-level / Intermediate USD 103K - 143K
CDC Foundation
The CDC Foundation is a global nonprofit, managing public health programs that impact chronic and infectious diseases and emergency threats like COVID-19.Job Highlights
Position Title: Data Engineer Department: Programs and Innovation Office – Workforce Acceleration InitiativeLocation: Remote, must be based in the United StatesSalary Range: $103,500-$143,500 per year, plus benefits. Individual salary offers will be based on experience and qualifications unique to each candidate.Work Schedule: 8:00 am – 5:00 pm EST, Monday to FridayPosition Type: Grant funded, limited-term opportunityPosition End Date: June 30, 2025
Overview:
The Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization. This role is aligned to the Workforce Acceleration Initiative (WAI). WAI is a federally funded CDC Foundation program with the goal of helping the nation’s public health agencies by providing them with the technology and data experts they need to accelerate their information system improvements.
Working within the Maine CDC’s Informatics team, as part of the Division of Disease Surveillance, the Data Engineer will deliver the architecture needed for data generation, storage, processing, and analysis. The Data Engineer will collaborate with data content experts, analysts, data scientists, data modelers, warehouse architects, IT staff and other organization staff to design and implement proposed solutions and architectures that meet the needs of the public health agency. They will develop plans for data governance, policies and procedures, and documentation to set up a data lake/warehouse for the Infectious Disease Epidemiology Surveillance Systems, with the goal of expanding its capabilities to include additional areas within DHHS. They will also develop use cases to encourage other programs to add data to the data lake/warehouse.
The Data Engineer will be hired by the CDC Foundation and assigned to the Maine CDC Informatics team, as part of the Division of Disease Surveillance. This position is eligible for a fully remote work arrangement for U.S. based candidates.
Responsibilities
· Collaborate with data scientists, analysts, and other partners to understand their data needs and requirements, and to ensure that the data infrastructure supports the organization's goals and objectives.· Collaborate with cross-functional teams to understand data requirements and design scalable solutions that meet business needs.· Implement and maintain ETL processes to ensure the accuracy, completeness, and consistency of data.· Implement security measures to protect sensitive information.· Design and manage data storage systems, including relational databases, NoSQL databases, and data warehouses.· Knowledgeable about industry trends, best practices, and emerging technologies in data engineering, and incorporating the trends into the organization's data infrastructure.· Create and manage the systems and pipelines that enable efficient and reliable flow of data, including ingestion, processing, and storage.· Collect data from various sources, transforming and cleaning it to ensure accuracy and consistency. Load data into storage systems or data warehouses.· Optimize data pipelines, infrastructure, and workflows for performance and scalability.· Monitor data pipelines and systems for performance issues, errors, and anomalies, and implement solutions to address them.· Provide technical guidance to other staff.· Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings.
Required Qualifications:
· Bachelor's degree in Computer Science, Information Technology, Data Science, or a related field is preferred.· Minimum of 5 years of relevant experience in data engineering.· Proficiency in programming languages commonly used in data engineering, such as Python, Java, Scala, or SQL. Candidate should be able to implement data automations within existing frameworks as opposed to writing one-off scripts.· Experience with large-scale projects using Amazon Web Services is required. Certification is preferred.· Strong technical writing skills for creating documentation, policies, and procedures.· Experience with project planning, including developing timelines, setting milestones, and managing resources.· Knowledge of data warehousing concepts and tools.· Experience with cloud computing platforms.· Experience with data security and data governance.· Experience regarding engineering best practices such as source control, automated testing, continuous integration and deployment, and peer review.· Expertise in data modeling, ETL (Extract, Transform, Load) processes, and data integration techniques.· Strong analytical thinking and problem-solving abilities.· Excellent verbal and written communication skills, including the ability to convey technical concepts to non-technical partners effectively.· Flexibility to adapt to evolving project requirements and priorities.· Outstanding interpersonal and teamwork skills; and the ability to develop productive working relationships with colleagues and partners.· Experience working in a virtual environment with remote partners and teams.· Proficiency in Microsoft Office.
Preferred Qualifications:
· Experience with big data technologies and frameworks like Hadoop, Spark, Kafka, and Flink.· Strong understanding of database systems, including relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra).· Familiarity with agile development methodologies, software design patterns, and best practices.· Previous experience working with or within government agencies is preferred.
Special Notes:
This role is involved in a dynamic public health program. As such, roles and responsibilities are subject to change as situations evolve. Roles and responsibilities listed above may be expanded upon or updated to match priorities and needs, once written approval is received by the CDC Foundation in order to best support the public health programming.All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, color, religion, sex, national origin, age, mental or physical disabilities, veteran status, and all other characteristics protected by law.
We comply with all applicable laws including E.O. 11246 and the Vietnam Era Readjustment Assistance Act of 1974 governing employment practices and do not discriminate on the basis of any unlawful criteria in accordance with 41 C.F.R. §§ 60-300.5(a)(12) and 60-741.5(a)(7). As a federal government contractor, we take affirmative action on behalf of protected veterans.
The CDC Foundation is a smoke-free environment. Relocation expenses are not included.
Tags: Agile Architecture AWS Big Data Cassandra Computer Science Data governance Data pipelines Data Warehousing Engineering ETL Flink Hadoop Java Kafka MongoDB MySQL Nonprofit NoSQL Pipelines PostgreSQL Python R RDBMS Scala Security Spark SQL Testing
Perks/benefits: Health care Relocation support
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.