Data Engineer
PBS Headquarters
PBS
Watch full episodes of your favorite PBS shows, explore music and the arts, find in-depth news analysis, and more. Home to Antiques Roadshow, Frontline, NOVA, PBS Newshour, Masterpiece and many others.
Position Title: Data Engineer - A360
Department: Product Development
Corporate Area:
Status: Fixed Term, Full time Exempt
Manager Title: Director, Technology
Position Overview:
PBS is seeking a skilled Data Engineer to join our Data Team. The ideal candidate will design, build, and maintain scalable data pipelines and ensure the availability, reliability, and integrity of our data. The candidate will join our team of talented data engineers and data scientists and work alongside our product team and other stakeholders to improve the quality of PBS’s data and support data-driven decision-making and analysis across the public media ecosystem.
Key responsibilities will include, but are not limited to:
Work as part of the Data Engineering Team to design, code and deploy cloud data solutions that extract, transform and load data into our data architecture.
Serve as the primary point of contact for client-facing data requests from internal and external partners. Provide those partners with custom one-time or recurring exports from the data lake, help them navigate any technical challenges, and gather technical requirements when they seek to use or integrate with the data lake.
Lead the building of dashboards and other tools that facilitate the monitoring of the volume, velocity and veracity of data in the Enterprise Data Lakehouse.
Build and maintain the ingestion and transformation of digital analytics data into the data lake. Serve as the expert for this key data source and ensure it aligns with all Data Governance policies.
Evaluate and test new data-processing technologies.
Maintain and update existing data pipelines, data marts and other key features of the data architecture.
Participate in stand-ups and software development syncs to align and collaborate with our Data Engineering Team.
Requirements for success:
4+ years of experience building data products using cloud data tools.
Proficient in Python, with a deep understanding of data interface libraries.
Proficiency in SQL (DML, DDL) and experience with an RDBMS, preferably PostgreSQL.
A deep understanding of data object modeling and database design (normal forms, indexing, query optimization).
Experience with processing basic data file formats: CSV, JSON.
Experience with development tools such as GitHub, Jira.
Preferred Skills:
Familiarity with Big Data tools such as Spark (using PySpark).
Familiarity with AWS Data tools such as: S3, Lambda, EMR, Glue, Athena, Managed Airflow, ECS, DMS, DataSync
Familiarity with big data file formats: Parquet/Iceberg
Familiarity with Python Data Science libraries, such as: pandas, numpy
Python visualization libraries such as Streamlit, matplotlib
Snowflake / Metabase
dbt / dbt Cloud
Google Cloud tools/products: BigQuery, Cloud Functions, Cloud Storage, Google Analytics 4
PBS is an Equal Opportunity Employer in accordance with the EEOC and the Commonwealth of Virginia.