Data Engineer

PBS Headquarters

PBS

Watch full episodes of your favorite PBS shows, explore music and the arts, find in-depth news analysis, and more. Home to Antiques Roadshow, Frontline, NOVA, PBS Newshour, Masterpiece and many others.


Position Title:

Data Engineer - A360

Department:

Product Development


Corporate Area:

Digital & Marketing

Status:

Fixed Term, Full-time, Exempt

Manager Title:

Director, Technology

Position Overview:

PBS is seeking a skilled Data Engineer to join our Data Team. The ideal candidate will design, build, and maintain scalable data pipelines and ensure the availability, reliability, and integrity of our data. The candidate will join our team of talented data engineers and data scientists, working alongside our product team and other stakeholders to improve the quality of PBS’s data and support data-driven decision-making and analysis across the public media ecosystem.

Key responsibilities will include, but are not limited to:

  • Work as part of the Data Engineering Team to design, code and deploy cloud data solutions that extract, transform and load data into our data architecture.

  • Serve as the primary point of contact for client-facing data requests from internal and external partners. Provide those partners with custom one-time or recurring exports from the data lake, help them navigate technical challenges, and gather technical requirements when they seek to use or integrate with the data lake.

  • Lead the building of dashboards and other tools that facilitate the monitoring of the volume, velocity and veracity of data in the Enterprise Data Lakehouse.

  • Build and maintain the ingestion and transformation of digital analytics data into the data lake. Serve as the expert for this key data source and ensure it aligns with all Data Governance policies.

  • Evaluate and test new data-processing technologies.

  • Maintain and update existing data pipelines, data marts and other key features of the data architecture.

  • Participate in stand-ups and software development syncs to align and collaborate with our Data Engineering Team.

Requirements for success:

  • 4+ years of experience building data products using cloud data tools.

  • Proficiency in Python, with a deep understanding of data interface libraries.

  • Proficiency in SQL (DML, DDL) and experience with an RDBMS, preferably PostgreSQL.

  • A deep understanding of data object modeling and database design (normal forms, indexing, query optimization).

  • Experience processing basic data file formats: CSV, JSON.

  • Experience with development tools such as GitHub and Jira.

Preferred Skills:

  • Familiarity with Big Data tools such as Spark (using PySpark).

  • Familiarity with AWS data tools such as S3, Lambda, EMR, Glue, Athena, Managed Airflow, ECS, DMS, DataSync.

  • Familiarity with big data file formats: Parquet/Iceberg.

  • Familiarity with Python data science libraries such as pandas and NumPy.

  • Python visualization libraries such as Streamlit and Matplotlib.

  • Snowflake / Metabase.

  • dbt / dbt Cloud.

  • Google Cloud tools/products: BigQuery, Cloud Functions, Cloud Storage, Google Analytics 4.

PBS is an Equal Opportunity Employer in accordance with the EEOC and the Commonwealth of Virginia.
