Comcast Data Engineer Intern
CO - Englewood, 183 Inverness Dr West
Comcast
Comcast NBCUniversal creates incredible technology and entertainment that connects millions of people to the moments and experiences that matter most.Job Summary
This job code is to be used for internships and co-ops. It is not to be used for temporary or contract workers.Job Description
Program Overview
Discover opportunities designed to set your career in motion! The Comcast internship/co-op program will help you cultivate meaningful relationships, develop strong interpersonal and business skills, gain exposure to the day-to-day operations of a Fortune 40 media and technology company, and receive mentorship opportunities to expand your professional network.
This program immerses students into the daily operation of a contemporary media and technology company while working side-by-side with Comcast’s top innovators. The student becomes an integral part of the Comcast team working on creative, innovative, and thought-provoking projects within various business units.
Organization & Team Overview
The dx Team in Comcast TPX organization is building core components needed to drive the next generation of data platforms and data processing capability at Comcast. Building data products, identifying trouble spots, and optimizing the overall user experience is a challenge that can only be met with a robust architecture capable of providing insights that would otherwise be drowned in an ocean of data. To provide access to data in Comcast’s platforms, dx team has built an enterprise Query Fabric based on open-source and commercial Trino technologies. This Data Engineer intern role will include a mixture of DevOps responsibilities, contributing to the software development of the Query Fabric Platform, new software features and defect resolution, day-to-day platform operations and support, and other platforms/technologies (e.g. caching and web services) as needed. You will have the opportunity to make contributions to the Trino codebase as well as other open-source projects, and you will be expected to support hundreds (and eventually thousands) of internal customers in their use of this platform.
Success in this role is best enabled by a broad mix of skills and interests ranging from traditional software engineering prowess to the multidisciplinary field of data science and data engineering.
Role Description
Comcast’s dx org is seeking a highly motivated and ambitious Data Engineer Intern to join our dynamic Query Fabric team. dx Open Egress Team has responsibility for the harmonization of the data egress and consumption layer across Comcast. We support accessing enterprise data sources via a consolidated set of entry points to help lower the barrier to entry to data access and use.
The Data Engineer intern will develop (code/program), test, debug SQL queries and data programs supporting both internal and external technically challenging business requirements (complex transformations, high data volume), as well as provide operational support for the underlying services and infrastructure of our big data platforms.
Develop solutions capable of processing millions of events per second and multi-billions of events per day, providing both a real time and historical view into the operation of Comcast’s wide array of systems. Design collection and enrichment system components for quality, timeliness, scale and reliability. Work on high-performance real-time data stores and a massive historical data store using best-of-breed and industry-leading technology. Build platforms that allow others to design, develop, and apply advanced statistical methods and Machine Intelligence algorithms, fostering self-service capabilities and ease of use across the entire Technology, Product, Xperience (TPX) organization landscape and beyond!
Job Responsibilities
Data Engineer intern will have the opportunity to work on a variety of projects across dx platform engineering team that have significant operational impact on the business within 1-3 years. Specific responsibilities will include but are not limited to:
Participate in the development and/or deployment of components and infrastructure supporting big data platforms
Support platform capabilities that analyze massive amounts of data both in real-time and batch processing
Facilitate the deployment of prototype ideas for new tools, products and services and the environments that support them
Employ rigorous continuous delivery practices managed under an agile software development approach
Ensure a quality transition to production and solid production operation of the platforms
Enhance our DevOps practices to deploy and operate our systems
Automate and streamline our operations and processes
Build and maintain tools for deployment, monitoring and operations
Troubleshoot and resolve issues in our development, test and production environments
Analyzes and determines data integration needs
Consults with and supports customer integration needs leveraging our platforms
Evaluates and plans software designs, test results and technical manuals using Big Data ecosystem
Reviews literature, current practices relevant to the solution of assigned projects in the Data Egress/Access domain
Experience with DevOps tools (GitHub, Jira) and methodologies (Agile, Scrum, Kanban, Test Driven Development)
Experience with a variety of relational and NoSQL database, Teradata, and other large Data Warehouse environment access capabilities
Exposure to data integration and storage in AWS - S3, Lambda, Glue Crawlers, Data Pipelines
Exposure to Hadoop ecosystem tools like Spark, YARN, HDFS, Hive, Sqoop, understanding of systems performance data (collection, monitoring, analysis)
Exposure to CI/CD, containerization and test-driven development (TDD)
Exposure to data loads in Databricks
Programs new software using Java or Python and Shell Scripts
Deep knowledge of SQL and data sourcing technologies such as Informatica.
Monitor job performances, file system/disk-space management, cluster and database connectivity, log files, management of backup/security and troubleshooting various user issues
Edits and reviews technical requirements documentation
Displays knowledge of software engineering methodologies, concepts, skills and their application in the area of specified engineering specialty (like Data Egress)
Displays knowledge of, and ability to apply, process software design and redesign skills
Displays in-depth knowledge of, and ability to apply, project management skills
Works independently, assumes responsibility for job development and training, researches and resolves questions and problems, requests supervisor input and keeps supervisor informed required
Other duties and responsibilities as assigned
All work needs to be documented
Preferred Skills
Excellent organizational, interpersonal and communication skills
Self-starter with ability to drive analytics independently and manage multiple projects / deadlines
Familiarity with MPP Databases (massively parallel processing) is a requirement
RDMS: Teradata or Oracle or MS SQL or MySQL
Language/Scripts: Java 8+ or Python, Shell Scripts, SQL
Scala, Python, R, UC4, Ranger, Kafka, AWS, AWS Lambda, BTEQ, Spark, Sqoop, gRPC, Apache Thrift, HTTP2.0
Web: HTTP, REST
Security: OAuth 2 with OpenId, RPC
AWS: S3, Glue
Experience with hybrid Linux infrastructure footprints – cloud hosts (e.g. AWS EC2 instances), Virtual hosts (e.g. VMWare), and local data center hosts
Preferred Majors: Computer Science, Information Technology, Data Engineering, Data Sciences
Minimum Qualifications and Eligibility Requirements
Currently pursuing bachelor’s in computer science engineering or information technology
Rising Junior or Rising Senior only (must have a graduation date between Winter 2025- Spring 2027)
Returning to degree-program (for at least a semester) after the completion of the summer internship (meaning, student must be returning to school for Fall 2025 semester before graduating)
Available to work 40 hours per week over the course of the summer program- June 2 through August 15, 2025
Authorized to work in the United States with no current or future sponsorship needs
Available to report in-person to the work location on the job posting (unless virtual offering)
Comcast is an Affirmative Action/EEO employer M/F/D/V.
Compensation
Base Pay: $32.00Base pay is one part of the Total Rewards that Comcast provides to compensate and recognize employees for their work. Most sales positions are eligible for a Commission under the terms of an applicable plan, while most non-sales positions are eligible for a Bonus. Additionally, Comcast provides best-in-class Benefits to eligible employees. We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. That’s why we provide an array of options, expert guidance and always-on tools, that are personalized to meet the needs of your reality – to help support you physically, financially and emotionally through the big milestones and in your everyday life. Please visit the compensation and benefits summary on our careers site for more details.
The application window is 30 days from the date job is posted, unless the number of applicants requires it to close sooner or later.
Education
Certifications (if applicable)
Relative Work Experience
0-2 YearsComcast is proud to be an equal opportunity workplace. We will consider all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, genetic information, or any other basis protected by applicable law.Tags: Agile Architecture AWS Big Data CI/CD Computer Science Databricks Data pipelines Data warehouse DevOps EC2 Engineering GitHub Hadoop HDFS Informatica Java Jira Kafka Kanban Lambda Linux Machine intelligence MPP MS SQL MySQL NoSQL Open Source Oracle Pipelines Python R Scala Scrum Security Spark SQL Statistics TDD Teradata
Perks/benefits: Career development Team events
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.