Sr. Data Engineer, Data Quality
Bengaluru, India
Roku
This is the jump-off point to learn about working at Roku, including our employees, the Roku culture, our office locations around the world, and student internship opportunities. Roku, jobs, careers, internship, streaming, TV, engineeringTeamwork makes the stream work.
Roku is changing how the world watches TV
Roku is the #1 TV streaming platform in the US and Mexico, and we've set our sights on powering every television in the world. Roku pioneered streaming to the TV. Our mission is to be the TV streaming platform that connects the entire TV ecosystem. We connect consumers to the content they love, enable content publishers to build and monetize large audiences, and provide advertisers unique capabilities to engage consumers.
From your first day at Roku, you'll make a valuable - and valued - contribution. We're a fast-growing public company where no one is a bystander. We offer you the opportunity to delight millions of TV streamers around the world while gaining meaningful experience across a variety of disciplines.
About the team
The mission of our team is to develop a world-class big data platform that will provide value to us, our partners, and our customers by leveraging data. We aim to democratize data, provide self-service reporting and analytics tools, and fuel existing and new business/engineering initiatives that are critical to business success. You will join the Data Engineering Team as a Senior Data Engineer. You will work on building tools, frameworks, and processes to ensure high-quality data.
About the role
As a world-class big data platform, Data Governance must be implemented for the multitude of datasets within Roku. There is a wide variety of data sources with a wide range of schemas available. As a result, it becomes increasingly challenging to develop and maintain automated tests and data validations on a large scale. The data-driven nature of Roku, however, requires prompt detection and resolution of data quality issues and utmost confidence in the data.
What you’ll be doing
- Building highly scalable, available, fault-tolerant distributed data processing systems (batch and streaming systems) processing over 10s of terabytes of data ingested every day and a petabyte-sized data warehouse
- Building quality data solutions and refine existing diverse datasets into simplified models encouraging self-service
- Building data pipelines that perfect data quality and are resilient to inadequate data sources
- Partner with upstream teams to build test automation for data logging
- Define scalable schema design and custom validators for both offline and online data validations
- Building an online validation framework to minimize “bad” data flowing downstream
- Support downstream teams with data quality discovery and resolution
- Build data discovery frameworks that help with an accurate lineage of data elements and metrics
- Low level systems debugging, performance measurement & optimization in large production clusters
- Maintain and support existing platforms and evolve to modern technology stacks and architectures
We’re excited if you have
- Extensive SQL skills
- Ability in at least one scripting language, Python, is needed
- Ability in at least one object-oriented language, Java is preferred
- Experience with big data technologies like HDFS, YARN, Map-Reduce, Hive, Kafka, Spark, Airflow
- Experience with AWS (Amazon Web Services), GCP (Google Cloud Platform) (Google Cloud Platform), Looker is a plus
- Experience with Data Governance tools like Alation, DataHub etc. is a plus
- Experience with data access controls, data privacy implementation and data security standards is a plus
- Experience with streaming tech stack, schema registry, data contracts, validation frameworks & tools
- 5+ years professional experience as a data or software engineer
- BS in Computer Science; MS in Computer Science preferred
Benefits
Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Our employees can take time off work for vacation and other personal reasons to balance their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.
The Roku Culture
Roku is a great place for people who want to work in a fast-paced environment where everyone is focused on the company's success rather than their own. We try to surround ourselves with people who are great at their jobs, who are easy to work with, and who keep their egos in check. We appreciate a sense of humor. We believe a fewer number of very talented folks can do more for less cost than a larger number of less talented teams. We're independent thinkers with big ideas who act boldly, move fast and accomplish extraordinary things through collaboration and trust. In short, at Roku you'll be part of a company that's changing how the world watches TV.
We have a unique culture that we are proud of. We think of ourselves primarily as problem-solvers, which itself is a two-part idea. We come up with the solution, but the solution isn't real until it is built and delivered to the customer. That penchant for action gives us a pragmatic approach to innovation, one that has served us well since 2002.
To learn more about Roku, our global footprint, and how we've grown, visit https://www.weareroku.com/factsheet.
By providing your information, you acknowledge that you have read our Applicant Privacy Notice and authorize Roku to process your data subject to those terms.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow Architecture AWS Big Data Computer Science Data governance Data pipelines Data quality Data warehouse Engineering GCP Google Cloud HDFS Java Kafka Looker Pipelines Privacy Python Security Spark SQL Streaming
Perks/benefits: Health care Wellness
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.