Data Engineer
Toronto, Ontario
Reporting to the Senior Manager of AI & Data Platform, as a Data Engineer you will be building tools and infrastructure to support efforts of the Data Products and Insights & Innovation teams, and the business as a whole.
We’re looking for a talented, curious self-starter who is driven to solve complex problems and can juggle multiple domains and stakeholders. This highly technical individual will collaborate with all levels of the Data and AI team as well as the various engineering teams to develop data solutions, scale our data infrastructure and advance Wave to the next stage in our transformation as a data-centric organization.
This role is for someone with proven experience in complicated product environments. Strong communication skills are a must to bridge the gap between technical and non-technical audiences across a spectrum of data maturity.
Here’s How You Make an Impact:
- You’re a builder. You’ll be responsible for designing, building and deploying the components of a modern data stack, including CDC ingestion (using Debezium), a centralized Hudi data lake, and a variety of batch, incremental and stream-based pipelines.
- You’ll make things better. You enjoy the challenge of helping build and manage a fault tolerant data platform that scales economically, while balancing innovation with operational stability by maintaining legacy Python ELT scripts and accelerating the transition to dbt models in Redshift.
- You’re all about collaboration and relationships. You will collaborate within a cross-functional team in planning and rolling out data infrastructure and processing pipelines that serve workloads across analytics, machine learning and GenAI services. You enjoy working with different teams across Wave and helping them to succeed by ensuring that their data, analytics, and AI insights are reliably delivered.
- You’re self-motivated and can work autonomously. We count on you to thrive in ambiguous conditions by independently identifying opportunities to optimize pipelines and improve data workflows under tight deadlines.
- You will resolve and mitigate incidents: You will respond to PagerDuty alerts and proactively implement monitoring solutions to minimize future incidents, ensuring high availability and reliability of data systems.
- You're a strong communicator. As a data practitioner, you’ll have people coming to you for technical assistance, and your outstanding ability to listen and communicate with people will reassure them as you help answer their concern.
- You love helping customers. You will assess existing systems, optimize data accessibility, and provide innovative solutions to help internal teams surface actionable insights that enhance external customer satisfaction.
You Thrive Here By Possessing the Following:
- Data Engineering Expertise: Bring 3+ years of experience in building data pipelines and managing a secure, modern data stack. This includes CDC streaming ingestion using tools like Debezium into a Hudi data lake that supports AI/ML workloads and a curated Redshift data warehouse.
- AWS Cloud Proficiency: At least 3 years of experience working with AWS cloud infrastructure, including Kafka (MSK), Spark / AWS Glue, and infrastructure as code (IaC) using Terraform.
- Strong Coding Skills: Write and review high-quality, maintainable code that enhances the reliability and scalability of our data platform. We use Python, SQL, and dbt extensively, and you should be comfortable leveraging third-party frameworks to accelerate development.
- Data Lake Development: Prior experience building data lakes on S3 using Apache Hudi with Parquet, Avro, JSON, and CSV file formats.
- Workflow Automation: Build and manage multi-stage workflows using serverless Lambdas and AWS Step Functions to automate and orchestrate data processing pipelines.
- Data Governance Knowledge: Familiarity with data governance practices, including data quality, lineage, and privacy, as well as experience using cataloging tools to enhance discoverability and compliance.
- CI/CD Best Practices: Experience developing and deploying data pipeline solutions using CI/CD best practices to ensure reliability and scalability.
- Data Integration Tools: Working knowledge of tools such as Stitch and Segment CDP for integrating diverse data sources into a cohesive ecosystem.
- Analytical and ML Tools Expertise: Knowledge and practical experience with Athena, Redshift, or Sagemaker Feature Store to support analytical and machine learning workflows is a definite bonus!
At Wave, you’re treated like the incredible human being you are.
Work From Where You Work Best: We will always have a welcoming, energizing, and world-class office (in Toronto) with a space for you. Or, if you’re more comfortable working from home, the choice is yours.We Care About Future You: You will stretch yourself and you will grow at Wave. You will also be supported on this journey with diverse learning experiences, educational allowances, mentorship, and so much more.We Support the Full You: We make a serious investment in your health & wellness. When we think about benefits we think about body, mind, & soul and we take this stuff very seriously. We Take Care of the Fundamentals: Fair compensation, all the office perks you’d want, and the various goodies you’d expect from a growing tech company. This is the obvious stuff, but we don’t want you to think we forgot!
We believe that a diverse and inclusive culture creates the best workplace. We embrace our differences, value individuality, and the broad spectrum of every Waver's skills and abilities. We challenge each other from a place of respect and pursuit of continuous growth. We trust each other and encourage everyone to bring their authentic selves to work, everyday. As Wavers, our voices matter, our opinions are met with an open mind. The best ideas win, no matter whose they are. Contributing to an inclusive culture is a part of all of our job descriptions.
We’ve been continuously recognized as one of Canada's Top Ten Most Admired Corporate Cultures and one of Canada’s Great Places to Work in categories including Technology, Millennials, Mental Health, Inclusion and Women.
Are you ready to be a Waver? Join us!
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Athena Avro AWS AWS Glue CI/CD CSV Data governance Data pipelines Data quality Data warehouse dbt ELT Engineering Generative AI JSON Kafka Machine Learning Parquet Pipelines Privacy Python Redshift SageMaker Spark SQL Step Functions Streaming Terraform
Perks/benefits: Career development Health care Startup environment Wellness
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.