Principal Data Engineer
Pune, India
Pattern
Pattern is the world’s leading ecommerce accelerator. Our ecommerce solutions accelerate brands on marketplaces, D2C, and other digital channels.
Job Description:
Job Summary:
Lead the design, development, and optimization of enterprise-scale data pipelines and architectures to drive data-driven decision-making. Architect efficient data tables and schemas to enhance query performance, and ensure seamless integration with cross-functional teams.
Key Responsibilities:
Data Pipeline
Design and implement high-performance data pipelines using ETL processes, batch/streaming frameworks (e.g., Apache Spark, Airflow), and cloud platforms (AWS).
Optimize data ingestion, transformation, and storage workflows to meet scalability, reliability, and latency requirements.
Data Architecture & Query Efficiency
Architect database schemas and dimensional models (e.g., star/snowflake) to maximize query performance and reduce latency.
Implement indexing strategies, partitioning, and materialized views to optimize data retrieval.
Collaboration & Governance
Partner with data scientists, product managers, and engineers to align data infrastructure with business needs.
Establish data governance frameworks, ensuring compliance with security, privacy, and quality standards.
Mentor junior engineers and foster best practices in data engineering.
Innovation & Leadership
Stay ahead of industry trends (e.g., AI/ML integration, real-time analytics) and advocate for emerging technologies.
Lead technical strategy for data infrastructure, balancing innovation with operational stability.
Technical Expertise:
8+ years of experience in data engineering, with a focus on ETL, data modeling, and cloud-based architectures.
Proficiency in SQL, Python, and tools like Spark and Snowflake.
Strong knowledge of database design, dimensional modeling, and query optimization.
Leadership & Collaboration
Proven ability to lead teams, mentor engineers, and drive technical initiatives.
Experience translating business requirements into scalable technical solutions.
Preferred Qualifications:
Expertise in machine learning model deployment or real-time analytics.
Familiarity with DevOps practices (CI/CD, Docker, Kubernetes).
Our Core Values
- Data Fanatics: Our edge is always found in the data
- Partner Obsessed: We are obsessed with partner success
- Team of Doers: We have a bias for action
- Game Changers: We encourage innovation
Pattern is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.