Principal Snowflake Data Engineer & Data Engineering Lead
IN Hyderabad, India
Health Catalyst
Health Catalyst is a leading provider of data and analytics technology and services to healthcare organizations, committed to being the catalyst for massive, measurable, data-informed healthcare improvement.Join one of the nation’s leading and most impactful health care performance improvement companies. Over the years, Health Catalyst has achieved and documented clinical, operational, and financial improvements for many of the nation’s leading healthcare organizations. We are also increasingly serving international markets. Our mission is to be the catalyst for massive, measurable, data-informed healthcare improvement through:
Data: integrate data in a flexible, open & scalable platform to power healthcare’s digital transformation
Analytics: deliver analytic applications & services that generate insight on how to measurably improve
Expertise: provide clinical, financial & operational experts who enable & accelerate improvement
Engagement: attract, develop and retain world-class team members by being a best place to work
Job Title: Principal Snowflake Data Engineer & Data Engineering Lead
Experience: 8–10 Years
Employment Type: Full-Time
About the Role:
We are seeking a Principal Snowflake Data Engineer with 8–10 years of experience in data
engineering, including deep specialization in the Snowflake Data Cloud, and a proven track
record of technical leadership and team management.
This role goes beyond individual contribution—you will also lead and mentor cross-functional
teams across data synchronization, Data Operations, and ETL domains, driving best
practices and architectural direction while ensuring the delivery of scalable, efficient, and
secure data solutions across the organization.
Key Responsibilities
Technical Leadership
• Own the architectural vision and implementation strategy for Snowflake-based data
platforms.
• Lead the design, optimization, and maintenance of ELT pipelines and data lake
integrations with Snowflake.
• Drive Snowflake performance tuning, warehouse sizing, clustering design, and cost
governance.
• Leverage Snowflake-native features like Streams, Tasks, Time Travel, Snowpipe, and
Materialized Views for real-time and batch workloads.
• Establish robust data governance, security policies (RBAC, data masking, row-level
access), and regulatory compliance within Snowflake.
• Ensure best practices in schema design, data modeling, and version-controlled pipeline
development using tools like dbt, Airflow, and Git.
Team & People Management
• Lead and mentor the data synchronization, Data Operations, and ETL engineering
teams—ensuring alignment with business and data strategies.
• Drive sprint planning, project prioritization, and performance management within the
team.
• Foster a culture of accountability, technical excellence, collaboration, and continuous
learning.
• Partner with product managers, business analysts, and senior leadership to translate
business requirements into technical roadmaps.
Operational Excellence
• Oversee end-to-end data ingestion and transformation pipelines using Spark, AWS Glue,
and other cloud-native tools.
• Implement CI/CD pipelines and observability for data operations.
• Establish data quality monitoring, lineage tracking, and system reliability processes.
• Champion automation and Infrastructure-as-Code practices across the Snowflake and
data engineering stack.
Required Skills
• 8–10 years of data engineering experience with at least 4–5 years of hands-on
Snowflake expertise.
• Proven leadership of cross-functional data teams (ETL, Data Operations, data
synchronization).
• Deep expertise in:
o Snowflake internals (clustering, caching, performance tuning)
o Streams, Tasks, Snowpipe, Materialized Views, UDFs
o Data governance (RBAC, secure views, masking policies)
• Strong SQL and data modeling (dimensional & normalized)
• Hands-on experience with:
o Apache Spark, PySpark, AWS Glue
o Orchestration frameworks (Airflow, dbt, Dagster, or AWS Step Functions)
o CI/CD and Git-based workflows
• Strong understanding of data lakes, especially Delta Lake on S3 or similar
Nice to Have
• Snowflake Certifications (SnowPro Advanced Architect preferred)
• Experience with Data Operations tools (e.g., Datadog, CloudWatch, Prometheus)
• Familiarity with Terraform, CloudFormation, and serverless technologies (AWS Lambda,
Docker)
• Exposure to Databricks and distributed compute environments
Why Join Us?
• Lead and shape the future of data architecture and engineering in a high-impact, cloudnative environment.
• Be the go-to Snowflake expert and technical mentor across the company.
• Enjoy the opportunity to manage teams, drive innovation, and influence strategy at
scale.
• Flexible remote work options, high autonomy, and strong support for career
development.
The above statements describe the general nature and level of work being performed in this job function. They are not intended to be an exhaustive list of all duties, and indeed additional responsibilities may be assigned by Health Catalyst.
Studies show that candidates from underrepresented groups are less likely to apply for roles if they don’t have 100% of the qualifications shown in the job posting. While each of our roles have core requirements, please thoughtfully consider your skills and experience and decide if you are interested in the position. If you feel you may be a good fit for the role, even if you don’t meet all of the qualifications, we hope you will apply. If you feel you are lacking the core requirements for this position, we encourage you to continue exploring our careers page for other roles for which you may be a better fit.
At Health Catalyst, we appreciate the opportunity to benefit from the diverse backgrounds and experiences of others. Because of our deep commitment to respect every individual, Health Catalyst is an equal opportunity employer.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow Architecture AWS AWS Glue CI/CD CloudFormation Clustering Dagster Databricks Data governance DataOps Data quality dbt Docker ELT Engineering ETL Git Lambda Pipelines PySpark Security Snowflake Spark SQL Step Functions Terraform
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.