Data Quality Manager
Hyderabad (Office), India
Novartis
Working together, we can reimagine medicine to improve and extend people’s lives.Job Description Summary
We are seeking a highly skilled Data Quality Manager with hands-on experience in SQL, PySpark, Databricks, Snowflake and CI/CD processes. The ideal candidate will be responsible for designing, developing, and maintaining scalable data pipelines and infrastructure to support our data analytics and business intelligence needs. You will work closely with data scientists, analysts, and other stakeholders to ensure the efficient processing and delivery of high-quality data.
Job Description
Key Responsibilities:
- Design, develop, and optimize data pipelines using PySpark to process and analyze large datasets.
- Write complex SQL queries for data extraction, transformation, and loading (ETL).
- Work with Databricks to build and maintain collaborative and scalable data solutions.
- Implement and manage CI/CD processes for data pipeline deployments to ensure seamless and efficient integration and deployment.
- Collaborate with data scientists and business analysts to understand data requirements and deliver appropriate solutions.
- Ensure data quality, integrity, and security across all data processes.
- Monitor and troubleshoot data pipelines and workflows to resolve issues promptly.
- Continuously improve data and code quality through automation and best practices.
Qualifications:
- Bachelor’s degree in Computer Science, Engineering, Information Technology, or a related field.
- Proven experience with PySpark, including developing and tuning data processing applications.
- Advanced proficiency in SQL and experience in writing complex queries and optimizing them for performance.
- Hands-on experience with Databricks, including notebooks, clusters, and integration with other data tools.
- Strong understanding of CI/CD pipelines and experience with tools such as Jenkins, GitLab CI/CD, or Azure DevOps.
- Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud) and related data services.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills, with the ability to work effectively in a team environment.
Preferred Skills:
- Knowledge of data warehousing concepts and tools (e.g., Snowflake, Redshift).
- Good to have knowledge on kedro framework.
Skills Desired
Agility, Analytical Thinking, Brand Awareness, Building Construction, Business Analytics, Cross-Functional Collaboration, Digital Marketing, Marketing Strategy, Media Campaigns, Sales, Stakeholder Engagement, Stakeholder Management, Strategic Marketing, Waterfall Model* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: AWS Azure Business Analytics Business Intelligence CI/CD Computer Science Data Analytics Databricks Data pipelines Data quality Data Warehousing DevOps Engineering ETL GCP GitLab Google Cloud Jenkins Pipelines PySpark Redshift Security Snowflake SQL
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.