Data Quality Engineer (Big Data, SQL, Automation & Cloud Expertise)
Bengaluru - BCIT, India
Synechron
Synechron is an innovative global consulting firm delivering industry-leading digital solutions to transform and empower businesses.Job Summary
Synechron is seeking a proficient and analytical Data Quality Engineer specializing in Data to join our data quality team. In this role, your primary focus will be on validating data processes, building automated testing frameworks, and ensuring the accuracy and integrity of data pipelines and warehousing solutions. Your contributions will enable reliable data-driven decision-making across the organization, directly supporting strategic business objectives related to data quality, operational efficiency, and compliance.
Software Requirements
Required Skills:
- Advanced proficiency in SQL query writing and optimization (versions 2016 or later preferred)
- Experience with data warehousing and big data tools such as Hadoop, Spark, Hive, and Kafka
- Automation framework development using Selenium, Cucumber, or similar tools
- Scripting in Java, Shell, and optionally Python for automation purposes
- Working knowledge of Agile methodologies and testing practices in cloud environments
Preferred Skills:
- Exposure to cloud platforms such as AWS (Redshift, S3, Athena, EMR)
- Familiarity with data ingestion, transformation, and orchestration tools in cloud ecosystems
Overall Responsibilities
- Design, develop, and execute comprehensive data validation and testing plans for ETL processes, data pipelines, and data warehouses
- Develop and maintain automated testing frameworks to improve testing efficiency and coverage
- Collaborate with cross-functional teams to understand data flows, business requirements, and technical specifications
- Conduct detailed analysis of data discrepancies, root cause investigations, and resolve data quality issues promptly
- Optimize SQL queries and ETL workflows for performance and reliability
- Document data validation scenarios, test cases, and validation results, ensuring traceability and transparency
- Implement best practices for data validation, automation, and testing within Agile and cloud-based environments
- Promote continuous improvement initiatives to enhance data quality standards and operational efficiency
Technical Skills (By Category)
Programming Languages:
- Required: SQL (advanced query optimization and scripting)
- Preferred: Java, Shell scripting, Python (for automation and testing frameworks)
Databases / Data Management:
- Required: Experience with data warehousing solutions such as Teradata, Oracle, or Hadoop-based data lakes
- Knowledge of data ingestion, transformation, and loading (ETL) processes
Cloud Technologies:
- Preferred: Familiarity with AWS services such as Redshift, S3, Athena, and EMR for data processing and validation tasks
Frameworks and Libraries:
- Experience with automation testing frameworks like Selenium, Cucumber, or similar tools
Development Tools and Methodologies:
- Proficiency in version control systems such as Git
- Strong understanding of Agile methodologies (Scrum/Kanban) and test management tools like JIRA and Confluence
- Knowledge of DevOps practices around continuous integration and deployment (CI/CD)
Security Protocols:
- Implement data security and privacy standards in testing workflows, ensuring compliance with organizational policies
Experience Requirements
- 6 to 8 years of professional experience in data quality, validation, or related software engineering roles
- Hands-on experience with data validation of ETL pipelines, data warehouses, and big data platforms
- Proven success in creating automation frameworks and scripts to improve data testing efficiency
- Experience working in Agile environments, participating in CI/CD pipelines, and cross-team collaboration
- Alternative: Extensive experience with data analysis and validation in enterprise data environments may qualify candidates with fewer years but significant hands-on expertise
Day-to-Day Activities
- Develop and execute automated test cases for data validation across data pipelines and warehousing systems
- Collaborate with data engineers, developers, and business analysts to understand data flow requirements and validation needs
- Perform root cause analysis of data quality issues and coordinate remediation actions
- Maintain and enhance automation frameworks to ensure scalable and reusable test processes
- Participate in daily stand-up meetings, sprint planning, and review sessions
- Document testing activities, validation results, and contribute to process improvements
- Provide support to team members on testing best practices and data validation techniques
Qualifications
- Bachelor’s degree or higher in Computer Science, Information Technology, Data Science, or a related field
- Certifications in data validation, testing, or automation frameworks (e.g., ISTQB, Cisco, AWS certifications) are a plus
- Continuous learning through professional development, technical certifications, and industry trends
Professional Competencies
- Analytical mindset with strong problem-solving abilities focused on data quality issues
- Excellent communication skills to effectively interface with technical and non-technical stakeholders
- Collaborative team player capable of working across multiple functions and geographies
- Adaptability to evolving technologies, tools, and requirements in cloud and big data environments
- Focus on quality, accuracy, and process improvement initiatives
- Strong organizational skills to manage multiple validation projects and prioritize tasks effectively
SYNECHRON’S DIVERSITY & INCLUSION STATEMENT
Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and is an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture – promoting equality, diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger, successful businesses as a global company. We encourage applicants from across diverse backgrounds, race, ethnicities, religion, age, marital status, gender, sexual orientations, or disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.
All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant’s gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Athena AWS Big Data CI/CD Computer Science Confluence Data analysis Data management Data pipelines Data quality Data Warehousing DevOps Engineering ETL Git Hadoop Java Jira Kafka Kanban Oracle Pipelines Privacy Python Redshift Scrum Security Selenium Shell scripting Spark SQL Teradata Testing
Perks/benefits: Career development Flex hours Team events Transparency
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.