Data Quality Engineer (Big Data, SQL, Automation & Cloud Expertise)

Bengaluru - BCIT, India

Synechron

Synechron is an innovative global consulting firm delivering industry-leading digital solutions to transform and empower businesses.

Job Summary

Synechron is seeking a proficient and analytical Data Quality Engineer to join our data quality team. In this role, your primary focus will be validating data processes, building automated testing frameworks, and ensuring the accuracy and integrity of data pipelines and warehousing solutions. Your contributions will enable reliable data-driven decision-making across the organization, directly supporting strategic business objectives related to data quality, operational efficiency, and compliance.

Software Requirements

Required Skills:

  • Advanced proficiency in SQL query writing and optimization (versions 2016 or later preferred)
  • Experience with data warehousing and big data tools such as Hadoop, Spark, Hive, and Kafka
  • Automation framework development using Selenium, Cucumber, or similar tools
  • Scripting in Java, Shell, and optionally Python for automation purposes
  • Working knowledge of Agile methodologies and testing practices in cloud environments

Preferred Skills:

  • Exposure to cloud platforms such as AWS (Redshift, S3, Athena, EMR)
  • Familiarity with data ingestion, transformation, and orchestration tools in cloud ecosystems

Overall Responsibilities

  • Design, develop, and execute comprehensive data validation and testing plans for ETL processes, data pipelines, and data warehouses
  • Develop and maintain automated testing frameworks to improve testing efficiency and coverage
  • Collaborate with cross-functional teams to understand data flows, business requirements, and technical specifications
  • Analyze data discrepancies in detail, investigate root causes, and resolve data quality issues promptly
  • Optimize SQL queries and ETL workflows for performance and reliability
  • Document data validation scenarios, test cases, and validation results, ensuring traceability and transparency
  • Implement best practices for data validation, automation, and testing within Agile and cloud-based environments
  • Promote continuous improvement initiatives to enhance data quality standards and operational efficiency

Technical Skills (By Category)

Programming Languages:

  • Required: SQL (advanced query optimization and scripting)
  • Preferred: Java, Shell scripting, Python (for automation and testing frameworks)

Databases / Data Management:

  • Required: Experience with data warehousing solutions such as Teradata, Oracle, or Hadoop-based data lakes
  • Knowledge of data ingestion, transformation, and loading (ETL) processes

Cloud Technologies:

  • Preferred: Familiarity with AWS services such as Redshift, S3, Athena, and EMR for data processing and validation tasks

Frameworks and Libraries:

  • Experience with automation testing frameworks like Selenium, Cucumber, or similar tools

Development Tools and Methodologies:

  • Proficiency in version control systems such as Git
  • Strong understanding of Agile methodologies (Scrum/Kanban) and test management tools like JIRA and Confluence
  • Knowledge of DevOps practices around continuous integration and deployment (CI/CD)

Security Protocols:

  • Implement data security and privacy standards in testing workflows, ensuring compliance with organizational policies

Experience Requirements

  • 6 to 8 years of professional experience in data quality, validation, or related software engineering roles
  • Hands-on experience with data validation of ETL pipelines, data warehouses, and big data platforms
  • Proven success in creating automation frameworks and scripts to improve data testing efficiency
  • Experience working in Agile environments, participating in CI/CD pipelines, and cross-team collaboration
  • Alternative: Extensive experience with data analysis and validation in enterprise data environments may qualify candidates with fewer years but significant hands-on expertise

Day-to-Day Activities

  • Develop and execute automated test cases for data validation across data pipelines and warehousing systems
  • Collaborate with data engineers, developers, and business analysts to understand data flow requirements and validation needs
  • Perform root cause analysis of data quality issues and coordinate remediation actions
  • Maintain and enhance automation frameworks to ensure scalable and reusable test processes
  • Participate in daily stand-up meetings, sprint planning, and review sessions
  • Document testing activities and validation results, and contribute to process improvements
  • Provide support to team members on testing best practices and data validation techniques

Qualifications

  • Bachelor’s degree or higher in Computer Science, Information Technology, Data Science, or a related field
  • Certifications in data validation, testing, or automation frameworks (e.g., ISTQB, Cisco, AWS certifications) are a plus
  • Commitment to continuous learning through professional development, technical certifications, and awareness of industry trends

Professional Competencies

  • Analytical mindset with strong problem-solving abilities focused on data quality issues
  • Excellent communication skills to effectively interface with technical and non-technical stakeholders
  • Collaborative team player capable of working across multiple functions and geographies
  • Adaptability to evolving technologies, tools, and requirements in cloud and big data environments
  • Focus on quality, accuracy, and process improvement initiatives
  • Strong organizational skills to manage multiple validation projects and prioritize tasks effectively

SYNECHRON’S DIVERSITY & INCLUSION STATEMENT

Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture that promotes equality, diversity, and an environment that is respectful to all. We strongly believe that, as a global company, a diverse workforce helps build stronger, more successful businesses. We encourage applicants of all backgrounds, races, ethnicities, religions, ages, marital statuses, genders, sexual orientations, and disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.


All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant’s gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.

Tags: Agile Athena AWS Big Data CI/CD Computer Science Confluence Data analysis Data management Data pipelines Data quality Data Warehousing DevOps Engineering ETL Git Hadoop Java Jira Kafka Kanban Oracle Pipelines Privacy Python Redshift Scrum Security Selenium Shell scripting Spark SQL Teradata Testing

Perks/benefits: Career development Flex hours Team events Transparency

Region: Asia/Pacific
Country: India
