Test Automation Engineering Manager (Data Catalog)

India - Hyderabad

Full Time Mid-level / Intermediate USD 24K - 56K * ^est.

Amgen

Amgen is committed to unlocking the potential of biology for patients suffering from serious illnesses by discovering, developing, manufacturing and delivering innovative human therapeutics.

View all jobs at Amgen

Apply now Apply later

Posted 3 weeks ago

Career Category

Information Systems

Job Description

Test Automation Engineering Manager (Data Catalog)

ABOUT AMGEN

Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 45 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today.

ABOUT THE ROLE

Role Description:

We are seeking a highly skilled and experienced hands-on Test Automation Engineering Manager with a deep expertise in Data Quality (DQ), Data Integration (DIF), and Data Governance. In this role, you will design and implement automated frameworks that ensure data accuracy, metadata consistency, and compliance throughout the data pipeline, leveraging technologies like Data bricks, AWS, and cloud-native tools. You will have a major focus on Data Cataloguing and Governance, ensuring that data assets are well-documented, auditable, and secure across the enterprise.

In this role, you will be responsible for the end-to-end design and development of a test automation framework, working collaboratively with the team. As the delivery owner for test automation, your primary focus will be on building and automating comprehensive validation frameworks for data cataloging, data classification, and metadata tracking, while ensuring alignment with internal governance standards. will also work closely with data engineers, product teams, and data governance leads to enforce data quality and governance policies. Your efforts will play a key role in driving data integrity, consistency, and trust across the organization.

The role is highly technical and hands-on, with a strong focus on automation, metadata validation, and ensuring data governance practices are seamlessly integrated into development pipelines.

Roles & Responsibilities:

Data Quality & Integration Frameworks

Design and implement Data Quality (DQ) frameworks that validate schema compliance, transformations, completeness, null checks, duplicates, threshold rules, and referential integrity.

Build Data Integration Frameworks (DIF) that validate end-to-end data pipelines across ingestion, processing, storage, and consumption layers.

Automate data validations in Databricks/Spark pipelines, integrated with AWS services like S3, Glue, Athena, and Lake Formation.

Develop modular, reusable validation components using PySpark, SQL, Python, and orchestration via CI/CD pipelines.

Data Cataloging & Governance

Integrate automated validations with AWS Glue Data Catalog to ensure metadata consistency, schema versioning, and lineage tracking.

Implement checks to verify that data assets are properly cataloged, discoverable, and compliant with internal governance standards.

Validate and enforce data classification, tagging, and access controls, ensuring alignment with data governance frameworks (e.g., PII/PHI tagging, role-based access policies).

Collaborate with governance teams to automate policy enforcement and compliance checks for audit and regulatory needs.

Visualization & UI Testing

Automate validation of data visualizations in tools like Tableau, Power BI, Looker, or custom React dashboards.

Ensure charts, KPIs, filters, and dynamic views correctly reflect backend data using UI automation (Selenium with Python) and backend validation logic.

Conduct API testing (via Postman or Python test suites) to ensure accurate data delivery to visualization layers.

Technical Skills and Tools

Hands-on experience with data automation tools like Databricks and AWS is essential, as the manager will be instrumental in building and managing data pipelines.

Leverage automated testing frameworks and containerization tools to streamline processes and improve efficiency.

Experience in UI and API functional validation using tools such as Selenium with Python and Postman, ensuring comprehensive testing coverage.

Technical Leadership, Strategy & Team Collaboration

Define and drive the overall QA and testing strategy for UI and search-related components with a focus on scalability, reliability, and performance, while establishing alerting and reporting mechanisms for test failures, data anomalies, and governance violations.

Contribute to system architecture and design discussions, bringing a strong quality and testability lens early into the development lifecycle.

Lead test automation initiatives by implementing best practices and scalable frameworks, embedding test suites into CI/CD pipelines to enable automated, continuous validation of data workflows, catalog changes, and visualization updates

Mentor and guide QA engineers, fostering a collaborative, growth-oriented culture focused on continuous learning and technical excellence.

Collaborate cross-functionally with product managers, developers, and DevOps to align quality efforts with business goals and release timelines.

Conduct code reviews, test plan reviews, and pair-testing sessions to ensure team-level consistency and high-quality standards.

Good-to-Have Skills:

Experience with data governance tools such as Apache Atlas, Collibra, or Alation

Understanding of DataOps methodologies and practices

Familiarity with monitoring/observability tools such as Datadog, Prometheus, or CloudWatch

Experience building or maintaining test data generators

Contributions to internal quality dashboards or data observability systems

Awareness of metadata-driven testing approaches and lineage-based validations

Experience working with agile Testing methodologies such as Scaled Agile.

Familiarity with automated testing frameworks like Selenium, JUnit, TestNG, or PyTest.

Must-Have Skills:

Strong hands-on experience with Data Quality (DQ) framework design and automation

Expertise in PySpark, Python, and SQL for data validations

Solid understanding of ETL/ELT pipeline testing in Databricks or Apache Spark environments

Experience validating structured and semi-structured data formats (e.g., Parquet, JSON, Avro)

Deep familiarity with AWS data services: S3, Glue, Athena, Lake Formation, Data Catalog

Integration of test automation with AWS Glue Data Catalog or similar catalog tools

UI automation using Selenium with Python for dashboard and web interface validation

API testing using Postman, Python, or custom API test scripts

Hands-on testing of BI tools such as Tableau, Power BI, Looker, or custom visualization layers

CI/CD test integration with tools like Jenkins, GitHub Actions, or GitLab CI

Familiarity with containerized environments (e.g., Docker, AWS ECS/EKS)

Knowledge of data classification, access control validation, and PII/PHI tagging

Understanding of data governance standards (e.g., GDPR, HIPAA, CCPA)

Understanding Data Structures: Knowledge of various data structures and their applications.

Ability to analyze data and identify inconsistencies.

Proven hands-on experience in test automation and data automation using Databricks and AWS.

Strong knowledge of Data Integrity Frameworks (DIF) and Data Quality (DQ) principles.

Familiarity with automated testing frameworks like Selenium, JUnit, TestNG, or PyTest.

Strong understanding of data transformation techniques and logic.

Education and Professional Certifications

Bachelor’s degree in computer science and engineering preferred, other Engineering field is considered; Master’s degree and 6+ years’ experience Or Bachelor’s degree and 8+ years

Soft Skills:

Excellent analytical and troubleshooting skills.

Strong verbal and written communication skills

Ability to work effectively with global, virtual teams

High degree of initiative and self-motivation.

Ability to manage multiple priorities successfully.

Team-oriented, with a focus on achieving team goals

Strong presentation and public speaking skills.

EQUAL OPPORTUNITY STATEMENT

Amgen is an Equal Opportunity employer and will consider you without regard to your race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status.

We will ensure that individuals with disabilities are provided with reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 1 0 0

Categories: Engineering Jobs Leadership Jobs

Tags: Agile APIs Architecture Athena Avro AWS AWS Glue Biology CI/CD Classification Computer Science Databricks Data governance DataOps Data pipelines Data quality DevOps Docker ECS ELT Engineering ETL GitHub GitLab Jenkins JSON KPIs Lake Formation Looker Parquet Pipelines Power BI PySpark Python React Selenium Spark SQL Tableau Testing