Test Automation Engineering Manager (Data Catalog)

India - Hyderabad

Amgen

Amgen is committed to unlocking the potential of biology for patients suffering from serious illnesses by discovering, developing, manufacturing and delivering innovative human therapeutics.

View all jobs at Amgen

Apply now Apply later

Career Category

Information Systems

Job Description

Test Automation Engineering Manager (Data Catalog) 

ABOUT AMGEN 

Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 45 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today. 

ABOUT THE ROLE 

Role Description: 

We are seeking a highly skilled and experienced  hands-on Test Automation Engineering Manager with a deep expertise in Data Quality (DQ), Data Integration (DIF), and Data Governance. In this role, you will design and implement automated frameworks that ensure data accuracy, metadata consistency, and compliance throughout the data pipeline, leveraging technologies like Data bricks, AWS, and cloud-native tools. You will have a major focus on Data Cataloguing and Governance, ensuring that data assets are well-documented, auditable, and secure across the enterprise. 

In this role, you will be responsible for the end-to-end design and development of a test automation framework, working collaboratively with the team. As the delivery owner for test automation, your primary focus will be on building and automating comprehensive validation frameworks for data cataloging, data classification, and metadata tracking, while ensuring alignment with internal governance standards. will also work closely with data engineers, product teams, and data governance leads to enforce data quality and governance policies. Your efforts will play a key role in driving data integrity, consistency, and trust across the organization. 

 

The role is highly technical and hands-on, with a strong focus on automation, metadata validation, and ensuring data governance practices are seamlessly integrated into development pipelines

Roles & Responsibilities:  

Data Quality & Integration Frameworks 

  • Design and implement Data Quality (DQ) frameworks that validate schema compliance, transformations, completeness, null checks, duplicates, threshold rules, and referential integrity. 

  • Build Data Integration Frameworks (DIF) that validate end-to-end data pipelines across ingestion, processing, storage, and consumption layers. 

  • Automate data validations in Databricks/Spark pipelines, integrated with AWS services like S3, Glue, Athena, and Lake Formation. 

  • Develop modular, reusable validation components using PySpark, SQL, Python, and orchestration via CI/CD pipelines. 

Data Cataloging & Governance 

  • Integrate automated validations with AWS Glue Data Catalog to ensure metadata consistency, schema versioning, and lineage tracking. 

  • Implement checks to verify that data assets are properly cataloged, discoverable, and compliant with internal governance standards. 

  • Validate and enforce data classification, tagging, and access controls, ensuring alignment with data governance frameworks (e.g., PII/PHI tagging, role-based access policies). 

  • Collaborate with governance teams to automate policy enforcement and compliance checks for audit and regulatory needs. 

Visualization & UI Testing 

  • Automate validation of data visualizations in tools like Tableau, Power BI, Looker, or custom React dashboards. 

  • Ensure charts, KPIs, filters, and dynamic views correctly reflect backend data using UI automation (Selenium with Python) and backend validation logic. 

  • Conduct API testing (via Postman or Python test suites) to ensure accurate data delivery to visualization layers. 

Technical Skills and Tools 

  • Hands-on experience with data automation tools like Databricks and AWS is essential, as the manager will be instrumental in building and managing data pipelines. 

  • Leverage automated testing frameworks and containerization tools to streamline processes and improve efficiency. 

  • Experience in UI and API functional validation using tools such as Selenium with Python and Postman, ensuring comprehensive testing coverage. 

Technical Leadership, Strategy & Team Collaboration 

  • Define and drive the overall QA and testing strategy for UI and search-related components with a focus on scalability, reliability, and performance, while establishing alerting and reporting mechanisms for test failures, data anomalies, and governance violations. 

  • Contribute to system architecture and design discussions, bringing a strong quality and testability lens early into the development lifecycle. 

  • Lead test automation initiatives by implementing best practices and scalable frameworks, embedding test suites into CI/CD pipelines to enable automated, continuous validation of data workflows, catalog changes, and visualization updates 

  • Mentor and guide QA engineers, fostering a collaborative, growth-oriented culture focused on continuous learning and technical excellence. 

  • Collaborate cross-functionally with product managers, developers, and DevOps to align quality efforts with business goals and release timelines. 

  • Conduct code reviews, test plan reviews, and pair-testing sessions to ensure team-level consistency and high-quality standards. 

 

Good-to-Have Skills: 

  • Experience with data governance tools such as Apache Atlas, Collibra, or Alation 

  • Understanding of DataOps methodologies and practices 

  • Familiarity with monitoring/observability tools such as Datadog, Prometheus, or  CloudWatch 

  •  Experience building or maintaining test data generators 

  •  Contributions to internal quality dashboards or data observability systems 

  •  Awareness of metadata-driven testing approaches and lineage-based validations 

  • Experience working with agile Testing methodologies such as Scaled Agile. 

  • Familiarity with automated testing frameworks like Selenium, JUnit, TestNG, or PyTest. 

 

 

Must-Have Skills: 

  • Strong hands-on experience with Data Quality (DQ) framework design and automation 

  • Expertise in PySpark, Python, and SQL for data validations 

  • Solid understanding of ETL/ELT pipeline testing in Databricks or Apache Spark environments 

  • Experience validating structured and semi-structured data formats (e.g., Parquet, JSON, Avro) 

  • Deep familiarity with AWS data services: S3, Glue, Athena, Lake Formation, Data Catalog 

  • Integration of test automation with AWS Glue Data Catalog or similar catalog tools 

  • UI automation using Selenium with Python for dashboard and web interface validation 

  • API testing using Postman, Python, or custom API test scripts 

  • Hands-on testing of BI tools such as Tableau, Power BI, Looker, or custom visualization layers 

  • CI/CD test integration with tools like Jenkins, GitHub Actions, or GitLab CI 

  • Familiarity with containerized environments (e.g., Docker, AWS ECS/EKS) 

  • Knowledge of data classification, access control validation, and PII/PHI tagging 

  • Understanding of data governance standards (e.g., GDPR, HIPAA, CCPA) 

  • Understanding Data Structures: Knowledge of various data structures and their applications. 

  • Ability to analyze data and identify inconsistencies. 

  • Proven hands-on experience in test automation and data automation using Databricks and AWS. 

  • Strong knowledge of Data Integrity Frameworks (DIF) and Data Quality (DQ) principles. 

  • Familiarity with automated testing frameworks like Selenium, JUnit, TestNG, or PyTest. 

  • Strong understanding of data transformation techniques and logic. 

 

Education and Professional Certifications 

  • Bachelor’s degree in computer science and engineering preferred, other Engineering field is considered; Master’s degree and 6+ years’ experience Or Bachelor’s degree and 8+ years  

Soft Skills: 

  • Excellent analytical and troubleshooting skills. 

  • Strong verbal and written communication skills 

  • Ability to work effectively with global, virtual teams 

  • High degree of initiative and self-motivation. 

  • Ability to manage multiple priorities successfully.  

  • Team-oriented, with a focus on achieving team goals 

  • Strong presentation and public speaking skills. 

 

EQUAL OPPORTUNITY STATEMENT 

Amgen is an Equal Opportunity employer and will consider you without regard to your race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status. 

We will ensure that individuals with disabilities are provided with reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation. 

 

 

 

.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0

Tags: Agile APIs Architecture Athena Avro AWS AWS Glue Biology CI/CD Classification Computer Science Databricks Data governance DataOps Data pipelines Data quality DevOps Docker ECS ELT Engineering ETL GitHub GitLab Jenkins JSON KPIs Lake Formation Looker Parquet Pipelines Power BI PySpark Python React Selenium Spark SQL Tableau Testing

Perks/benefits: Career development

Region: Asia/Pacific
Country: India

More jobs like this