Test Automation Engineering Manager (Data Catalog)
India - Hyderabad
Amgen
Amgen is committed to unlocking the potential of biology for patients suffering from serious illnesses by discovering, developing, manufacturing and delivering innovative human therapeutics.Career Category
Information SystemsJob Description
Test Automation Engineering Manager (Data Catalog)
ABOUT AMGEN
Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 45 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today.
ABOUT THE ROLE
Role Description:
We are seeking a highly skilled and experienced hands-on Test Automation Engineering Manager with a deep expertise in Data Quality (DQ), Data Integration (DIF), and Data Governance. In this role, you will design and implement automated frameworks that ensure data accuracy, metadata consistency, and compliance throughout the data pipeline, leveraging technologies like Data bricks, AWS, and cloud-native tools. You will have a major focus on Data Cataloguing and Governance, ensuring that data assets are well-documented, auditable, and secure across the enterprise.
In this role, you will be responsible for the end-to-end design and development of a test automation framework, working collaboratively with the team. As the delivery owner for test automation, your primary focus will be on building and automating comprehensive validation frameworks for data cataloging, data classification, and metadata tracking, while ensuring alignment with internal governance standards. will also work closely with data engineers, product teams, and data governance leads to enforce data quality and governance policies. Your efforts will play a key role in driving data integrity, consistency, and trust across the organization.
The role is highly technical and hands-on, with a strong focus on automation, metadata validation, and ensuring data governance practices are seamlessly integrated into development pipelines.
Roles & Responsibilities:
Data Quality & Integration Frameworks
Design and implement Data Quality (DQ) frameworks that validate schema compliance, transformations, completeness, null checks, duplicates, threshold rules, and referential integrity.
Build Data Integration Frameworks (DIF) that validate end-to-end data pipelines across ingestion, processing, storage, and consumption layers.
Automate data validations in Databricks/Spark pipelines, integrated with AWS services like S3, Glue, Athena, and Lake Formation.
Develop modular, reusable validation components using PySpark, SQL, Python, and orchestration via CI/CD pipelines.
Data Cataloging & Governance
Integrate automated validations with AWS Glue Data Catalog to ensure metadata consistency, schema versioning, and lineage tracking.
Implement checks to verify that data assets are properly cataloged, discoverable, and compliant with internal governance standards.
Validate and enforce data classification, tagging, and access controls, ensuring alignment with data governance frameworks (e.g., PII/PHI tagging, role-based access policies).
Collaborate with governance teams to automate policy enforcement and compliance checks for audit and regulatory needs.
Visualization & UI Testing
Automate validation of data visualizations in tools like Tableau, Power BI, Looker, or custom React dashboards.
Ensure charts, KPIs, filters, and dynamic views correctly reflect backend data using UI automation (Selenium with Python) and backend validation logic.
Conduct API testing (via Postman or Python test suites) to ensure accurate data delivery to visualization layers.
Technical Skills and Tools
Hands-on experience with data automation tools like Databricks and AWS is essential, as the manager will be instrumental in building and managing data pipelines.
Leverage automated testing frameworks and containerization tools to streamline processes and improve efficiency.
Experience in UI and API functional validation using tools such as Selenium with Python and Postman, ensuring comprehensive testing coverage.
Technical Leadership, Strategy & Team Collaboration
Define and drive the overall QA and testing strategy for UI and search-related components with a focus on scalability, reliability, and performance, while establishing alerting and reporting mechanisms for test failures, data anomalies, and governance violations.
Contribute to system architecture and design discussions, bringing a strong quality and testability lens early into the development lifecycle.
Lead test automation initiatives by implementing best practices and scalable frameworks, embedding test suites into CI/CD pipelines to enable automated, continuous validation of data workflows, catalog changes, and visualization updates
Mentor and guide QA engineers, fostering a collaborative, growth-oriented culture focused on continuous learning and technical excellence.
Collaborate cross-functionally with product managers, developers, and DevOps to align quality efforts with business goals and release timelines.
Conduct code reviews, test plan reviews, and pair-testing sessions to ensure team-level consistency and high-quality standards.
Good-to-Have Skills:
Experience with data governance tools such as Apache Atlas, Collibra, or Alation
Understanding of DataOps methodologies and practices
Familiarity with monitoring/observability tools such as Datadog, Prometheus, or CloudWatch
Experience building or maintaining test data generators
Contributions to internal quality dashboards or data observability systems
Awareness of metadata-driven testing approaches and lineage-based validations
Experience working with agile Testing methodologies such as Scaled Agile.
Familiarity with automated testing frameworks like Selenium, JUnit, TestNG, or PyTest.
Must-Have Skills:
Strong hands-on experience with Data Quality (DQ) framework design and automation
Expertise in PySpark, Python, and SQL for data validations
Solid understanding of ETL/ELT pipeline testing in Databricks or Apache Spark environments
Experience validating structured and semi-structured data formats (e.g., Parquet, JSON, Avro)
Deep familiarity with AWS data services: S3, Glue, Athena, Lake Formation, Data Catalog
Integration of test automation with AWS Glue Data Catalog or similar catalog tools
UI automation using Selenium with Python for dashboard and web interface validation
API testing using Postman, Python, or custom API test scripts
Hands-on testing of BI tools such as Tableau, Power BI, Looker, or custom visualization layers
CI/CD test integration with tools like Jenkins, GitHub Actions, or GitLab CI
Familiarity with containerized environments (e.g., Docker, AWS ECS/EKS)
Knowledge of data classification, access control validation, and PII/PHI tagging
Understanding of data governance standards (e.g., GDPR, HIPAA, CCPA)
Understanding Data Structures: Knowledge of various data structures and their applications.
Ability to analyze data and identify inconsistencies.
Proven hands-on experience in test automation and data automation using Databricks and AWS.
Strong knowledge of Data Integrity Frameworks (DIF) and Data Quality (DQ) principles.
Familiarity with automated testing frameworks like Selenium, JUnit, TestNG, or PyTest.
Strong understanding of data transformation techniques and logic.
Education and Professional Certifications
Bachelor’s degree in computer science and engineering preferred, other Engineering field is considered; Master’s degree and 6+ years’ experience Or Bachelor’s degree and 8+ years
Soft Skills:
Excellent analytical and troubleshooting skills.
Strong verbal and written communication skills
Ability to work effectively with global, virtual teams
High degree of initiative and self-motivation.
Ability to manage multiple priorities successfully.
Team-oriented, with a focus on achieving team goals
Strong presentation and public speaking skills.
EQUAL OPPORTUNITY STATEMENT
Amgen is an Equal Opportunity employer and will consider you without regard to your race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status.
We will ensure that individuals with disabilities are provided with reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation.
.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile APIs Architecture Athena Avro AWS AWS Glue Biology CI/CD Classification Computer Science Databricks Data governance DataOps Data pipelines Data quality DevOps Docker ECS ELT Engineering ETL GitHub GitLab Jenkins JSON KPIs Lake Formation Looker Parquet Pipelines Power BI PySpark Python React Selenium Spark SQL Tableau Testing
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.