Clinical Data Engineer

Hyderabad, India

Zoetis

Zoetis is the largest global animal health company committed to nurturing the world and humankind by advancing care for animals. Learn more.

View all jobs at Zoetis

Apply now Apply later

Zoetis, Inc. is the world's largest producer of medicine and vaccinations for pets and livestock. 
Join us at Zoetis India Capability Center (ZICC) in Hyderabad, where innovation meets excellence. As part of the world's leading animal healthcare company, ZICC is at the forefront of driving transformative advancements and applying technology to solve the most complex problems. Our mission is to ensure sustainable growth and maintain a competitive edge for Zoetis globally by leveraging the exceptional talent in India.
 
At ZICC, you'll be part of a dynamic team that partners with colleagues worldwide, embodying the true spirit of One Zoetis. Together, we ensure seamless integration and collaboration, fostering an environment where your contributions can make a real impact. Be a part of our journey to pioneer innovation and drive the future of animal healthcare.
The Clinical Data Engineer, ZTD R&D is responsible for delivering IT expertise, strategy, and solutions to fulfill clinical data-oriented objectives and needs of R&D at Zoetis (VMRD).  This includes:
 
• Business Analysis: Self-directed engagement with SMEs to efficiently ascertain data traits, mappings, rules, and data product requirements, as well as data diagrams.
• Data Analysis: Connects to and analyzes data from databases, files, APIs, and other modalities to spot issues, deficiencies, and opportunities. Propose and predict remediation of issues.
• Data Development: Self-directed development (including working with and technical oversight of contract resources) of data products within the clinical data landscape (e.g., EDC, LIMS, CDMS), from various stages of prototype to production, with data products such as ETLs, integrations, analyses, limited data apps, and visualizations.
• Data Architecture: Maintaining and constantly contributing to the knowledge base of the business data landscape, both technically and from the business perspective, and leveraging this acumen in application to use cases, including harmonization and augmentation of related datasets to clinical data standards and meeting regulatory needs.
 
 
POSITION RESPONSIBILITIES 

Business Analysis
• Autonomously perform Business Analysis as needed for data-based use cases, including open interaction with SMEs to answer questions and seek feedback, early analysis investigation spikes to analyze preliminary datasets, and phase-based design planning for implementation.
• Creates accurate entity mappings for data integrations and transformations, including business rules
• Creates entity relationship diagrams to document a variety of data development cases, including process flow, entity source to target, and database diagrams
• Define scope, approach, next steps, and direction with regard to assigned project objectives, and consult with technical lead for internal and business objective alignment
• Determine and document software functional and system requirements in partnership with Solution Partner
• Provide key technical input on regulatory compliance where it mandates technical coverage, such as CFR Part 11
• Actively work with business users to define, track, and version business rules for study data such as validation rules and calculations.
Contribute to process rules and flow for data management, such as when processing steps, calculations, and publish/ingestions are applied 15%
Data Analysis
• Analyze incoming data (from raw sets, database tables, APIs, and other sources) rapidly to understand its potential, needed manipulations, and issues.
• Perform data cleanup activities without direction
• Profile datasets, detecting anomalies, needed alignments, and patterns
• Proactively determine probable rules, and confirm unknown concepts with domain technical owner
• Understands AI tools and technologies; able to leverage pre-trained models for daily tasks.
Provide L3/L4 support for production data issues including advanced query capabilities and ETL pattern knowledge/familiarity to troubleshoot datasets and ETLs, and direct contracted resources as needed to remediate issues. 25%
Data Development
• Design and develop data solutions from prototype to production, largely self-driven, with data products such as ETLs, integrations, analyses, limited data apps, and visualizations
• Develop and manage pipelines and integrations for clinical data processing, from ingestion of detailed study-related sets and up to harmonized conceptual data models
• Automate pipelines using scripting and ETL tools
• Manage and monitors pipeline execution
• Design and implement error handling capabilities
• Setup simple orchestration jobs
• Perform data migration analysis tasks, such as evaluation of current sets, mapping, and process flow diagramming
• Own and oversee as needed execution of data migration tasks as part of system deployment
• Diagnose, handle, and manage data migration issues
• Understands ACID principles for databases
• Understands and applies different approaches for data loading based on the scenario, such as wipe/reload, upsert, and CDC
• Recognizes and can approach varying latencies of data loading, from batch to near-realtime
• Contribute to clinical dataset design, including structure, process, and business rules, as well as mapping to canonical domains/variables
• Define and refine dataset versioning strategy for clinical data management
• Provide controlled terminology mapping design support and analysis
• Oversee and work with contract resources to fulfill design and execution for data products.
• Assume responsibility for the overall quality of delivered products.
• Understand and apply key Agile concepts like failing fast and minimum viable product.
• Participate in project management activities like daily stand-up meetings, sprint reviews, etc.
• Assume responsibility for own assigned tasks and reaching out for clarity,
• Create tasks as needed and delegate where appropriate
• Ensure and build compliance by design on any system components supporting a GxP system or process
• Document technical design specifications
• Interpret and diagnose existing legacy or inherited code for problems and proposed remediation
• Unit test workflows extensively to minimize rework
• Peer review with others to receive and provide advice and insight
• Regression test developed items to verify continuous coverage of existing functionality
• Determine and design appropriate test plan with compliance units to satisfy compliance requirements, especially regulatory
• Design, execute, and oversee test script execution and automation
• Define deployment/installation documentation
• Execute or oversee deployment, including installation, installation verification, and hypercare
Document products consistently 50%
Data Architecture
• Recognize key VMRD data entities and understand how they relate to others, to extend the value and linkability of data across use cases.
• Know and apply approaches and sources to acquire additional metadata for key VMRD entities
• Understands basic Master Data Management principles
• Assure adherence to basic data security approaches, including app, visualization, database, and file security
• Understand and apply role-based security approaches and inheritance
• Recognize and safely handle sensitive data such as Personal Information and Intellectual Property
• Deep understanding of the technical impacts of GxP-related systems and processes
• Build and continually enhance knowledge of both the technical and business functional landscape for VMRD
• Actively interface with Systems Engineers on app-related projects
Collaborate with colleagues to continually enhance process and knowledge 10%
 
ORGANIZATIONAL RELATIONSHIPS
 
• ZTD R&D Solution Partners
• ZTD R&D Systems Engineers 
• ZTD Centers Of Excellence
• VMRD business SMEs from multiple product lines and departments 
 
RESOURCES MANAGED   
 
Supervision
Indicate the typical number of Colleagues managed.  Include direct & indirect reports, matrix responsibility and or additional resources (i.e. contingent workers), as applicable.
0 colleagues
0-4 contingent workers’ technical direction
 
 
EDUCATION AND EXPERIENCE 

• Undergraduate degree related to information technology and/or computer science or equivalent education and work experience required.
• 5-8+ years’ experience with the design, building, and supporting of data applications (3-6+ with master’s degree)
• Experience with utilizing multiple vendors and/or departments for service and support activities.
• Excellent interpersonal and communication skills with the ability to build relationships.
• Experience in coordinating activities with multidisciplinary teams distributed in many physical locations with different time zones.
• Ability to prioritize issues and drive progress in ambiguous situations.
• Strong detail-oriented mindset with ability to deliver consistent, quality results.
• Hands-on experience with clinical EDC, LIMS, and/or CDMS systems.
 
TECHNICAL SKILLS REQUIREMENTS
Technical
• Strong experience with ETL tools, including ad hoc like Alteryx / Python and orchestrated like Informatica / DataBricks
• Strong experience with SQL query writing
• Strong experience with data analysis
• Familiarity with structured and unstructured data approaches
• Familiarity with structuring data to serve analytic needs, both for visualizations and data science use cases
• General experience with SQL Server and/or Oracle databases
• Working knowledge of T-SQL and/or PL/SQL
• Some experience with scripted languages such as Python, Powershell, and R
• Some experience with Power BI and/or Tableau
• Experience using pretrained GenAI tools for code acceleration.
• Strong experience with Microsoft Excel for basic ad hoc data purposes as well as as a data source
• Experience with software testing including unit, integration, and regression testing.
• Experience with technical writing for SDLC documentation.
 
Project Management
• Experience working within a Solution Delivery Lifecycle Management framework.
• Experience with Agile and familiarity with Waterfall development methodologies.
• Ability to self-manage targeted projects and create and delegate tasks.

Full time
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0
Category: Engineering Jobs

Tags: Agile APIs Architecture Computer Science Data analysis Databricks Data management ETL Excel Generative AI Informatica Oracle Pipelines Power BI Python R R&D SDLC Security SQL Tableau Testing T-SQL Unstructured data

Perks/benefits: Career development Team events

Region: Asia/Pacific
Country: India

More jobs like this