Data Quality Engineer (Big Data)
1 NORTH WALL QUAY, Ireland
Citi
Citi is a leading global bank for institutions with cross-border needs, a global provider in wealth management and a U.S. personal bank. By joining Citi, you will become part of a global organisation whose mission is to serve as a trusted partner to our clients by responsibly providing financial services that enable growth and economic progress.
At Citi, we value engineering and foster an environment where our best engineers continue to code and grow their careers.
This Data Quality Lead role requires a highly skilled and experienced data quality leader with a strong foundation in Cloudera technologies and a proven track record of building and maintaining robust data quality frameworks. The Data Quality Lead will be responsible for designing, implementing, and overseeing the data quality strategy across the organization, ensuring data accuracy, consistency and reliability within big data and cloud-based data environments. The role is hands-on and includes data engineering activities such as developing and maintaining data pipelines.
Responsibilities:
- Partner with multiple management teams to ensure appropriate integration of functions to meet goals as well as identify and define necessary system enhancements to deploy new products and process improvements
- Resolve a variety of high-impact problems/projects through in-depth evaluation of complex business processes, system processes, and industry standards
- Provide expertise in the area and advanced knowledge of applications programming, and ensure application design adheres to the overall architecture blueprint
- Utilize advanced knowledge of system flow and develop standards for coding, testing, debugging, and implementation
- Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals
- Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions
- Serve as advisor or coach to mid-level developers and analysts, allocating work as necessary
- Appropriately assess risk when business decisions are made, demonstrating consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behaviour, conduct and business practices, and escalating, managing and reporting control issues with transparency.
Core Responsibilities:
- Design, architect, develop and maintain a comprehensive data quality framework across the big data platform.
- Define and establish automated data quality standards, policies and best practices.
- Develop and implement automated data quality checks and validation rules.
- Automate the monitoring and analysis of data quality metrics to identify and address data quality issues proactively.
- Ensure data accuracy, completeness, consistency and timeliness throughout the data lifecycle.
- Develop and maintain automated data quality dashboards and reports.
- Collaborate with data engineering teams to modernize the data platform, incorporating data quality considerations into the design and implementation of new solutions.
- Continuously evaluate and improve the automated data quality framework based on evolving business needs, industry best practices and platform modernization initiatives.
- Provide technical guidance and mentorship to team members on implementing requirements, changes, and enhancements to the existing data engineering framework.
- Foster a collaborative and high-performing team environment.
Key Skills:
- Deep understanding of data quality principles, methodologies and automation best practices.
- Strong understanding of Python and expertise in using Spark for data processing, transformations, and analysis.
- Ability to write efficient and performant SQL queries for large datasets, including techniques like partitioning, indexing and query tuning.
- In-depth knowledge of Cloudera Data Platform (CDP); hands-on experience with Hadoop, Hive, Spark, Impala, HBase and other Cloudera components.
- Knowledge of modern data architectures, cloud technologies and data lake concepts.
- Basic understanding of data visualization tools for dashboard design and creating meaningful insights.
- Ability to design, implement and maintain robust CI/CD pipelines for data engineering projects.
- Strong grounding in software engineering principles, object-oriented programming, design patterns and testing methodologies.
- Excellent communication, collaboration, and problem-solving skills to work effectively with cross-functional teams.
Qualifications:
- Significant relevant experience in data engineering roles.
- Proven experience in designing, building, and maintaining data pipelines and data warehouses.
- In-depth knowledge of Cloudera Data Platform (CDP); strong hands-on experience with Hadoop, Spark, Hive, Kafka and cloud platforms (AWS).
- Proven experience leading and mentoring data engineering teams and driving project execution.
- Deep expertise in building and maintaining large-scale, complex data quality pipelines.
- Extensive experience in system analysis and in programming of software applications.
- Experience in managing and implementing successful projects.
- Ability to adjust priorities quickly as circumstances dictate.
- Demonstrated leadership and project management skills.
- Consistently demonstrates clear and concise written and verbal communication.
Education:
- Bachelor’s degree/University degree or equivalent experience
- Master’s degree preferred
------------------------------------------------------
Job Family Group: Technology
Job Family: Applications Development
Time Type: Full time
------------------------------------------------------
Citi is an equal opportunity and affirmative action employer.
Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.