Scala & Spark Developer - 5-8 Years - Chennai
TRIL INFO PARK, LITTLEWOOD TOWER, India
Citi
Citi is a leading global bank for institutions with cross-border needs, a global provider in wealth management and a U.S. personal bank.
Job Description
The Institutional Data Platform is a next generation Data Fabric that functions at the core of Enterprise Data Strategy to solve the evolving Data needs of Citi’s SMB. The Fabric brings together disparate data sources for the Global Markets organization and enables seamless Regulatory reporting, Client/ Surveillance/ Supervisory reporting, industry-leading Analytics and Data Science solutions. It provides enhanced Data Quality controls, Completeness and Accuracy, Reconciliations, Entitlements, Performance, management of Data Retention and Archival per regulatory guidelines. As part of this program build out, the team is setting up a robust platform to receive, host and deliver massive volumes of data, transforming the data management & delivery paths, developing high performance data pipelines with best-in-class control mechanisms and operational dashboards.
The ideal candidate will have a passion for, and extensive experience in, large-scale data handling, along with rich expertise in Big Data transformation programs from architecture through maintenance. The candidate will work closely with Enterprise Application teams, Technology partners, Enterprise Architects, Data Governance teams, Business Analysts and Quality Engineers to meet the IDP Program objectives.
Development Value
The candidate will be involved in the design and development of data ingestion pipelines, developing and leveraging reusable frameworks, technical data modeling, defining and implementing data controls, and supporting upgrades, thereby playing a key role in meeting the Enterprise Data Strategy objectives.
The Applications Development Intermediate Programmer Analyst is an intermediate level position responsible for participation in the establishment and implementation of new or revised application systems and programs in coordination with the Technology team. The overall objective of this role is to contribute to applications systems analysis and programming activities.
Responsibilities
- Design and develop reusable Frameworks for Data ingestion, extraction, report submission etc.
- Translate high-level & functional requirements, Data Modelling (dimensional, semi structured, transactional use cases) to technical design.
- Develop batch & real time data ingestion Pipelines involving a wide range of technologies like Messaging middleware, Kafka, SFTP, Spark, Hive etc.
- Develop programs to migrate Historical Data from legacy platforms to the BigData platform.
- Develop programs for real time and EOD reconciliations.
- Provide SME support for development of automated QA scripts.
- Participate in UAT/ SIT Test cycles, Release cycles, triage and resolve issues.
- Set up monitoring and management for services.
- Partner with Project Manager, BA and Business stakeholders and prioritize the Book of Work.
- Perform Code reviews and test case reviews, and ensure Functional & Non-Functional requirements are met.
- Analyse Platform & Software version upgrades, evaluate new tools and technologies for Big Data handling.
- Ensure adherence to and develop best practices supporting Citi’s Project Management Standards.
- Ensure SDLC standards are followed with artefacts to support Internal & External Audits.
- Identify problems, analyze information, and make evaluative judgements to recommend and implement solutions.
- Resolve issues by identifying and selecting solutions through the application of acquired technical experience, guided by precedents.
- Operate with a limited level of direct supervision.
- Exercise independence of judgement and autonomy.
- Act as SME to senior stakeholders and/or other team members.
- Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency.
Qualifications
- 5+ years of software development experience building large scale distributed data processing systems or large-scale Web applications.
- Experience of at least 3 years in designing & developing Big Data solutions with at least one end to end implementation.
- Strong hands-on experience in the following technical skills:
  - Apache Spark
  - Java/ Scala
  - XML/ JSON/ Parquet/ Avro/ protobuf
  - SQL
  - Spring Boot/ Microservices
  - Linux
  - Hadoop Ecosystem (HDFS, Spark, Impala, HIVE, HBASE etc.)
  - Kafka
- Exposure to Cloudera offerings like Ozone, Iceberg etc. is good to have.
- Performance analysis, troubleshooting and issue resolution.
- Experience working with Software vendor teams on open issues and resolutions.
- Strong experience with SQL, building queries, analysing, troubleshooting and improving queries.
- A history of delivering against agreed objectives.
- Ability to multi-task and work under pressure.
- Enthusiastic and proactive approach, with a willingness to learn and the ability to pick up new concepts and apply the knowledge.
- Demonstrated problem solving skills.
- Excellent analytical and process-based skills, ability to produce process flow diagrams, business modelling, and functional design.
- The candidate is expected to be dynamic and flexible, with a high energy level, as this is a demanding and rapidly changing environment.
- Intermediate level experience in an Applications Development role.
- Consistently demonstrates clear and concise written and verbal communication.
- Demonstrated problem-solving and decision-making skills.
- Ability to work under pressure and manage deadlines or unexpected changes in expectations or requirements.
Education
- Bachelor’s degree/University degree or equivalent experience
This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.
------------------------------------------------------
Job Family Group: Technology
Job Family: Applications Development
Time Type: Full time
------------------------------------------------------
Citi is an equal opportunity and affirmative action employer.
Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View the "EEO is the Law" poster. View the EEO is the Law Supplement.
View the EEO Policy Statement.
View the Pay Transparency Posting
Tags: Architecture Avro Big Data Data governance Data management Data pipelines Data quality Data strategy Hadoop HBase HDFS Java JSON Kafka Linux Microservices Parquet Pipelines Scala SDLC Spark SQL XML
Perks/benefits: Career development Flex hours Team events