Principal Data Engineer (Big Data - Scala/Pyspark) – C13/VP - Chennai

IT BUILDING, RAMANUJAN IT SEZ,

Full Time Senior-level / Expert USD 117K - 218K *

Citi

Citi is a leading global bank for institutions with cross-border needs, a global provider in wealth management and a U.S. personal bank.

View all jobs at Citi

Apply now Apply later

Posted 2 weeks ago

The Role

We are looking for a hands-on Principal Data Engineer who is passionate about solving business problems through innovation and engineering practices. As a Principal Data Engineer, you will leverage your deep technical knowledge to drive the creation of high-quality software products. You will also be expected to mentor other engineers, share your technical expertise, and promote a culture of technical excellence within the team. The Principal Data Engineer will report to an Engineering Manager and will be a floating member of multiple engineering teams. There is an expectation to contribute to the codebase and deliver solutions against the sprint-level commitments.

Responsibilities

· Code contributing member of multiple Agile teams, working to deliver sprint goals.

· Demonstrating deep technical knowledge and expertise in software development, including programming languages, frameworks, and best practices. Providing guidance and mentorship to junior team members

· Actively contributes to the implementation of critical features and complex technical solutions. Write clean, efficient, and maintainable code that meets the highest standards of quality.

· Collaborate with other Principal Engineers to define and evolve the overall system architecture and design.

· Provide guidance on scalable, robust, and efficient solutions that align with business requirements and industry best practices.

· Offer expert engineering guidance and support to multiple teams, helping them overcome technical challenges, make informed decisions, and deliver high-quality software solutions. Foster a culture of technical excellence and continuous improvement.

· Stay up to date with emerging technologies, tools, and industry trends. Evaluate their potential impact on the organization and provide recommendations for technology adoption and innovation.

Required Qualifications

· 10+ years’ experience of implementing data-intensive solutions using agile methodologies.

· Proficient in one or more programming languages commonly used in data engineering such as Scala or Pyspark

· Experience with Hadoop for data storage and processing is valuable, as is exposure to modern data platforms such as Snowflake and Databricks.

· Proven experience of providing technical vision and guidance to a data team

· Experience of modelling data for analytical consumers

· Strong proficiency in working with relational databases and using SQL for data querying, transformation, and manipulation.

· Clear understanding of Data Structures and Object-Oriented Principles.

· Multiple years of experience with software engineering best practices (unit testing, automation, design patterns, peer review, etc.)

· Experience in cloud native technologies and patterns (AWS, Google Cloud)

· Multiple years of experience architecting and building horizontally scalable, highly available, highly resilient, and low latency applications

· Multiple years of experience with Cloud-native development and Container Orchestration tools (Serverless, Docker, Kubernetes, OpenShift, etc.)

· Ability to automate and streamline the build, test and deployment of data pipelines.

· Thrives in a dynamic environment, capable of managing multiple tasks simultaneously while maintaining a high standard of work.

· BA/BS degree or equivalent work experience.

Preferred Qualifications

· Familiarity with open-source data engineering tools and frameworks (e.g. Spark, Kafka, Beam, Flink, Trino, Airflow, DBT) is a valuable asset

· Exposure to a range of table and file formats including Iceberg, Hive, Avro, Parquet and JSON

· Exposure to Infrastructure as Code tools (i.e., Terraform, Cloudformation, etc.)

· Experience of driving and/or influencing the data strategy of your team or organization

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Development

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Citi is an equal opportunity and affirmative action employer.

Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Citigroup Inc. and its subsidiaries ("Citi”) invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View the "EEO is the Law" poster. View the EEO is the Law Supplement.

View the EEO Policy Statement.

View the Pay Transparency Posting

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 0 0 0

Categories: Big Data Jobs Deep Learning Jobs Engineering Jobs

Tags: Agile Airflow Architecture Avro AWS Big Data CloudFormation Databricks Data pipelines Data strategy dbt Docker Engineering Flink GCP Google Cloud Hadoop JSON Kafka Kubernetes Open Source Parquet Pipelines PySpark RDBMS Scala Snowflake Spark SQL Terraform Testing