Apache Spark Module Lead

Kanchipuram, Tamil Nadu, India

Sopra Steria

Sopra Steria, a European Tech leader recognised for its consulting, digital services and software development, helps its clients drive their digital transformation to obtain tangible and sustainable benefits.

View all jobs at Sopra Steria

Apply now Apply later

Company Description

About Sopra Steria
Sopra Steria, a major Tech player in Europe with 56,000 employees in nearly 30 countries, is recognized for its consulting, digital services and software development. It helps its clients drive their digital transformation and obtain tangible and sustainable benefits. The Group provides end-to-end solutions to make large companies and organizations more competitive by combining in-depth knowledge of a wide range of business sectors and innovative technologies with a fully collaborative approach. Sopra Steria places people at the heart of everything it does and is committed to putting digital to work for its clients in order to build a positive future for all. In 2023, the Group generated revenues of €5.8 billion.
The world is how we shape it.

Job Description

Data Engineer with Big Data development experience - (4-5 yrs. experience)

Required Skills:

  • Strong expertise of programming languages – Python/Scala/others
  • Hands-on experience in Spark with Scala and Python(PySpark) including Data frame core functions, SparkSQL, Spark Streaming and with HIVE, Hadoop, Kafka, YARN.
  • Good Experience in Big data cloud platform, preferably Azure.
  • Databricks and coding with notebooks.
  • Hands-on experience with data orchestration tools such as Airflow, Apache NiFi
  • Processing and manipulating data using SQL and Python code.
  • Professional experience implementing data ingestion pipelines using Data Factory.
  • Building and optimizing data pipelines, architectures, and data sets.
  • Working knowledge of queueing, stream processing, and highly scalable data stores
  • User training, customer support, and coordination with cross-functional teams.
  • Writing database-heavy services or APIs.
  • Strong understanding of structuring code for testability.
  •         Should be able to perform estimations, lead the team technically, Monitoring & Tracking of activities.
  •         Should be able to coordinate and communicate well with customer.

 

Good to have:

  • Professional Data Engineer Certifications in Data Bricks.
  • Familiarity with GIT workflow, CI/CD Pipelines.
  • Working Knowledge of Azure DevOps environment.
  • Experience of Agile practices & methodologies.
  • Experience on Migration projects (Scala/PySpark).
  • Flexibility to work in both support and development projects

 

Total Experience Expected: 04-06 years

Qualifications

Education: B.E./ B.Tech./ MCA

Additional Information

At our organization, we are committed to fighting against all forms of discrimination. We foster a work environment that is inclusive and respectful of all differences.

All of our positions are open to people with disabilities.

Apply now Apply later
  • Share this job via
  • 𝕏
  • or
Job stats:  0  0  0
Category: Leadership Jobs

Tags: Agile Airflow APIs Architecture Azure Big Data CI/CD Consulting Databricks Data pipelines DevOps Git Hadoop Kafka NiFi Pipelines PySpark Python Scala Spark SQL Streaming

Perks/benefits: Team events

Region: Asia/Pacific
Country: India

More jobs like this