Apache Spark Module Lead
Kanchipuram, Tamil Nadu, India
Sopra Steria
Sopra Steria, a European Tech leader recognised for its consulting, digital services and software development, helps its clients drive their digital transformation to obtain tangible and sustainable benefits.
Company Description
About Sopra Steria
Sopra Steria, a major Tech player in Europe with 56,000 employees in nearly 30 countries, is recognized for its consulting, digital services and software development. It helps its clients drive their digital transformation and obtain tangible and sustainable benefits. The Group provides end-to-end solutions to make large companies and organizations more competitive by combining in-depth knowledge of a wide range of business sectors and innovative technologies with a fully collaborative approach. Sopra Steria places people at the heart of everything it does and is committed to putting digital to work for its clients in order to build a positive future for all. In 2023, the Group generated revenues of €5.8 billion.
The world is how we shape it.
Job Description
Data Engineer with Big Data development experience (4-5 years of experience)
Required Skills:
- Strong expertise in programming languages such as Python and Scala.
- Hands-on experience in Spark with Scala and Python (PySpark), including DataFrame core functions, Spark SQL, and Spark Streaming, and with Hive, Hadoop, Kafka, and YARN.
- Good experience with big data cloud platforms, preferably Azure.
- Databricks and coding with notebooks.
- Hands-on experience with data orchestration tools such as Airflow and Apache NiFi.
- Processing and manipulating data using SQL and Python code.
- Professional experience implementing data ingestion pipelines using Data Factory.
- Building and optimizing data pipelines, architectures, and data sets.
- Working knowledge of queueing, stream processing, and highly scalable data stores.
- User training, customer support, and coordination with cross-functional teams.
- Writing database-heavy services or APIs.
- Strong understanding of structuring code for testability.
- Should be able to perform estimations, lead the team technically, and monitor and track activities.
- Should be able to coordinate and communicate well with customers.
Good to have:
- Databricks Professional Data Engineer certification.
- Familiarity with Git workflows and CI/CD pipelines.
- Working knowledge of the Azure DevOps environment.
- Experience with Agile practices and methodologies.
- Experience with migration projects (Scala/PySpark).
- Flexibility to work on both support and development projects.
Total Experience Expected: 4-6 years
Qualifications
Education: B.E./ B.Tech./ MCA
Additional Information
At our organization, we are committed to fighting against all forms of discrimination. We foster a work environment that is inclusive and respectful of all differences.
All of our positions are open to people with disabilities.
Tags: Agile Airflow APIs Architecture Azure Big Data CI/CD Consulting Databricks Data pipelines DevOps Git Hadoop Kafka NiFi Pipelines PySpark Python Scala Spark SQL Streaming
Perks/benefits: Team events