Senior Data Engineer

Bengaluru, KA, India

Blend360

Blend360 co-creates value with leading companies through the integration of data, advanced analytics, technology & people. Get in touch with us today.

View all jobs at Blend360

Apply now Apply later

Company Description

Blend is a premier AI services provider, committed to co-creating meaningful impact for its clients through the power of data science, AI, technology, and people. With a mission to fuel bold visions, Blend tackles significant challenges by seamlessly aligning human expertise with artificial intelligence. The company is dedicated to unlocking value and fostering innovation for its clients by harnessing world-class people and data-driven strategy. We believe that the power of people and AI can have a meaningful impact on your world, creating more fulfilling work and projects for our people and clients. For more information, visitĀ www.blend360.com

Job Description

You will be a key member of our Data Engineering team, focused on designing, developing, and maintaining robust data solutions on on-prem environments. You will work closely with internal teams and client stakeholders to build and optimize data pipelines and analytical tools using Python, Scala, SQL, Spark and Hadoop ecosystem technologies. This role requires deep hands-on experience with big data technologies in traditional data centre environments (non-cloud).Ā 

What you’ll be doingĀ 

  • Design, build, and maintain on-prem data pipelines to ingest, process, and transform large volumes of data from multiple sources into data warehouses and data lakesĀ 

  • Develop and optimize Scala-Spark and SQL jobs for high-performance batch and real-time data processingĀ 

  • Ensure the scalability, reliability, and performance of data infrastructure in an on-prem setupĀ 

  • Collaborate with data scientists, analysts, and business teams to translate their data requirements into technical solutionsĀ 

  • Troubleshoot and resolve issues in data pipelines and data processing workflowsĀ 

  • Monitor, tune, and improve Hadoop clusters and data jobs for cost and resource efficiencyĀ 

  • Stay current with on-prem big data technology trends and suggest enhancements to improve data engineering capabilitiesĀ 

Qualifications

  • Bachelor's degree in software engineering, or a related fieldĀ 

  • 5+ years of experience in data engineering or a related domainĀ 

  • Strong programming skills in Python & ScalaĀ 

  • Expertise in SQL with a solid understanding of data warehousing conceptsĀ 

  • Hands-on experience with Hadoop ecosystem components (e.g., HDFS, Hive, Apache Hudi, Iceberg and Delta Lake)Ā 

  • Proven ability to design and manage data solutions in on-prem environments (no cloud dependency)Ā 

  • 3rd party data integrations from different sources (including APIs)Ā 

  • Proficiency in Airflow or similar orchestration toolĀ 

  • Strong problem-solving skills with an ability to work independently and collaborativelyĀ 

  • Excellent communication skills and ability to engage with technical and non-technical stakeholdersĀ 

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index šŸ’°

Job stats:  0  0  0
Category: Engineering Jobs

Tags: Airflow APIs Big Data Data pipelines Data Warehousing Engineering Hadoop HDFS Pipelines Python Scala Spark SQL

Region: Asia/Pacific
Country: India

More jobs like this