Senior Cloud Engineer -Azure databricks

Bengaluru, KA, India

Bosch Group

Moving stories and inspiring interviews. Experience the meaning of "invented for life" by Bosch completely new. Visit our international website.

View all jobs at Bosch Group

Apply now Apply later

Job Description

As a Cloud engineer in our team, you work with large scale manufacturing data coming from our globally distributed plants. You will focus on building efficient, scalable & data-driven applications.

The data sets produced by these applications – whether data streams or data at rest – need to be highly available, reliable, consistent and quality-assured so that they can serve as input to wide range of other use cases and downstream applications.

We run these applications on a Azure databricks, you will be building applications, you will also contribute to scaling the platform including topics such as automation and observability.

Finally, you are expected to interact with customers and other technical teams e.g. for requirements clarification & definition of data models.

Primary responsibilities:                                             ·      

  • Engaging in design discussion of Data Pipelines in Azure.
  • Creating design for data pipelines and conceptualize data architecture for large-scale projects in Azure.
  • Architectural proposal and estimation for the application, technical leadership to the team
  • Define data model for data pipelines in Azure.
  • Coordination/Collaboration with central teams for tasks and standards
  • Develop data integration workflow in Azure
  • Developing pyspark applications for processing Streaming data.
  • Integrating the end-to-end Azure Databricks pipeline to take data from source systems to target system ensuring the quality and consistency of data.
  • Writing python scripts to automate manual activities.
  • Defining data quality and validation checks.
  • Configuring data processing and transformation.
  • Writing unit test cases for data pipelines.
  • Defining and implementing data quality and validation check.
  • Tuning pipeline configurations for optimal performance.
  • Participate in Peer review and PR review for the code written by team members

Qualifications

  • Bachelor’s degree in computer science, Computer Engineering, relevant technical field, or equivalent; Master’s degree preferred.
  • 6+ years’ experience in data engineering, ETL tools and working with large data sets.
  • Proven experience with cloud platform, particularly in Azure Databricks
  • Min 6 years of Experience in Design Development and integration applications using Various technologies and frameworks
  • Minimum 5 years of working experience of distributed cluster.

Additional Information

Key Competencies:

  • At least 2-3 years of Azure Databricks Cloud experience in Data Engineering
  • Experience of Delta table, ADLS, DBFS, ADF.
  • At least 6 years of experience in large scale Python software development (other object-oriented languages are also acceptable)
  • Deep level of understanding in distributed systems for data storage and processing (e.g. Kafka, pyspark, Azure Cloud)
  • Experience with Cloud based SQL Database: Azure SQL Editor
  • Excellent software engineering skills (i.e., data structures, algorithms, software design).
  • Excellent problem-solving, investigative, and troubleshooting skills
  • Experience with CI/CD tools such as Azure DevOps
  • Ability to work independently.

Soft Skills:

  • Good Communication Skills
  • Ability to coach and Guide young Data Engineers
  • Decent Level in English as Business Language
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0
Category: Engineering Jobs

Tags: Architecture Azure CI/CD Computer Science Databricks Data pipelines Data quality DevOps Distributed Systems Engineering ETL Kafka Pipelines PySpark Python SQL Streaming

Perks/benefits: Startup environment Team events

Region: Asia/Pacific
Country: India

More jobs like this