Senior Cloud Engineer - Azure Databricks
Bengaluru, KA, India
Bosch Group
Job Description
As a Cloud Engineer on our team, you will work with large-scale manufacturing data coming from our globally distributed plants. You will focus on building efficient, scalable, data-driven applications.
The data sets produced by these applications, whether data streams or data at rest, need to be highly available, reliable, consistent and quality-assured so that they can serve as input to a wide range of other use cases and downstream applications.
We run these applications on Azure Databricks. In addition to building applications, you will contribute to scaling the platform, including topics such as automation and observability.
Finally, you are expected to interact with customers and other technical teams, e.g. for requirements clarification and definition of data models.
Primary responsibilities:
- Engaging in design discussions on data pipelines in Azure.
- Creating designs for data pipelines and conceptualizing data architecture for large-scale projects in Azure.
- Preparing architectural proposals and estimates for the application, and providing technical leadership to the team.
- Defining data models for data pipelines in Azure.
- Coordinating and collaborating with central teams on tasks and standards.
- Developing data integration workflows in Azure.
- Developing PySpark applications for processing streaming data.
- Integrating the end-to-end Azure Databricks pipeline to take data from source systems to target systems, ensuring the quality and consistency of data.
- Writing Python scripts to automate manual activities.
- Defining and implementing data quality and validation checks.
- Configuring data processing and transformation.
- Writing unit test cases for data pipelines.
- Tuning pipeline configurations for optimal performance.
- Participating in peer reviews and PR reviews of code written by team members.
Qualifications
- Bachelor’s degree in Computer Science, Computer Engineering, a relevant technical field, or equivalent; Master’s degree preferred.
- 6+ years of experience in data engineering, ETL tools, and working with large data sets.
- Proven experience with cloud platforms, particularly Azure Databricks.
- Minimum 6 years of experience in the design, development, and integration of applications using various technologies and frameworks.
- Minimum 5 years of working experience with distributed clusters.
Additional Information
Key Competencies:
- At least 2-3 years of Azure Databricks experience in data engineering.
- Experience with Delta tables, ADLS, DBFS, and ADF.
- At least 6 years of experience in large-scale Python software development (other object-oriented languages are also acceptable).
- Deep understanding of distributed systems for data storage and processing (e.g. Kafka, PySpark, Azure Cloud).
- Experience with cloud-based SQL databases: Azure SQL Editor.
- Excellent software engineering skills (i.e., data structures, algorithms, software design).
- Excellent problem-solving, investigative, and troubleshooting skills.
- Experience with CI/CD tools such as Azure DevOps.
- Ability to work independently.
Soft Skills:
- Good communication skills.
- Ability to coach and guide young data engineers.
- Good command of English as the business language.
Tags: Architecture, Azure, CI/CD, Computer Science, Databricks, Data pipelines, Data quality, DevOps, Distributed Systems, Engineering, ETL, Kafka, Pipelines, PySpark, Python, SQL, Streaming
Perks/benefits: Startup environment, Team events