Databricks (Remote)
Bengaluru, KA, India
PradeepIT
PradeepIT, supported by Asia's largest tech professional network, is revolutionizing global talent acquisition. Discover the potential of hiring top Asian tech talent at ten times the speed, starting today!
Roles & responsibilities
- Develop modern data warehouse solutions using Databricks and the AWS/Azure stack
- Provide forward-thinking solutions in the data engineering and analytics space
- Collaborate with DW/BI leads to understand new ETL pipeline development requirements
- Triage issues to find gaps in existing pipelines and fix them
- Work with the business to understand reporting-layer needs and develop data models to fulfill them
- Help junior team members resolve issues and technical challenges
- Drive technical discussions with client architects and team members
- Orchestrate data pipelines via the Airflow scheduler (a minimal sketch follows this list)
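For context on the Airflow orchestration responsibility above, here is a minimal sketch of a daily ETL DAG (Airflow 2.x-style syntax; the DAG id, task names, and callables are illustrative placeholders, not details from the role):

```python
# Minimal sketch of a daily extract -> transform -> load pipeline in Airflow.
# All identifiers here are hypothetical, for illustration only.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def extract():
    print("extract raw data from the source system")  # placeholder step


def transform():
    print("apply warehouse-model transformations")  # placeholder step


def load():
    print("load curated data into warehouse tables")  # placeholder step


with DAG(
    dag_id="daily_dw_etl",            # hypothetical DAG name
    start_date=datetime(2024, 1, 1),
    schedule="@daily",                # Airflow 2.4+ argument; older versions use schedule_interval
    catchup=False,
) as dag:
    t_extract = PythonOperator(task_id="extract", python_callable=extract)
    t_transform = PythonOperator(task_id="transform", python_callable=transform)
    t_load = PythonOperator(task_id="load", python_callable=load)

    # Enforce ordering: extract, then transform, then load.
    t_extract >> t_transform >> t_load
```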
Qualification & experience
- Bachelor's and/or master's degree in computer science or equivalent experience.
- Must have 6+ years of total IT experience, including 3+ years' experience in data warehouse/ETL projects.
- Deep understanding of Star and Snowflake dimensional modeling.
- Strong knowledge of Data Management principles
- Good understanding of Databricks Data & AI platform and Databricks Delta Lake Architecture
- Should have hands-on experience in SQL, Python, and Spark (PySpark)
- Candidate must have experience in AWS/ Azure stack
- Desirable: experience with ETL across batch and streaming data (e.g., Kinesis).
- Experience in building ETL / data warehouse transformation processes
- Experience with Apache Kafka for use with streaming data / event-based data
- Experience with other open-source big data products such as Hadoop (incl. Hive, Pig, Impala)
- Experience with open-source non-relational/NoSQL data repositories (incl. MongoDB, Cassandra, Neo4j)
- Experience working with structured and unstructured data including imaging & geospatial data.
- Experience working in a DevOps environment with tools such as Terraform, CircleCI, and Git.
- Proficiency in RDBMS, complex SQL, PL/SQL, Unix shell scripting, performance tuning, and troubleshooting.
- Databricks Certified Data Engineer Associate/Professional Certification (Desirable).
- Comfortable working in a dynamic, fast-paced, innovative environment with several ongoing concurrent projects
- Should have experience working in Agile methodology.
- Strong verbal and written communication skills.
- Strong analytical and problem-solving skills with high attention to detail.
Mandatory Skills:
- Python/PySpark/Spark with Azure/AWS Databricks (a minimal sketch follows below)
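To illustrate this mandatory skill set, here is a minimal PySpark sketch that cleans raw data and writes it to a Delta Lake table on Databricks (the source path, table, and column names are hypothetical, not part of the posting):

```python
# Minimal sketch: cleanse raw JSON and persist it as a Delta table.
# Paths, tables, and columns are hypothetical, for illustration only.
from pyspark.sql import SparkSession, functions as F

# On Databricks a SparkSession named `spark` is provided; getOrCreate()
# makes the sketch self-contained elsewhere.
spark = SparkSession.builder.getOrCreate()

# Read raw events from a hypothetical landing zone.
raw = spark.read.json("/mnt/raw/orders/")

# Basic cleansing: de-duplicate on the key and normalize types.
clean = (
    raw.dropDuplicates(["order_id"])
       .withColumn("amount", F.col("amount").cast("double"))
       .withColumn("ingest_date", F.current_date())
)

# Write as a managed Delta table, partitioned for downstream reporting.
(clean.write
      .format("delta")
      .mode("append")
      .partitionBy("ingest_date")
      .saveAsTable("analytics.orders_clean"))
```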