Senior Data Engineer
IN-KA-BANGALORE-NEON BUILDING WEST TOWER
Applications have closed
Baker Hughes
Baker Hughes | We take energy forward - making it safer, cleaner, and more efficient for people and the planet.Senior Data Engineer
Would you like to shape the future of energy technology using data?
Would you like to shape our strategy for Data and Analytics?
Join our Industrial Solutions Team
Our Digital Solutions business provides intelligent, connected technologies to monitor and control our energy extraction assets. We provide customers with the peace of mind needed to reliably and efficiently improve their operations. Our team oversees the operational excellence and performance of our Data and Analytics platform and EcoSystem.
Partner with the best
As a part of Data Engineering Team you’ll help to solve our customers' toughest challenges, making flights safer, power cheaper, and oil & gas production safer for people and the environment by leveraging data and analytics. Senior Data Engineer will work with the team to create state-of-the-art data and analytics driven solutions, working across Baker Hughes to drive business analytics to a new level of predictive analytics while leveraging big data cutting edge tools and technologies.
As a Lead Data Engineer, you will be responsible for:
- Collaborating with Technical Product Managers and Architects to define the scope, capabilities and roadmap of Data Applications.
- Delivering End-to-end ownership of business features , meeting functional & non-functional requirements.
- Owning the design and development of ETL data pipelines, orchestration and process maturity for data lakehouse using medallion architecture.
- Researching and evaluating new open-source technologies and framework to solve problems & improve existing solutions.
- Integrating domain data knowledge into development of data requirements.
- Articulating and documenting best practices and design principles to mentor other developers.
- Looking across multiple systems, understands the purpose of each system and defines data requirements by systems.
- Identifying downstream implications of data loads/migration (e.g., data quality, regulatory, etc).
Fuel your passion
- Have a Bachelor's Degree in Computer Science or “STEM” Majors (Science, Technology,
Engineering and Math). A minimum 5+ years of professional experience
- Must have hands-on expertise in designing cost effective Big data ETL pipelines and orchestration, preferred streaming or near real time data flow.
- Have Proficiency in Lakehouse architecture and distributed data processing aka Big data engineering skills.
- Have hands-on project experience on Databricks development, troubleshooting and optimization using Pyspark.
- Have Good understanding of Apache Spark and Delta lake core concepts.
- Have Working knowledge of SQL/NoSql data model design.
- Have Working knowledge of Git repo, Git Branching strategy and CICD.
- Have Understanding of Big Data technologies core concepts – Airflow/Oozie, Spark, BigQuery, Hive, NoSql, Object Storage, etc.
- Have Ability to leverage data assets to respond to complex questions that require timely answers.
- Able to conduct exploratory data analysis and generates visual summaries of data. Identifies data quality issues.
- Can Actively identifies needs for novel methods/tools and works with team to invent as necessary. Validates new tools and methodologies.
Work in a way that works for you
We recognize that everyone is different and that the way in which people want to work and deliver at their best is different for everyone too. In this role, we can offer the following flexible working patterns:
- Working flexible hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive
Working with us
Our people are at the heart of what we do at Baker Hughes. We know we are better when all of our people are developed, engaged and able to bring their whole authentic selves to work. We invest in the health and well-being of our workforce, train and reward talent and develop leaders at all levels to bring out the best in each other.
Working for you
Our inventions have revolutionized energy for over a century. But to keep going forward tomorrow, we know we have to push the boundaries today. We prioritize rewarding those who embrace change with a package that reflects how much we value their input. Join us, and you can expect:
- Contemporary work-life balance policies and wellbeing activities
- Comprehensive private medical care options
- Safety net of life insurance and disability programs
- Tailored financial programs
- Additional elected or voluntary benefits
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow Architecture Big Data BigQuery Business Analytics Computer Science Data analysis Databricks Data pipelines Data quality EDA Engineering ETL Git Industrial Mathematics Model design NoSQL Oozie Open Source Pipelines PySpark Spark SQL STEM Streaming
Perks/benefits: Flex hours Health care Insurance
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.