Senior Staff Engineer (Spark)
Hungary-Budapest
Cloudera
Cloudera delivers a hybrid data platform with secure data management and portable cloud-native data analytics.Business Area:
EngineeringSeniority Level:
Mid-Senior levelJob Description:
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
Cloudera is looking for a Senior Staff Software Engineer with strong distributed systems expertise to work on the Cloudera distribution of Apache Spark and Livy. We are looking for senior engineers with experience in large-scale, distributed systems and data processing to help build our enterprise-grade system, designed for customers running Spark on thousands of nodes and processing petabytes of data.
We are looking for a passionate individual excited about taking a product already supporting production systems at many of the biggest companies – and is looking to expand and take on even more projects to drive the next gen Data Engineering experience. You will be working with a distributed team, spread across the United States and Hungary, including multiple committers on Apache Spark.
As a Senior Staff Software Engineer, you will…
Design new features for Cloudera’s data engineering experience, and take them from prototypes to leading a team to deliver the feature in production at scale
Contribute to Apache Spark, Livy
Develop new features in Scala/Java/Python on a modern platforms
Gain expertise in distributed data processing, from SQL planners and optimizers, to data layout and table formats like Apache Parquet and Iceberg, to fault tolerance in distributed systems.
Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, but focusing on Iceberg and Spark, which you can utilize in your daily tasks
Get to work on large scale distributed systems, from 100s to 1000s of nodes, in production clusters
Debug system level deployment issues, root cause analysis, perform system test analysis and resolve failures
Work on improving internal infrastructure
Collaborate with other team members and stakeholders
We are excited, if you have…
8-10+ years professional software development.
Experience leading and delivering complex product enhancements.
We use Java/Scala/Python in projects, you should have a strong understanding of at least one of the following languages: Java, Scala, Python. And interested to learn the languages we’re using.
Experience with systems design, development.
Passionate about programming, clean coding habits, attention to detail, and focus on quality
Strong oral and written communication skills.
Strong ability to research and solve problems independently without constant supervision
(Most importantly) Open-minded, desire to learn new things and build great products.
Experience with distributed systems
You may also have…
Experience with SQL planners
Experience with using/developing Apache Spark, Livy or other related technologies.
Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling.
Solid experience with at least one cloud service (AWS, Azure, GCP, OpenShift)
Contributors to open-source projects.
What you can expect from us:
Generous PTO Policy
Support work life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Employee Resource Groups
Cloudera is an Equal Opportunity / Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.
#LI-LL1
#Remote
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: AWS Azure Distributed Systems Engineering GCP Java Open Source Parquet Python Research Scala Spark SQL
Perks/benefits: Career development Flex hours Flex vacation Wellness
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.