Data Engineer
Bangalore Office, India
o9 Solutions, Inc.
Analytics, AI & knowledge-powered platform for planning & decision-making enabling true Integrated Business Planning (IBP) for global companies.
Be part of something revolutionary
At o9 Solutions, our mission is clear: be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate global enterprises’ siloed planning capabilities, helping them capture millions and, in some cases, billions of dollars in value leakage. But our impact doesn’t stop there. Businesses that plan better and faster also reduce waste, which drives better outcomes for the planet, too.
We're on the lookout for the brightest, most committed individuals to join us on our mission. Along the journey, we’ll provide you with a nurturing environment where you can be part of something truly extraordinary and make a real difference for companies and the planet.
What you will do...
- Work on the data pipelines for capturing historical snapshots of both inputs and product outputs
- Tune pipeline performance using industry best practices
- Design and develop batch orchestration that minimizes disruption to system usage
- Develop PySpark/Python code for data transformation and API data extraction in batch jobs
- Contribute to overall product architecture and make it best-in-class from a performance and scalability standpoint
- Learn new technologies as needed for product use cases
What you should have...
- 4 years of coding experience in PySpark
- Worked with different aspects of the Spark ecosystem, including Spark SQL, DataFrames, Datasets, and streaming data
- 4+ years of experience as a data engineer
- Deep understanding of and experience with PySpark, plus some experience with data lakes and Delta tables.
- Skilled in big data tools, building data pipelines, ETL design, and implementation
- Must have strong programming skills in Python. Scala is a plus.
- Should be familiar with Python libraries such as Pandas, and able to tune performance and move data with PySpark.
- Experienced in writing production-level code, optimizing data processing, identifying performance bottlenecks and their root causes, and resolving defects
- Collaborates effectively with cross-functional teams to achieve product goals
- Familiar with software development best practices (Git, CI/CD, unit testing, etc.)
More about us…
With the latest increase in our valuation from $2.7B to $3.7B despite challenging global macroeconomic conditions, o9 Solutions is one of the fastest-growing technology companies in the world today. Our mission is to digitally transform planning and decision-making for the enterprise and the planet. Our culture is high-energy and drives us to aim for 10x in everything we do.
Our platform, the o9 Digital Brain, is the premier AI-powered, cloud-native platform driving the digital transformations of major global enterprises including Google, Walmart, ABInBev, Starbucks and many others.
Our headquarters are located in Dallas, with offices in Amsterdam, Paris, London, Barcelona, Madrid, São Paulo, Bengaluru, Tokyo, Seoul, Milan, Stockholm, Sydney, Shanghai, Singapore, Munich, and Toronto.
o9 is an equal opportunity employer and seeks applicants of diverse backgrounds and hires without regard to race, colour, gender, religion, national origin, citizenship, age, sexual orientation or any other characteristic protected by law.
Tags: APIs, Architecture, Big Data, CI/CD, Data pipelines, ETL, Git, MVP, Pandas, Pipelines, PySpark, Python, Scala, Spark, SQL, Streaming, Testing