Senior Data Engineer
Pune, Maharashtra, India
Etraveli Group
Etraveli Group is a world-leading tech platform for selling flights. We are here to solve the complexity by connecting millions of flights and travelers across the world.About Tripstack - We are travel tech entrepreneurs, changing the way millions of people travel. Our proprietary virtual interlining technology provides access to billions of travel itineraries by combining flights from different airline carriers that don’t traditionally work together. We take our customers from point A to B via C, at the lowest possible price. We are impacting the way people travel and provide higher margin opportunities to our partners that are some of the largest online travel agencies in the world. We pride ourselves on the performance-driven environment we have created for our teams to prosper and excel in. We come to work ready, to challenge and be challenged. We’re big enough to give our teams support but small enough that every person makes a difference. There are still plenty of challenges to champion.
The Role - We are seeking an experienced data engineer to join our Data Engineering team embedded within the Data organization at TripStack.
Responsibilities -
● Analyze and Organize Raw Data: Collect, clean, and structure raw data from diverse sources to make it suitable for further analysis and processing.
● Develop Robust Data Systems and Pipelines: Design and implement resilient data systems and pipelines to support efficient data processing, storage, and retrieval and establish data contracts with engineering teams.
● Ensure Data Meets Business Needs: Ensure that datasets are properly prepared and maintained to meet the reporting and analytics needs of the business.
● Prepare Data for Machine Learning: Collaborate with data scientists to prepare and process data for machine learning initiatives, ensuring compatibility and readiness for model training.
● Enhance Data Quality and Reliability: Implement processes and technologies to improve data quality and reliability, including data validation, cleansing, and deduplication.
● Collaborate on Analytics Data Flow Design: Work closely with data scientists and data architects to design and optimize the flow of analytics data, ensuring seamless integration and efficient data usage across the organization.
● Requirements gathering: Collaborate with cross-functional teams to gather and document business requirements for large-scale data engineering projects, ensuring clear understanding of stakeholder needs.
Desired Skills & Experience -
● Bachelor’s degree in Computer Science or equivalent
● 5+ years of experience in data engineering or similar roles
● Proficiency in Python
● Experience working with structured and unstructured data
● Experience with big data technologies eg. Spark and Kafka and Apache Druid
● Strong data modeling and SQL skills
● Experience with orchestration tools eg. Apache Airflow, DBT, Databricks
● Strong cross-functional collaboration skills
Nice to have -
● Master’s Degree in computer science, mathematics, engineering, or related discipline with 3+ years of experience
● Experience with MLOPs tools eg. MLFlow
● Airline travel industry experience is a plus
What it takes to succeed here : Ambition and dedication to make a difference and change the way people travel; Where we always play to each other strength in a high performing team reaching for our common goal. We hold ourselves to the highest expectations, and move with a sense of urgency and hold ourselves accountable and win by staying true to what we believe in.
What we offer : We offer an opportunity to work with a young, dynamic, and a growing team composed of high-caliber professionals. We value professionalism and promote a culture where individuals are encouraged to do more and be more. If you feel you share our passion for excellence, and growth, then look no further. We have an ambitious mission, and we need a world-class team to make it a reality. Upgrade to a First Class team!
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow Big Data Computer Science Databricks Data quality dbt Engineering Excel Kafka Machine Learning Mathematics MLFlow MLOps Model training Pipelines Python Spark SQL Unstructured data
Perks/benefits: Career development Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.