Data Engineer
Bangalore South, India
Lifesight
Lifesight's unified marketing measurement leverages marketing mix modelling, incrementality testing & causal attribution to help you make better decisions.
Lifesight is empowering decisions with Advanced Data Intelligence
Lifesight is a fast-growing SaaS company focused on helping businesses leverage data & AI to improve customer acquisition and retention. We have a team of 130 serving 300+ customers across 5 offices in the US, Singapore, India, Australia, and the UK. Our mission is to make it easy for non-technical marketers to leverage advanced, AI-powered data activation and marketing measurement tools to improve their performance and achieve their KPIs. Our product is being adopted rapidly around the world, and we need the best people on board to accelerate our growth.
We handle petabytes of data and process more than 400 TB daily to power Lifesight's attribution and measurement platforms, so building scalable, highly available, fault-tolerant big data platforms is critical to our success.
From your first day at Lifesight, you'll make a valuable - and valued - contribution. We offer you the opportunity to delight customers around the world while gaining meaningful experience across a variety of disciplines.
About The Role
Lifesight is growing rapidly and seeking a strong Data Engineer to be a key member of the Data and Business Intelligence organization with a focus on deep data engineering projects. You will be joining as one of the few initial data engineers on the data platform team in our Bengaluru office. You will have an opportunity to help define our technical strategy and data engineering team culture in India.
You will design and build data platforms and services while managing our data infrastructure in cloud environments that fuels strategic business decisions across Lifesight products.
A successful candidate will be a self-starter who drives excellence, is ready to jump into a variety of big data technologies & frameworks, and is able to coordinate and collaborate with other engineers, as well as mentor other engineers in the team.
What You’ll Be Doing
- Build highly scalable, available, fault-tolerant distributed data processing systems (batch and streaming) that process hundreds of terabytes of data ingested every day, along with a petabyte-sized data warehouse and Elasticsearch cluster
- Build quality data solutions and refine existing diverse datasets into simplified models that encourage self-service
- Build data pipelines that optimize for data quality and are resilient to poor-quality data sources
- Own the data mapping, business logic, transformations and data quality
- Low-level systems debugging, performance measurement & optimization on large production clusters
- Participate in architecture discussions, influence product roadmap, and take ownership and responsibility over new projects
- Maintain and support existing platforms and evolve to newer technology stacks and architectures
Requirements
We’re excited if you have
- Proficiency in Python and PySpark
- Deep understanding of Apache Spark, including Spark tuning, creating RDDs, and building DataFrames, plus the ability to write Java/Scala Spark jobs for data transformation and aggregation
- Experience in big data technologies like HDFS, YARN, MapReduce, Hive, Kafka, Spark, Airflow, Presto, etc.
- Experience in building distributed environments using any of Kafka, Spark, Hive, Hadoop, etc.
- Good understanding of the architecture and functioning of distributed database systems
- Experience working with various file formats like Parquet, Avro, etc., for large volumes of data
- Experience with one or more NoSQL databases
- Experience with AWS, GCP
- 5+ years of professional experience as a data or software engineer
Benefits
What’s in it for you?
As a team, we are concerned with not only the growth of the company but each other’s personal growth and well-being too. Along with our desire to utilize smart technology and innovative engineering strategies to make people’s lives easier, our team also bonds over our shared love for all kinds of tea, movies & fun-filled Friday events, while prioritizing a healthy work-life balance.
1. Working for one of the fastest-growing and most successful MarTech companies in recent times
2. Opportunity to be an early member of the core team, building a product from scratch: making tech stack choices, and driving and influencing ways to simplify building complex products.
3. Enjoy working in small teams and a non-bureaucratic environment
4. Enjoy an environment that provides high levels of empowerment and space to achieve your objectives and growth with the organization.
5. Work in a highly profitable and growing organization, with opportunities to accelerate and shape your career.
6. Great benefits - apart from competitive compensation & benefits
Tags: Airflow Architecture Avro AWS Big Data Business Intelligence Data pipelines Data quality Data warehouse Elasticsearch Engineering GCP Hadoop HDFS Java Kafka KPIs NoSQL Parquet Pipelines PySpark Python Scala Spark Streaming
Perks/benefits: Career development Competitive pay Startup environment Team events