Big Data Architect
Bengaluru, India
AiDash
AI-first SaaS company on a mission to transform operations, maintenance, and sustainability in industries with geographically distributed assets by using satellites and AI at scale.We are a Series C climate tech startup backed by leading investors (including National Grid Partners, G2 Venture Partners, Lightrock, BGV, Marubeni, among others), and by our customers-turned-advocates (Duke Energy & National Grid Partners, among others)! We have been recognized by Forbes two years in a row as one of “America’s Best Startup Employers”. We are also proud to be one of the few climate software companies in Time Magazine’s “America’s Top GreenTech Companies2024”.
Join us in creating a greener, cleaner, and safer planet from space!
The Role:
The Platform Team at AiDash is responsible for building and managing the data pipelines that ingest, clean, and transform vast amounts of satellite imagery and enterprise data. As a Big Data Architect, you will play a critical role in enhancing and scaling these pipelines and our data infrastructure. Leveraging your technical expertise, leadership, and strategic mindset, you will directly shape company-wide data strategies and drive impactful outcomes. This role involves close collaboration with the Data Science and Engineering teams to tackle complex business challenges through innovative data solutions.
How you'll make an impact:
- Ensure efficient ingestion, storage, and processing of geospatial data, including satellite imagery, LiDAR, weather data, and enterprise datasets
- Manage massive imagery, 3D, and vector data using big data platforms
- Lead organization-wide initiatives to streamline data storage and transformation
- Identify and implement tools to host and catalog satellite and sensor data
- Implement data quality control, validation, and standardization processes to maintain dataset integrity
- Collaborate with data scientists and stakeholders to align data systems with business needs and enable data-driven decisions
- Provide technical leadership, evolve system architecture, and propose new technologies to keep the tech stack scalable and reliable.
What we're looking for:
- Minimum of 12 years of overall professional experience, including at least 8 years in data engineering, with a proven track record of designing, building, and operating large-scale data systems
- Experience in building and maintaining modern data pipelines in cloud or hybrid environments
- Deep understanding of data modeling, data warehousing solutions, and data architecture strategies for both transactional and analytical systems
- Strong experience with big data technologies (e.g., Hadoop, Spark), database systems (e.g., SQL, NoSQL), geospatial toolkits and ETL tools
- Expertise in at least one programming language, such as Scala, Java, or Python
- Experience with cloud services (AWS, Azure, Google Cloud), and a solid understanding of the data pipeline tools and services available on these platforms
- Exceptional problem-solving, analytical, communication, and teamwork skills
- Leadership experience with the ability to collaborate effectively across teams to achieve company objectives
- Bachelor’s or Master’s degree in Computer Science, Engineering, Mathematics, or a related field.
Preferred Qualifications:
- Expertise in building data platforms that host petabytes of imagery data
- Strong experience with modern data cataloging and warehousing technologies (e.g., Iceberg, Spark, Athena, Sedona)
- Experience in building platforms for storing and processing large volumes of geospatial data – including satellite and LiDAR images
We are proud to be an equal-opportunity employer. We are committed to embracing diversity and inclusion in our hiring practices, and we promote a work environment where everyone, from any race, color, religion, sex, sexual orientation, gender identity, or national origin, can do their best work.
We are committed to providing an inclusive and accessible interview experience for all candidates. Please let us know if you require any accommodation during the interview process, and we will make every effort to meet your needs.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture Athena AWS Azure Big Data Computer Science Data pipelines Data quality Data Warehousing Engineering ETL GCP Google Cloud Hadoop Java Lidar Mathematics NoSQL Pipelines Python Scala Spark SQL
Perks/benefits: Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.