Data Platform Engineering Team Lead (Hands-On | Data Lake Focus)

Herzliya, Tel Aviv District, IL

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Full Time Senior-level / Expert USD 81K - 151K * ^est.

Upstream Security

Upstream provides a cloud-based automotive cybersecurity solution for protecting connected vehicles from cyber threats. Gain automotive-specific cyber threat intelligence to prevent and detect cyber attacks with Upstream's connected vehicle...

View all jobs at Upstream Security

Apply now Apply later

Posted 8 hours ago

Description

Modern vehicles create an enormous amount of data. Upstream turns that data into real-time cybersecurity detection & response (XDR), API security, and proactive quality detection for the world’s largest automotive and smart mobility companies. Our Data Engineering & Data Science Group leads the development of our Iceberg-based data platform, including data lake, query engine, and ML-Ops tools, serving as a solid AI-ready foundation for all our products.

At Upstream, you’ll have real impact, your work powers every security alert and analytics view we deliver. You’ll have room to innovate, pick new tools, refactor old ones, and set best practices. All within a people-first culture that values hybrid work, shared knowledge, and sustainable pace.

Technological background: Iceberg, Trino, Prefect ,GitHub Actions, Kubernetes. JupyterHub, MLflow, dbt

This role is full time and is Herzliya, Israel based.

Responsibilities

Spend about 70% of your time coding, building and tuning the data lake (Iceberg + Trino + dbt) for fast, cost-effective queries.
Lead and mentor skilled data engineers: code reviews, pairing, growth plans, and future hiring.
Own the platform roadmap including schema design, query-engine upgrades, governance and new technology choices.
Design and maintain well-partitioned tables and views so DS, DA, users and product teams can work faster and effectively.
Automate ETL data-digests and ML model training with Prefect, and own the pipelines for deployments (GitHub Actions → Kubernetes), data-quality checks, monitoring and SLOs.
Enhance and scale our ML platform, and automate the machine-learning lifecycle (MLflow, feature stores, model builds and deployment).
Stay focused on the data lake.

Requirements

8+ years of software/data engineering (strong Python preferred; JVM/Go welcome).
4+ years working with large-scale data lakes/lakehouses, including Iceberg (or similar formats). Experience with Apache Spark for heavy batch / ML pipelines is a plus.
2+ years leading engineers, while still being hands-on.
Solid SQL skills and experience with much of our stack: Trino/Presto, dbt, Iceberg, Prefect/Airflow, Kubernetes, GitHub Actions.
A builder’s mindset: measure, improve, document, then automate.
Clear communication in English and a collaborative attitude.

Upstream is an equal opportunity employer. All candidates for employment will be considered without regard to race, color, religion, sex, national origin, physical or mental disability, veteran status, or any other basis protected by applicable federal, state or local law.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 0 0 0

Categories: Engineering Jobs Leadership Jobs

Tags: Airflow APIs dbt Engineering ETL GitHub Kubernetes Machine Learning MLFlow MLOps Model training Pipelines Python Security Spark SQL