Data Engineer

Berlin Office

Apply now Apply later

sensmore automates the world's largest machines with unprecedented intelligence. Our proprietary Physical AI enables heavy machines such as wheel loaders to instantly adapt to dynamic environments and execute new tasks without prior training.

We integrate cutting-edge robotics into a platform powering intelligence and automation products - transforming productivity and safety for customers in mining, construction, and adjacent industries today.

Join us and play a pivotal role in transforming the automation landscape in heavy industries.

Role Overview:

As our Data Engineer, you will design, build, and maintain the data infrastructure that powers Sensmore’s embodied AI and Vision-Language-Action Models (VLAMs). You’ll collaborate with Robotics, ML and Software engineers to ensure clean, reliable data flows from our sensor arrays (radar, LiDAR, cameras, IMUs) into training and inference pipelines. This role blends classic data engineering (ETL/ELT, warehouse design, monitoring) with ML Ops best practices: model versioning, data drift detection, and automated retraining.

Key Responsibilities:

  • Build & operate data pipelines: Ingest, process, and transform multi-sensor telemetry (radar point-clouds, video frames, log streams) into analytics-ready and ML-ready formats.

  • Design scalable storage: Architect high-throughput, low-latency data lakes and warehouses (e.g., S3, Delta Lake, Redshift/Snowflake).

  • Enable ML Ops workflows: Integrate DVC or MLflow, automate model training/retraining triggers, track data/model lineage.

  • Ensure data quality: Implement validation, monitoring, and alerting to catch anomalies and schema changes early.

  • Collaborate cross-functionally: Partner with Embedded Systems, Robotics, and Software teams to align on data schemas, APIs, and real-time requirements.

  • Optimize performance: Tune distributed processing, queries, and storage layouts for cost-efficiency and throughput.

  • Document & evangelize: Maintain clear documentation for data schemas, pipeline architectures, and ML Ops practices to uplift the whole team.

Required Qualifications:

  • 3+ years of hands-on experience building production data pipelines in the cloud (AWS, GCP, or Azure).

  • Proficiency in Python, SQL, and at least one big-data framework.

  • Familiarity with ML Ops tooling: DVC, MLflow, Kubeflow, or similar.

  • Experience designing and operating data warehouses/data lakes (e.g., Redshift, Snowflake, BigQuery, Delta Lake).

  • Strong understanding of distributed systems, data serialization (Parquet, Avro), and batch vs. streaming paradigms.

  • Excellent problem-solving skills and the ability to work in ambiguous, fast-paced environments.

Preferred Skills:

  • Background in robotics or sensor data (radar, LiDAR, camera pipelines).

  • Knowledge of real-time data processing and edge-computing constraints.

  • Experience with infrastructure as code (Terraform, CloudFormation) and CI/CD for data workflows.

  • Familiarity with Kubernetes and containerized deployments.

  • Exposure to vision-language or action-planning ML models.

What We Offer:

  • Build physical AI for the world's largest off-highway machinery – making them intelligent, safe, and ready for every tough task

  • Join the pioneer in intelligent robotics backed by Point Nine & other Tier 1 investors

  • Combine cutting-edge robotics research in end-to-end learning & Vision Language Action Model with real-world heavy mobile equipment

  • Tailor your own career path, whether you like to become technical specialist or technical team lead

  • Experience a great team culture, beverages, and an amazing office environment


Benefits:

  • Attractive compensation package and stock options.

  • Beverages on-site and regular social events.

  • Engage with top-tier researchers, engineers, and thought leaders.

  • Influence the future of robotic technologies and tackle significant technological challenges.

  • Assistance with relocation to Berlin.


About Us:

Heavy machinery, light years ahead.

sensmore automates the world's largest machines with unprecedented intelligence. Our proprietary Physical AI enables heavy machines such as wheel loaders to instantly adapt to dynamic environments and execute new tasks without prior training.

We integrate cutting-edge robotics into a platform powering intelligence and automation products - transforming productivity and safety for customers in mining, construction, and adjacent industries today.

We are proudly backed by Point Nine and other Tier 1 investors.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  1  0
Category: Engineering Jobs

Tags: APIs Architecture Avro AWS Azure BigQuery CI/CD CloudFormation Data pipelines Data quality Distributed Systems ELT Engineering ETL GCP Kubeflow Kubernetes Lidar Machine Learning MLFlow ML models Model training Parquet Pipelines Python Radar Redshift Research Robotics Snowflake SQL Streaming Terraform

Perks/benefits: Career development Equity / stock options Relocation support Team events

Region: Europe
Country: Germany

More jobs like this