Data Engineer in ML

Warszawa, Poland

Tooploox

Discover how to build an AI software product with Tooploox Sp. z o.o., your expert partner in turning innovative ideas into successful digital solutions.

Hi there!

We are Tooploox, an AI software development company offering custom AI solutions and services. We help innovative companies and startups design and build digital products with generative AI, mobile, and web technologies.

Our team of nearly 200 experts, including an R&D team of over 40 engineers, many with PhDs, has pioneered AI solutions across industries like healthcare, fashion, and e-commerce. We’ve published over 15 research papers at top conferences like NeurIPS and ICML.

We're on the lookout for a Data Engineer in ML to take on a pivotal role in our team. You'll have a big impact on a new product that builds on data gathered in 46 countries, and you'll scale data operations from thousands to millions of users. If you want to create insights about all aspects of the product (financial, behavioral, and domain-specific), this role is tailor-made for you.

We invite you to apply!

What you will do:

  • Deliver comprehensive and thoroughly verified answers to clients’ questions, document the thought process, and share findings clearly.
  • Focus on building a reliable data processing infrastructure.
  • Follow good engineering practices such as testing, documentation, infrastructure as code, and automation.
  • Solve business problems as they come and communicate with the client to gather requirements and explain data.
  • Empower developers and business teams to use data in their workflows.

Experience and skills you need to join us:

  • Strong Python (PySpark, pandas, Ray Data) and SQL skills.
  • Experience in data warehousing (Snowflake, BigQuery, Redshift).
  • Experience with ETL tools (Spark, dbt, Google Dataflow, AWS Glue).
  • Experience with pipeline management (Apache Airflow, Dagster).
  • Proven experience working on projects utilizing Machine Learning, including data preprocessing, model development, evaluation, and deployment to solve real-world problems.
  • Familiarity with IaC (Terraform, AWS CDK).
  • Familiarity with Docker.
  • Fluency in Polish and English (you will attend meetings with English-speaking clients).

It would be great if you also have:

  • Familiarity with CI/CD tools (Jenkins, Tekton, GitHub Actions).
  • Familiarity with LLM/LMM architectures, training processes, and data requirements.
  • Ability to design and deploy data-processing cloud infrastructure.

How we work:

At Tooploox, you have the flexibility to choose your working hours and location. While we value remote work, we also believe in building relationships and invite you to join us in our Warsaw and Wrocław offices. Enjoy a relaxed atmosphere and try some “home-made” pizza from our office pizza oven. We love having pets in the office, so feel free to bring yours along.

Join us and shape the future of AI while working the way you like!

Perks/benefits: Conferences

Region: Europe
Country: Poland
