Mid/Senior Data Engineer

Remote job

GetInData

Our core activity is big data software development, we build a modern data platforms for session analytics, recommendations, paternities matching and anomaly detections in real-time. We also teach how to use modern real-time Big Data...

View all jobs at GetInData

Apply now Apply later

About us

GetInData | Part of Xebia is a leading data company working for international Clients, delivering innovative projects related to Data, AI, Cloud, Analytics, ML/LLM, and GenAI. The company was founded in 2014 by data engineers and today brings together 120 Data & AI experts. Our Clients are both fast-growing scaleups and large corporations that are industry leaders. In 2022, we joined forces with Xebia Group to broaden our horizons and bring new international opportunities.

What about the projects we work with?

We run a variety of projects in which our sweepmasters can excel. Advanced Analytics, Data Platforms, Streaming Analytics Platforms, Machine Learning Models, Generative AI and more. We like working with top technologies and open-source solutions for Data & AI and ML/AI. In our portfolio, you can find Clients from many industries, e.g., media, e-commerce, retail, fintech, banking, and telcos, such as Truecaller, Spotify, ING, Acast, Volt, Play, and Allegro. You can read some customer stories here.

What else do we do besides working on projects?

We conduct many initiatives like Guilds and Labs and other knowledge-sharing initiatives. We build a community around Data & AI, thanks to our conference Big Data Technology Warsaw Summit, meetup Warsaw Data Tech Talks, Radio Data podcast, and DATA Pill newsletter.


Data & AI projects that we run and the company's philosophy of sharing knowledge and ideas in this field make GetInData | Part of Xebia not only a great place to work but also a place that provides you with a real opportunity to boost your career.

If you want to be up to date with the latest news from us, please follow up on our LinkedIn profile.


About role

A Data Engineer's role involves crafting, constructing, and upholding the structure, tools, and procedures essential for an organization to gather, store, modify, and scrutinize extensive data amounts. This position involves creating data platforms using typically provided infrastructure and establishing a clear path for Analytics Engineers who utilize the system.

Responsibilities

  • Development and maintenance of ETL and data platforms (Python, Scala, Spark, HDFS, Hive)
  • Development and maintenance of access applications for business users and user support (Airflow, Jupyterhub, Trino, Superset, MLFlow) in the context of Kubernetes, Docker, ArgoCD
  • Automation and CICD (Gitlab-CI)
  • Monitoring (Prometheus)
  • R&D, maintenance, and monitoring of the platform's components
  • Implementing and executing policies aligned with the company's strategic plans concerning used technologies, work organization, etc.

Requirements

  • Proficiency in a programming language like Python and Scala
  • Working with Spark messaging systems
  • Experience with Hadoop
  • Hands-on experience with Kubernetes
  • Strong programming skills with a solid understanding of software engineering principles, best practices, and solutions
  • Experience with Version Control System, preferably GIT
  • Ability to actively participate/lead discussions with clients to identify and assess concrete and ambitious avenues for improvement
We offer
  • Salary: 140 - 185 PLN net + VAT/h B2B (depending on knowledge and experience)

  • 100% remote work

  • Flexible working hours

  • Possibility to work from the office located in the heart of Warsaw

  • Opportunity to learn and develop with the best Big Data experts

  • International projects

  • Possibility of conducting workshops and training

  • Certifications

  • Co-financing sport card

  • Co-financing health care

  • All equipment needed for work

Apply now Apply later
Job stats:  0  0  0
Category: Engineering Jobs

Tags: Airflow Banking Big Data Docker E-commerce Engineering ETL Excel FinTech Generative AI Git GitLab Hadoop HDFS Kubernetes LLMs Machine Learning MLFlow ML models Open Source Python R R&D Scala Spark Streaming Superset

Perks/benefits: Career development Flex hours

Region: Remote/Anywhere

More jobs like this