Data Engineer

Madrid, Community of Madrid, ES

Thetaray

Transparent and explainable solutions helping banks and fintechs improve risk detection and grow confidently

View all jobs at Thetaray

Apply now Apply later

Description

About ThetaRay:

ThetaRay is a trailblazer in AI-powered Anti-Money Laundering (AML) solutions, offering cutting-edge technology to fintechs, banks, and regulatory bodies worldwide. Our mission is to enhance trust in financial transactions, ensuring compliant and innovative business growth. 

Our technology empowers customers to expand into new markets and introduce groundbreaking products.

Thetaray is a culture-driven company. Our values are at the heart of our success. By joining us, you'll have the opportunity to embody these values and inspire others through your actions.

ThetaRay is looking for a Data Engineer to join our growing team of data experts.

The hire will be responsible for designing, implementing, and optimizing data pipeline flows within the ThetaRay system.

The ideal candidate has experience in building data pipelines and data transformations and enjoys optimizing data flows and building them from the ground up.

The Data Engineer will support our data scientists with the implementation of the relevant data flows based on the data scientist’s features design and construct complex rules to detect money laundering activity.

They must be self-directed and comfortable supporting multiple production implementations for various use cases.

 

Key Responsibilities

  • Implement and maintain data pipeline flows in production within the ThetaRay system based on the data scientist’s design
  • Design and implement solution-based data flows for specific use cases, enabling the applicability of implementations within the ThetaRay product
  • Building a Machine Learning data pipeline
  • Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
  • Work with product, R&D, data, and analytics experts to strive for greater functionality in our systems
  • Train customer data scientists and engineers to maintain and amend data pipelines within the product
  • Travel to customer locations both domestically and abroad
  • Build and manage technical relationships with customers and partners

Requirements

  • 1+ years of hands-on experience with SQL.
  • Experience with Python (Pandas)
  • Experience with PySpark/Scala/Java/R
  • Hands-on experience with data transformation, validations, cleansing, and ML feature engineering
  • Hands-on experience working with Apache Spark cluster - an advantage.
  • BSc degree or higher in Computer Science, Statistics, Informatics, Information Systems, Engineering, or another quantitative field.
  • Experience working with and optimizing big data pipelines, architectures, and data sets - an advantage.
  • Strong analytic skills related to working with structured and semi-structured datasets.
  • Build processes supporting data transformation, data structures, metadata, dependency, and workload management.
  • Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
  • Business-oriented and able to work with external customers and cross-functional teams.
  • Fluent in English & Spanish both written and spoken


Nice to have

  • Experience with Linux
  • Experience in building Machine Learning pipeline
  • Experience with Elasticsearch
  • Experience with Zeppelin/Jupyter
  • Experience with workflow automation platforms such as Jenkins or Apache Airflow
  • Experience with Microservices architecture components, including Docker and Kubernetes.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0
Category: Engineering Jobs

Tags: Airflow Architecture Big Data Computer Science Data pipelines Docker Elasticsearch Engineering Feature engineering Java Jenkins Jupyter Kubernetes Linux Machine Learning Microservices Pandas Pipelines PySpark Python R R&D Scala Spark SQL Statistics

Perks/benefits: Career development

Region: Europe
Country: Spain

More jobs like this