Data Engineer / Data Scientist

Kaunas, Kaunas City Municipality, Lithuania

Full Time Mid-level / Intermediate EUR 35K - 43K

TeleSoftas

TeleSoftas has been a successful software development company for over 20 years. We help you find solutions for complex business puzzles by mastering technological tools, solid product development, and user-centric design.

View all jobs at TeleSoftas

Apply now Apply later

Posted 3 weeks ago

We are looking for a mid-level Data Engineer with a keen interest in the Data Science field to join our team. The ideal candidate will have a background in data engineering and software development, complemented by a strong curiosity for AI/ML, natural language processing (NLP), and agent-based systems.

In this role, you will focus on designing and maintaining scalable data pipelines and supporting the development of intelligent systems. You will be dedicated to a team working on AI agents and the infrastructure that powers them, contributing to the development of enterprise-grade cloud solutions using the latest AI technologies.

This is an opportunity to gain experience with real-world business cases, actively build the company’s knowledge base in the field, and grow your expertise at the intersection of data engineering and artificial intelligence.

Challenges you'll tackle:

Develop and maintain ETL/ELT pipelines using PySpark in Azure Databricks, with SQL for data transformations and Python/Pandas for data manipulation, where applicable
Design and implement data models for structured and unstructured data
Work on NLP, AI/ML, and agentic networks to build intelligent solutions
Develop and optimise machine learning models and integrate them into data pipelines
Collaborate with Data Scientists and Engineers to implement data-driven solutions
Work with Git and version control to manage code and data pipelines effectively
Research and experiment with new AI/ML techniques and apply them to real-world business problems

Requirements

Skills for success:

2+ years of experience in Data Engineering and/or Data Science
Strong programming skills in Python
Basic proficiency in PySpark and SQL
Basic proficiency with Azure Databricks and cloud-based data engineering
Conceptual understanding of NLP, AI/ML, and agentic networks
Experience in data and process modeling for large-scale systems
Understanding Git and software engineering best practices
Basic proficiency with data wrangling, transformation, and feature engineering
Problem-solving skills and the ability to work independently

Nice to Have:

Experience with MLOps and model deployment in production environments
Experience in implementing CI/CD pipelines for automated data workflows and model deployment, dockerization technologies
Basic proficiency in Huggingface, Langchain and generative AI technologies for agentic networks
Understanding of data streaming (e.g., Kafka, Azure Event Hubs)
Knowledge of machine learning frameworks such as TensorFlow, PyTorch, or Scikit-Learn

Benefits

Competitive Compensation & Growth Opportunities

Dedicated training budget for conferences, online courses, and books to support continuous learning
Access to English and Lithuanian language lessons
Professional development through workshops, coaching sessions, and tech events

Work-Life Balance & Flexibility

Flexible working hours to suit your schedule
Unlimited work-from-home option for greater autonomy
A 300€ Personal Perks Pack to support your work-life balance needs

Community & Team Connection

Employee referral program with rewards up to 2000€ net
Clients & External Ambassadors with rewards up to 5000€ net
Social events, including Summer/Winter parties and a Dev Day celebration
Team-building activities and annual live meet-ups with clients for enhanced collaboration

For this position, we offer 2975 € - 3636 €/month gross salary.

The final offer will depend on your experience and competencies.

Apply now Apply later

Job stats: 1 0 0

Categories: Data Science Jobs Engineering Jobs

Tags: Azure CI/CD Databricks Data pipelines ELT Engineering ETL Feature engineering Generative AI Git HuggingFace Kafka LangChain Machine Learning ML models MLOps Model deployment NLP Pandas Pipelines PySpark Python PyTorch Research Scikit-learn SQL Streaming TensorFlow Unstructured data