Data Engineer / Data Scientist

Kaunas, Kaunas City Municipality, Lithuania

TeleSoftas

TeleSoftas has been a successful software development company for over 20 years. We help you find solutions for complex business puzzles by mastering technological tools, solid product development, and user-centric design.

We are looking for a mid-level Data Engineer with a keen interest in the Data Science field to join our team. The ideal candidate will have a background in data engineering and software development, complemented by a strong curiosity for AI/ML, natural language processing (NLP), and agent-based systems.

In this role, you will focus on designing and maintaining scalable data pipelines and supporting the development of intelligent systems. You will be dedicated to a team working on AI agents and the infrastructure that powers them, contributing to the development of enterprise-grade cloud solutions using the latest AI technologies.

This is an opportunity to gain experience with real-world business cases, actively build the company’s knowledge base in the field, and grow your expertise at the intersection of data engineering and artificial intelligence.

Challenges you'll tackle:

  • Develop and maintain ETL/ELT pipelines using PySpark in Azure Databricks, with SQL for data transformations and Python/Pandas for data manipulation, where applicable
  • Design and implement data models for structured and unstructured data
  • Work on NLP, AI/ML, and agentic networks to build intelligent solutions
  • Develop and optimise machine learning models and integrate them into data pipelines
  • Collaborate with Data Scientists and Engineers to implement data-driven solutions
  • Use Git for version control to manage code and data pipelines effectively
  • Research and experiment with new AI/ML techniques and apply them to real-world business problems

Requirements

Skills for success:

  • 2+ years of experience in Data Engineering and/or Data Science
  • Strong programming skills in Python
  • Basic proficiency in PySpark and SQL 
  • Basic proficiency with Azure Databricks and cloud-based data engineering
  • Conceptual understanding of NLP, AI/ML, and agentic networks
  • Experience in data and process modeling for large-scale systems
  • Understanding of Git and software engineering best practices
  • Basic proficiency with data wrangling, transformation, and feature engineering
  • Problem-solving skills and the ability to work independently

Nice to Have:

  • Experience with MLOps and model deployment in production environments
  • Experience implementing CI/CD pipelines for automated data workflows and model deployment, and with containerization technologies (e.g., Docker)
  • Basic proficiency in Hugging Face, LangChain, and generative AI technologies for agentic networks
  • Understanding of data streaming (e.g., Kafka, Azure Event Hubs)
  • Knowledge of machine learning frameworks such as TensorFlow, PyTorch, or scikit-learn

Benefits

Competitive Compensation & Growth Opportunities

  • Dedicated training budget for conferences, online courses, and books to support continuous learning
  • Access to English and Lithuanian language lessons
  • Professional development through workshops, coaching sessions, and tech events

Work-Life Balance & Flexibility

  • Flexible working hours to suit your schedule
  • Unlimited work-from-home option for greater autonomy
  • A 300€ Personal Perks Pack to support your work-life balance needs

Community & Team Connection

  • Employee referral program with rewards up to 2000€ net
  • Clients & External Ambassadors program with rewards up to 5000€ net
  • Social events, including Summer/Winter parties and a Dev Day celebration
  • Team-building activities and annual live meet-ups with clients for enhanced collaboration

For this position, we offer a gross salary of 2975 € to 3636 € per month.

The final offer will depend on your experience and competencies.

Tags: Azure CI/CD Databricks Data pipelines ELT Engineering ETL Feature engineering Generative AI Git HuggingFace Kafka LangChain Machine Learning ML models MLOps Model deployment NLP Pandas Pipelines PySpark Python PyTorch Research Scikit-learn SQL Streaming TensorFlow Unstructured data

Perks/benefits: Career development Competitive pay Conferences Flex hours Team events

Region: Europe
Country: Lithuania
