Data Engineer / Data Scientist
Kaunas, Kaunas City Municipality, Lithuania
Full Time Mid-level / Intermediate EUR 35K - 43K
TeleSoftas
TeleSoftas has been a successful software development company for over 20 years. We help you find solutions for complex business puzzles by mastering technological tools, solid product development, and user-centric design.We are looking for a mid-level Data Engineer with a keen interest in the Data Science field to join our team. The ideal candidate will have a background in data engineering and software development, complemented by a strong curiosity for AI/ML, natural language processing (NLP), and agent-based systems.
In this role, you will focus on designing and maintaining scalable data pipelines and supporting the development of intelligent systems. You will be dedicated to a team working on AI agents and the infrastructure that powers them, contributing to the development of enterprise-grade cloud solutions using the latest AI technologies.
This is an opportunity to gain experience with real-world business cases, actively build the company’s knowledge base in the field, and grow your expertise at the intersection of data engineering and artificial intelligence.
Challenges you'll tackle:
- Develop and maintain ETL/ELT pipelines using PySpark in Azure Databricks, with SQL for data transformations and Python/Pandas for data manipulation, where applicable
- Design and implement data models for structured and unstructured data
- Work on NLP, AI/ML, and agentic networks to build intelligent solutions
- Develop and optimise machine learning models and integrate them into data pipelines
- Collaborate with Data Scientists and Engineers to implement data-driven solutions
- Work with Git and version control to manage code and data pipelines effectively
- Research and experiment with new AI/ML techniques and apply them to real-world business problems
Requirements
Skills for success:
- 2+ years of experience in Data Engineering and/or Data Science
- Strong programming skills in Python
- Basic proficiency in PySpark and SQL
- Basic proficiency with Azure Databricks and cloud-based data engineering
- Conceptual understanding of NLP, AI/ML, and agentic networks
- Experience in data and process modeling for large-scale systems
- Understanding Git and software engineering best practices
- Basic proficiency with data wrangling, transformation, and feature engineering
- Problem-solving skills and the ability to work independently
Nice to Have:
- Experience with MLOps and model deployment in production environments
- Experience in implementing CI/CD pipelines for automated data workflows and model deployment, dockerization technologies
- Basic proficiency in Huggingface, Langchain and generative AI technologies for agentic networks
- Understanding of data streaming (e.g., Kafka, Azure Event Hubs)
- Knowledge of machine learning frameworks such as TensorFlow, PyTorch, or Scikit-Learn
Benefits
Competitive Compensation & Growth Opportunities
- Dedicated training budget for conferences, online courses, and books to support continuous learning
- Access to English and Lithuanian language lessons
- Professional development through workshops, coaching sessions, and tech events
Work-Life Balance & Flexibility
- Flexible working hours to suit your schedule
- Unlimited work-from-home option for greater autonomy
- A 300€ Personal Perks Pack to support your work-life balance needs
Community & Team Connection
- Employee referral program with rewards up to 2000€ net
- Clients & External Ambassadors with rewards up to 5000€ net
- Social events, including Summer/Winter parties and a Dev Day celebration
- Team-building activities and annual live meet-ups with clients for enhanced collaboration
For this position, we offer 2975 € - 3636 €/month gross salary.
The final offer will depend on your experience and competencies.
Tags: Azure CI/CD Databricks Data pipelines ELT Engineering ETL Feature engineering Generative AI Git HuggingFace Kafka LangChain Machine Learning ML models MLOps Model deployment NLP Pandas Pipelines PySpark Python PyTorch Research Scikit-learn SQL Streaming TensorFlow Unstructured data
Perks/benefits: Career development Competitive pay Conferences Flex hours Team events
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.