Data Engineer - GenAI
Warszawa - Marynarska - AGS, Poland
Alcon
Our mission is to provide innovative vision products that enhance quality of life by helping people see better. From vision research to eye health, learn more at Alcon.com.
At Alcon, we are driven by the meaningful work we do to help people see brilliantly. We innovate boldly, champion progress, and act with speed as the global leader in eye care. Here, you’ll be recognized for your commitment and contributions and see your career like never before. Together, we go above and beyond to make an impact in the lives of our patients and customers. We foster an inclusive culture and are looking for diverse, talented people to join Alcon.
About Your Future Team
You will join the Generative AI & Data Engineering team – a pioneering unit focused on delivering intelligent solutions that power AI-driven products and decision-making. We work at the intersection of structured enterprise data and unstructured content (text, images, documents), enabling next-gen applications in NLP, LLMs, and intelligent search.
In this role, a typical day will include:
Build and optimize robust data pipelines to ingest, process, and serve structured (SQL, tabular) and unstructured (PDFs, text, images) data to downstream GenAI systems.
Design scalable and modular ETL/ELT workflows using Azure Data Services (Azure Cognitive Search, Azure Document Intelligence, etc.).
Develop reusable and modular Python components for preprocessing unstructured data for GenAI models.
Design and deploy data workflows using Docker containers for environment consistency and scalability.
Orchestrate data pipelines and jobs using Apache Airflow, ensuring reliable scheduling and monitoring.
Collaborate closely with LLM engineers, data scientists, and product teams to ensure data readiness for RAG, embeddings, and vector databases.
Handle large-scale data transformations, metadata tagging, and schema evolution across data formats (JSON, CSV, Parquet, images).
Integrate Azure OpenAI and other LLM APIs into the data workflow when required.
Contribute to the data layer that supports chatbots, document summarization, and intelligent assistants.
Work in Agile teams with global collaboration across data, AI, and product domains.
WHAT YOU’LL BRING TO ALCON:
3+ years of experience in Data Engineering, especially in cloud-first environments.
Strong hands-on skills in Python (data wrangling, file parsing, API integration) and SQL (complex queries, performance tuning).
Experience with unstructured data processing – PDFs, images, HTML, JSON, etc.
Solid understanding of Azure Data Stack: Data Lake, Azure Search, Azure Blob.
Comfortable working with large language models and vector stores.
Practical experience in preparing data for GenAI pipelines – at least 2 large-scale projects (e.g., chunking, vector embeddings).
Strong working knowledge of Docker and containerization best practices.
Hands-on experience in designing and maintaining DAGs in Apache Airflow.
Familiarity with data formats and standards relevant to AI (e.g., tokenization, embeddings, ML metadata).
Bonus: Experience with AWS or hybrid cloud environments.
Familiarity with CI/CD pipelines, version control (Git), and DevOps practices.
Proactive, curious, and adaptable with strong communication skills.
Fluent in English – written and spoken.
HOW YOU CAN THRIVE AT ALCON:
- Competitive compensation package and hybrid working model (3+2).
- Performance-linked annual bonus.
- Direct exposure to Generative and Agentic AI product development.
- Training and certification support in Azure, OpenAI, Databricks, and LLM platforms.
- Work on cutting-edge use cases like intelligent search, automated document insights, and AI copilots.
- A collaborative culture that values creativity, data craftsmanship, and innovation.
- Flexible hours and a hybrid model of work (3/2, weekly).
- Brand-new office at Marynarska 15, Warsaw, with many on-site facilities.
Alcon Careers: See your impact at alcon.com/careers
ATTENTION: Current Alcon Employee/Contingent Worker
If you are currently an active employee/contingent worker at Alcon, please click the appropriate link below to apply on the Internal Career site.