Senior AI Data Engineer
Tasks
- Build ETL processes for AI research
- Build real-time telemetry pipelines
- Build streaming and event-driven pipelines
- Collaborate on data requirements and quality
- Build vector embedding storage and retrieval
- Design scalable data pipelines
- Enable data pipelines for LLM retrieval-augmented generation (RAG)
- Implement data governance and security
- Implement data validation and schema testing
- Model data for analytics and ML training
- Optimize cloud data platform reliability, scalability, and cost
- Optimize partitioning, indexing, and storage
- Set up observability and monitoring
- Track data lineage
- Troubleshoot data pipeline failures
Perks/Benefits
- N/A
Skills/Tech-stack
AWS | Airflow | Automated Data Validation | Azure | CI/CD | Dask | Data Governance | Data Lakehouse | Data Lineage | Data Pipelines | Data Security | Data Validation | Docker | ETL | Event Driven | Event-driven architecture | Experiment tracking | Feature Engineering | Feature Store | Flink | GCP | Hybrid Architecture | Kafka | Knowledge graphs | Kubernetes | LLM | LLM Evaluation | Milvus | Monitoring | NoSQL | Observability | Pinecone | Prompt engineering | PyTorch | Python | RAG | Ray | Retrieval-Augmented Generation | SQL | Scala | Schema Testing | Semantic Caching | Spark | Streaming | TensorFlow | Terraform | Vector Databases | Vector embeddings | Weaviate
Education
Bachelor of Engineering | Bachelor of Science | Master of Science