Senior Data Architect
Tasks
- Architect closed loop data flywheel
- Build and maintain data catalog
- Collaborate with ML teams to translate requirements into datasets and pipelines
- Define data quality frameworks
- Define data selection and sampling strategy
- Design annotation pipeline architecture
- Design dataset schema and dataset discovery infrastructure
- Document data architecture and produce architecture RFCs
- Maintain data pipelines and ETL ELT infrastructure
- Manage data anonymization and data lineage
- Own training environment data architecture end to end
Perks/Benefits
Skills/Tech-stack
Active Learning | Amazon S3 | Amazon SageMaker | Annotation Workflow Management | Apache Airflow | DBT | DPO | Data Annotation | Data Architecture | Data Catalog | Data Deduplication | Data Engineering | Data Generation | Data Governance | Data Lineage | Data Modeling | Data Quality | Data anonymization | Dialog Act Classification | Diversity Optimization | ELT | ETL | Entity recognition | Entity tagging | Evaluation datasets | Instruction Tuning | Intent Recognition | Metadata Management | Named Entity Recognition | PII Redaction | Preference data | Python | RLHF | SQL | SQL Based Data Quality | Sampling strategy | Schema Design | Snowflake | Synthetic Data Generation | Synthetic data | Workflow Management
Education
Roles
Related jobs
-
AI & Software Solutions Architect (Remote, Contract) PLN 263K-400KAI Agents | AWS | AgentOps | Agile | Apache IcebergFull remote | Long-term B2B collaboration | Paid sick leave | Paid vacation | Premium AI tools accessSenior-level ContractPoland R1d ago
-
ML Engineer (Forecasting) | NDA PLN 276K-276KARIMA | AWS | Azure | Data Pipelines | Data PreprocessingMid-level Full TimeEurope - Remote R4d ago
-
Staff Data Engineer PLN 309K-463KApache Airflow | Cloud Computing | DBT | Data Modeling | Data ObservabilityHybrid work flexibility | Mentorship opportunities | Remote work flexibilitySenior-level Full TimePoland - Krakow - Hybrid R4d ago
-
Group IT SaaS - Data Engineer PLN 166K-255KAgile methodology | Azure Data | Azure Data Factory | Azure Key Vault | DAXE-learning access | Hybrid work | Life insurance | Lunch subsidy | Medical insuranceMid-level Full TimeŁódź, PL, 90-118 R4d ago
-
Senior Analytics Engineer (Remote from Poland) PLN 257K-385KDBT | Dagster | Dashboards | Data Modeling | Data QualityRemote work opportunitySenior-level Full TimeWarsaw, Poland - Remote R5d ago
-
ADLS Gen2 | API Key | API key authentication | Application Insights | Argo CDFlexible working hours | Free parking | Medicover sport and health | Modern equipment | Modern officeMid-level Full TimeGdańsk, Pomeranian Voivodeship, Poland R6d ago
-
Middle Data Engineer (Azure Databricks) PLN 276K-276KAzure Data | Azure Data Factory | Azure Databricks | Azure DevOps | CI/CDHealth insurance | Internal mobility | Mentorship | Professional development opportunities | Relocation programMid-level Full TimeKatowice, Silesian Voivodeship, Poland R7d ago
-
Middle Data Engineer (Azure Databricks) PLN 276K-276KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsHealth insurance | Mentorship | Professional development programs | Relocation program | Work from anywhereMid-level Full TimeGdańsk, Pomeranian Voivodeship, Poland R7d ago
-
Middle Data Engineer (Azure Databricks) PLN 276K-276KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsCertification programs | Company social events | Health insurance | Internal mobility | MentorshipMid-level Full TimePoznań, Greater Poland Voivodeship, Poland R7d ago
-
Middle Data Engineer (Azure Databricks) PLN 276K-276KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsHealth insurance | Internal mobility | Internship opportunities | Mentorship | Mentorship programsMid-level Full TimeWrocław, Lower Silesian Voivodeship, Poland R7d ago
-
Middle Data Engineer (Azure Databricks) PLN 276K-276KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsCertification programs | Health insurance | Internal mobility | Internship opportunities | MentorshipMid-level Full TimeŁódź, Łódź Voivodeship, Poland R7d ago
-
Middle Data Engineer (Azure Databricks) PLN 276K-276KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsHealth insurance | Internal mobility | Internship opportunities | Mentorship | Professional developmentMid-level Full TimeWarsaw, Masovian Voivodeship, Poland R7d ago
-
Middle Data Engineer (Azure Databricks) PLN 150K-240KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsCertification programs | Health insurance | Internal mobility | Internship opportunities | MentorshipMid-level Full TimeKraków, Lesser Poland Voivodeship, Poland R7d ago
-
Sr. BI & Data Architect [Contract] PLN 257K-401KCI/CD | Data Lineage | Databricks | Delta Lake | Delta Live TablesSenior-level Full TimeRemote, REMOTE, Poland R7d ago
-
Data Engineer (Azure Databricks) PLN 276K-276KAzure Data | Azure Data Factory | Azure Databricks | Azure DevOps | CI/CDCertification programs | Health insurance | Internal mobility | Mentorship | Professional developmentMid-level Full TimeWarsaw, Masovian Voivodeship, Poland R7d ago
-
Data Engineer (Azure Databricks) PLN 276K-276KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsHealth insurance | Internal mobility | Mentorship | Professional development | Relocation programMid-level Full TimeKraków, Lesser Poland Voivodeship, Poland R7d ago
-
Data Engineer (Azure Databricks) PLN 276K-276KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsHealth insurance | Mentorship | Professional development opportunities | Relocation program | Work from anywhereMid-level Full TimeŁódź, Łódź Voivodeship, Poland R7d ago
-
Data Engineer (Azure Databricks) PLN 276K-276KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsCertification programs | Health insurance | Internal mobility | Mentorship | Professional development opportunitiesMid-level Full TimeWrocław, Lower Silesian Voivodeship, Poland R7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Poland) PLN 324K-450KClassification | Context window | Context window optimization | Data cleaning | Data labelingAI tools access | Co-working space budget | Equipment budget | Fully remote | Health and wellness supportSenior-level Full TimePoland R7d ago
-
BI reporting | Data Modeling | Data Quality | Data Reconciliation | Data WarehousingContract position | Professional development | Remote workSenior-level Contract Full TimePoland - Remote R7d ago
-
Senior AI/ML Engineer (AI Lead) PLN 216K-216KCI/CD | Data Drift | Data Drift Detection | Data Quality | Drift DetectionBirthday benefits | Employee training and development program | Health insurance | Hybrid work flexibility | Loyalty benefitsSenior-level Full TimeWarsaw, Warsaw, Poland (Hybrid) R7d ago
-
Senior ML Engineer - Kimchi (LLM Inference Optimization) PLN 292K-400KAWS | ArgoCD | Azure | CUDA | Chunked prefillAnnual hackathon | Conference access | Equipment budget | Equity options | Extra days offSenior-level Full TimePoland R8d ago
-
Software Engineer (Python, Kubernetes, AI/ML) USD 153K-258KAI Inference | Autoscaling | Container Orchestration | Docker | GPU schedulingExtra Paid Sick Leave | Extra paid vacation | Flexible working hours | Language courses | Modern office amenitiesSenior-level Full TimePoland, Serbia, Cyprus, Georgia R11d ago
-
AI Solution Architect USD 184K-300KAnsible | CI/CD | Docker | GitOps | GoExtra Paid Sick Leave | Extra paid vacation | Flexible working hours | Hybrid or remote options | Language coursesSenior-level Full TimePoland, Serbia, Cyprus, Georgia R11d ago
-
Machine Learning Manager PLN 151K-229KAPI Development | C++ | Computer Vision | Data-Driven Decision Making | Data-drivenEnglish-speaking environment | No late evening calls | Partly remote work | Relocation packageMid-level Full TimeWarsaw, Poland R12d ago