Senior Data Architect
Tasks
- Architect closed loop data flywheel
- Build and maintain data catalog
- Collaborate with ML teams to translate requirements into datasets and pipelines
- Define data quality frameworks
- Define data selection and sampling strategy
- Design annotation pipeline architecture
- Design dataset schema and dataset discovery infrastructure
- Document data architecture and produce architecture RFCs
- Maintain data pipelines and ETL ELT infrastructure
- Manage data anonymization and data lineage
- Own training environment data architecture end to end
Perks/Benefits
Skills/Tech-stack
Active Learning | Amazon S3 | Amazon SageMaker | Annotation Workflow Management | Apache Airflow | DBT | DPO | Data Annotation | Data Architecture | Data Catalog | Data Deduplication | Data Engineering | Data Generation | Data Governance | Data Lineage | Data Modeling | Data Quality | Data anonymization | Dialog Act Classification | Diversity Optimization | ELT | ETL | Entity recognition | Entity tagging | Evaluation datasets | Instruction Tuning | Intent Recognition | Metadata Management | Named Entity Recognition | PII Redaction | Preference data | Python | RLHF | SQL | SQL Based Data Quality | Sampling strategy | Schema Design | Snowflake | Synthetic Data Generation | Synthetic data | Workflow Management
Education
Roles
Related jobs
-
Sr. BI & Data Architect [Contractor] PLN 240K-384KCI/CD | Databricks | Delta Lake | Delta Live Tables | Delta Live)Senior-level Contract Full TimeRemote, Remote, Poland R16h ago
-
Data Engineer (Azure Databricks) PLN 192K-258KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsCertification programs | Health insurance | Mentorship | Professional development opportunities | Relocation programMid-level Full TimeWrocław, Lower Silesian Voivodeship, Poland R23h ago
-
Data Engineer (Azure Databricks) PLN 192K-258KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsCertification programs | Health insurance | Mentorship | Professional development | Relocation programMid-level Full TimeŁódź, Łódź Voivodeship, Poland R23h ago
-
Data Engineer (Azure Databricks) PLN 192K-258KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsCertification programs | Health insurance | Internal mobility | Mentorship | Professional developmentMid-level Full TimeKraków, Lesser Poland Voivodeship, Poland R1d ago
-
Data Engineer (Azure Databricks) PLN 192K-258KApache Spark | Azure Data | Azure Data Factory | Azure Databricks | Azure DevOpsHealth insurance | Inclusive multicultural environment | Internal mobility | Mentorship | Professional development programsMid-level Full TimeWarsaw, Masovian Voivodeship, Poland R1d ago
-
AI-Native Engineer (Full-Stack / Agentic AI Engineer) PLN 246K-378KAWS | Agent Orchestration | Airflow | Automated testing | CrewAIAutonomy | Direct stakeholder access | Paid software licenses | Premium gear | Remote-firstMid-level Full TimeWarsaw, Masovian Voivodeship, Poland - Remote R1d ago
-
Senior Data Platform Engineer (Remote) PLN 232K-370KAnsible | Apache Spark | Data Lake | Data Warehouse | Distributed SystemsRemote work flexible locationSenior-level Full TimeKraków, Poland R1d ago
-
Analytics & Data Engineer PLN 246K-340KAI Agents | BigQuery | Cloud Composer | Cloud Run | Cloud WorkflowsExternal training budget | Foreign language classes co financing | Friendly work environment | Hybrid work model | Hybrid workspaceSenior-level Full TimeWarsaw, Poland R2d ago
-
Senior ML Ops/Devops - Hybrid (She/He/They) PLN 267K-400KAirflow | Ansible | Apache Spark | Automation | BashB2B contract | Flexible collaboration model | Hybrid workSenior-level Full TimePoland R3d ago
-
C++ | Debugging | Docker | Go | High ThroughputFlexible workingSenior-level Full TimePoland R3d ago
-
AWS | Authentication | Authorization | Azure | CI/CDFlexible remote workMid-level Full TimePoland R3d ago
-
Computer Vision | Image Processing | Mechatronics | Physics | PythonEnglish-speaking environment | Equity | Flexible working hours | Partly remote work | Relocation packageSenior-level Full TimeWarsaw, Mazowieckie R5d ago
-
Agent systems | Agentic Workflows | Autonomous Agents | Chatbots | Conversational AIInternal mobility | Professional development | Remote-friendly culture | Work-life balanceSenior-level Full TimePoland, REMOTE, Poland R6d ago
-
Amazon Redshift | DBT | Data Documentation | Data Modeling | Data QualityFully remote work | Healthcare coverage | Home office equipment allowance | Learning and development budget | Paid parental leaveSenior-level Full TimePoland R7d ago
-
Senior AI Engineer - Poland remote PLN 241K-400KAutomated testing | CI/CD | Cloud Computing | Code Quality | Data AnalysisCompany pension | Company sick pay | Hackathons and socials | Healthcare package | Learning and development budgetSenior-level Full TimePoland (Remote) R8d ago
-
Senior Data Engineer (DBT/ Data Vault) PLN 258K-396KAWS | Amazon Redshift | Amazon Web Services | Apache Airflow | CI/CDSenior-level Full TimeRemote, Poland R9d ago
-
Senior Robotics & Software Engineer (202612) PLN 246K-346KAPI Development | Computer Vision | Control Systems | Management systems | PythonEnglish-speaking environment | Equity | Flexible working hours | Partly remote work | Relocation packageSenior-level Full TimeWarsaw, Mazowieckie R9d ago
-
Junior Data Engineer (maternity cover) PLN 96K-132KApache Airflow | CI/CD | Cloud Data | Cloud Data Warehouse | ConsulCafeteria benefits | English classes | Hackathons | Health insurance | Hybrid workEntry-level Full TimeWarszawa, PL, 00-841 R9d ago
-
Junior Data Engineer (maternity cover) PLN 96K-132KApache Airflow | Apache PySpark | BigQuery | CI/CD | ContainerizationCafeteria plan fringe benefits | English classes | Hackathons | Hybrid work model | InsuranceEntry-level Full TimeWarszawa, PL, 00-841 R9d ago
-
Agent Orchestration | CI/CD | Containerization | Deep learning | Distributed SystemsCafeteria benefits | Flexible working hours | Hackathons | Hybrid work model | Remote work daysSenior-level Full TimeWarszawa, PL, 00-841 R9d ago
-
CI/CD | Containerization | Distributed Systems | Docker | Google CloudCafeteria benefits plan | English classes | Ergonomic office equipment | Flexible working hours | Hackathon teamsSenior-level Full TimeWarszawa, PL, 00-841 R9d ago
-
Senior Machine Learning Engineer PLN 300K-400KAWS | Aerospike | Async Processing | BentoML | Cloud Pub/SubGym subscription | Health insurance | Non operational allowance | Paid public holidaysSenior-level Full TimePoland - Remote R10d ago
-
Senior Data Engineer (with Backend Experience) PLN 258K-384KAPI Gateway | AWS EKS | AWS EMR | AWS Lambda | Amazon S3Flexible working hours | Hybrid or in office work option | Internal training sessions | Remote work option | Training budgetSenior-level Full TimeWarszawa, Poland R13d ago
-
Junior AI Engineer PLN 115K-115KAzure Machine Learning | Azure OpenAI | Azure OpenAI API | Data Preprocessing | EmbeddingsDiscounts on Apple products | Integration events | Internal conferences | Knowledge sharing sessions | Life insuranceEntry-level Full TimePoznań, Greater Poland Voivodeship, Poland - … R14d ago
-
Senior Data Engineer PLN 232K-384KAzure Data | Azure Data Factory | Azure Databricks | Cloud Data | Cloud Data PlatformsCollaborative team culture | Flexible work arrangements | Non operational allowance | Supportive HR and management teamSenior-level Full TimePoland - Remote R15d ago