Data engineer Manager
Nationwide, Colombia
A COP 54000K-72000K (estimate) Mid-level Full Time
Tasks
- Assign confidence scores for extracted fields
- Build ETL/ELT pipelines
- Build OCR document intelligence pipelines
- Capture manual inputs
- Classify documents and map fields
- Design human in the loop review workflows
- Detect schema drift and anomalous scores
- Ensure data freshness and consistency
- Extract structured data using NLP and LLM
- Feed validated data into scoring algorithms
- Implement automated data quality checks
- Implement data validation checks
- Implement error handling and retries
- Implement incremental loads
- Ingest audit metadata and document links
- Integrate AI extracted fields into structured tables
- Monitor pipelines and send alerts
- Optimize data models for Tableau dashboards
- Route low confidence outputs for human review
- Support reprocessing and backfills
- Track value provenance manual vs AI derived
Perks/Benefits
Skills/Tech-stack
Apache Airflow | Confidence scoring | DBT | Dagster | Data Lineage | Data Observability | Data Validation | Document Classification | ELT | ETL | Entity Extraction | Incremental loading | LLM | NLP | OCR | Prefect | Prompt engineering | Python | SQL | Snowflake | Tableau
Education
N/A
Regions
Countries
States
Related jobs
-
Senior-level Full TimeColombia17h ago
-
Access Control | Azure Data | Azure Data Factory | Azure Databricks | Azure SynapseSenior-level Full TimeColombia17h ago
-
Access Control | Agile | Azure Data | Azure Data Factory | Azure DatabricksHybrid workSenior-level Full TimeColombia17h ago
-
Senior-level Full TimeColombia17h ago
-
Senior-level Full TimeColombia17h ago
-
Senior-level Full TimeColombia17h ago
-
Automated testing | DBT | Data Pipelines | Data Quality | Data ValidationSenior-level Full TimeColombia17h ago
-
AI Engineer, LLM Systems & Agentic Workflows GBP 97K-130KAgentic Workflows | Anthropic API | Constrained decoding | Evaluation Pipelines | Go100 percent remote | Employee referral bonuses | Flexible time off | Fun events | Room to growSenior-level Full TimeArgentina; Brazil; Chile; Colombia; Costa Rica; … R22h ago
-
Data Engineer ID52278 COP 54000K-74400KAmazon Redshift | Amazon Web Services | DBT | Data Lakes | Data ModelingEducation budget | Fitness budget | Flexible schedule | Mentorship | Remote and office optionsMid-level Full TimeArmenia, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAWS | Amazon Redshift | Amazon Web Services | Availability | DBTFlextime | Mentorship | Office work options | Personalized growth roadmaps | Remote work optionsMid-level Full TimePopayan, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAWS | Amazon Redshift | DBT | Data Lakes | Data WarehousingEducation budget | Fitness budget | Flextime | Growth roadmaps | MentorshipMid-level Full TimeIbague, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAmazon Redshift | Amazon Web Services | Computer Science | Computer science fundamentals | DBTEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapsMid-level Full TimeItagüí, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAmazon Redshift | Amazon Web Services | Availability | DBT | Data LakesEducation budget | Fitness budget | Flextime | Mentorship | Office optionsMid-level Full TimeSoledad, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAWS | Algorithms | Amazon Redshift | DBT | Data LakesFlexible schedule | Mentorship | Office work options | Remote work options | TechtalksMid-level Full TimePalmira, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAlgorithms | Amazon Redshift | Amazon Web Services | DBT | Data LakesFlextime | Mentorship | Office options | Personalized growth roadmap | Remote workMid-level Full TimeSanta Marta, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAmazon Redshift | Amazon Web Services | DBT | Data Lakes | Data PipelinesFlexible schedule | Mentorship | Remote and office options | TechtalksMid-level Full TimeChia, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAmazon Redshift | Amazon Web Services | DBT | Data Lakes | Data ModelingFlexible schedule | Mentorship | Personalized growth roadmaps | Remote and office options | TechtalksMid-level Full TimeValledupar, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAmazon Redshift | Amazon Web Services | DBT | Data Lakes | Data WarehousingEducation budget | Fitness budget | Flexible schedule | Mentorship | Office optionsMid-level Full TimePasto, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAWS | Amazon Redshift | DBT | Django | KubernetesEducation budget | Fitness budget | Flexible schedule | Mentorship | Office work optionsMid-level Full TimeTunja, Colombia1d ago
-
Data Engineer ID52278 COP 54000K-74400KAlgorithms | Amazon Redshift | Amazon Web Services | DBT | Data LakesFlexible schedule | Mentorship | Office options | Remote work | TechtalksMid-level Full TimeBello, Colombia1d ago
-
.NET | API proxying | Agentic Workflow | Automated testing | C#Education budget | Fitness budget | Flexible schedule | Mentorship | Office optionsMid-level Full TimeMedellín, Colombia1d ago
-
.NET | API proxying | Agentic Workflows | Automated testing | C#Flexible schedule | Mentorship | Office options | Personalized growth roadmaps | Remote optionsMid-level Full TimeBogota, Colombia1d ago
-
.NET | API Integration | API proxying | Agentic Workflows | Automated testingEducation budget | Fitness budget | Flexible schedule | Mentorship | Office optionsMid-level Full TimeCartagena, Colombia1d ago
-
.NET | API proxying | Automated testing | C# | CI/CDFlexible schedule | Mentorship | Office options | Personalized growth roadmaps | Remote optionsMid-level Full TimePereira, Colombia1d ago
-
.NET | API proxying | Agentic Workflow | Automated testing | C#Flexible schedule | Mentorship | Remote and office options | TechtalksMid-level Full TimeCali, Colombia1d ago