Research Engineer
Tasks
- Analyze pre post results and error breakdowns
- Audit datasets for errors leakage and distribution shift
- Collaborate with researchers engineers and QAs on specs and edge cases
- Define data quality rubrics and boundary cases
- Design and build datasets and RL environments
- Design evaluations and run training experiments
- Generate synthetic data and augmentations
- Implement denoising and difficulty diversity controls
- Implement validation filtering and deduplication pipelines
- Translate research goals into data requirements
Perks/Benefits
Skills/Tech-stack
Ablation Studies | Chart Reading | Code review | Continuous integration | Data Augmentation | Data Decontamination | Data Generation | Data Quality | Data Validation | Data leakage | Data leakage detection | Dataset curation | Debugging | Deduplication | Deep learning | Document Understanding | Evaluation | Human Feedback | Language Models | Leakage detection | Learning from Human Feedback | Long Horizon Planning | Machine Learning | Multimodal reasoning | OCR | Python | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reinforcement Learning from Human Feedback | Reward Modeling | SQL | Synthetic Data Generation | Synthetic data | Trajectory analysis | Unit Testing | Verifier Training | Vision Language Models | Vision-language
Education
N/A
Regions
Countries
States
Related jobs
-
Amazon Web Services | CI/CD | GCP | LLM integration | LangchainEducation budget | Fitness budget | Flextime | Mentorship | Office optionsMid-level Full TimePereira, Colombia5h ago
-
API Integration | AWS | CI/CD | GCP | LLM integrationEducation budget | Fitness budget | Flexible schedule | Mentorship | Office optionsMid-level Full TimeCali, Colombia5h ago
-
Senior-level Full TimeArgentina, Santiago del Estero, Argentina; Bogotá, …1d ago
-
Sr. Data Engineer (Snowflake/dbt) USD 152K-204KAccess Control | Clustering | Compute Optimization | DBT | Data GovernanceFully remoteSenior-level Full TimeRemote (Mexico); Remote (Uruguay); Remote (Chile); … R1d ago
-
CI/CD | Data Integrity | Database backups | Database performance | Database performance tuningEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapEntry-level Full TimeUsaquen, Colombia3d ago
-
AWS RDS | CI/CD | Cloud SQL | Data Migration | Database backupsFlextime | Mentorship | Personalized growth roadmap | Techtalks | Work from homeEntry-level Full TimeMedellin, Colombia3d ago
-
CI/CD | Database backups | Database performance | Database performance tuning | Database replicationEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapsEntry-level Full TimeCali, Colombia3d ago
-
CI/CD | Data Integrity | Database backups | Database performance | Database performance tuningFlexible work hours | Mentorship | Personalized growth roadmap | Techtalks | Work from homeEntry-level Full TimeBucaramanga, Colombia3d ago
-
AWS Glue | AWS RDS | CI/CD | Cloud SQL | Data MigrationEducation budget | Fitness budget | Flexible work hours | Mentorship | Personalized growth roadmapEntry-level Full TimeBarranquilla, Colombia3d ago
-
AWS RDS | Amazon RDS | CI/CD | Cloud SQL | Data MigrationEnglish communication support | Fitness budget | Flextime | Mentorship | Personalized growth roadmapEntry-level Full TimeVillavicencio, Colombia3d ago
-
CI/CD | Data Migration | Database backups | Database performance | Database performance tuningEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapEntry-level Full TimeManizales, Colombia3d ago
-
CI/CD | Data Migration | Database backups | Database performance | Database performance tuningEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapsEntry-level Full TimePereira, Colombia3d ago
-
AWS Glue | AWS RDS | CI/CD | Cloud SQL | Data MigrationEducation budget | Fitness budget | Flextime | Growth roadmap | MentorshipEntry-level Full TimeCartagena, Colombia3d ago
-
Python & SQL Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPIs | AWS | Amazon S3 | Apache Airflow | CI/CDFitness budget | Flextime | Mentorship | Personalized growth roadmap | Team activitiesMid-level Full TimeUsaquen, Colombia3d ago
-
Python & SQL Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPIs | AWS | Amazon S3 | Apache Airflow | CI/CDFlextime | Mentorship | Personalized growth roadmaps | Techtalks | Work from homeMid-level Full TimeMedellin, Colombia3d ago
-
Python & SQL Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPIs | AWS | Amazon S3 | Apache Airflow | CI/CDFlextime | Mentorship | Personalized growth roadmaps | Techtalks | Work from homeMid-level Full TimeCali, Colombia3d ago
-
Python & SQL Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPIs | AWS | Amazon S3 | Apache Airflow | CI/CDEducation budget | Fitness budget | Flextime | Growth roadmaps | MentorshipMid-level Full TimeBucaramanga, Colombia3d ago
-
Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPIs | AWS | Amazon S3 | Apache Airflow | CI/CDFlextime | Mentorship | Personalized growth roadmap | Techtalks | Work from home optionMid-level Full TimeUsaquen, Colombia3d ago
-
Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPI Integration | AWS | Amazon S3 | Apache Airflow | CI/CDEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapMid-level Full TimeMedellin, Colombia3d ago
-
Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPIs | AWS | Apache Airflow | CI/CD | DBTEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapMid-level Full TimeCali, Colombia3d ago
-
Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPI Integration | AWS | Apache Airflow | CI/CD | DBTEducation budget | Fitness budget | Flextime | Mentorship | Personalized growth roadmapMid-level Full TimeBucaramanga, Colombia3d ago
-
Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPI Integration | AWS | Amazon S3 | Apache Airflow | CI/CDEducation budget | Fitness budget | Flextime | Mentorship | Team activitiesMid-level Full TimeBarranquilla, Colombia3d ago
-
AWS CDK | AWS CloudFormation | AWS Glue | AWS Lambda | AWS Step FunctionsPaid time off | Remote work | Work autonomySenior-level Full TimeBogota R4d ago
-
Python & SQL Data Engineer (Middle/Senior) ID32276 COP 54000K-74400KAPIs | AWS | Amazon S3 | Apache Airflow | CI/CDEducation budget | Fitness budget | Flextime | Growth roadmap | MentorshipMid-level Full TimeBarranquilla, Colombia4d ago
-
AWS | Azure | CDC | CloudFormation | DBTCollaborative environment | International client exposure | Mentorship programs | Professional development opportunitiesSenior-level Full TimeColombia4d ago