Member of Engineering (Pre-training / Data Acquisition)
Remote (EMEA/East Coast)
R
USD 175K-270K (estimate) Mid-level Full Time
Tasks
- Align data acquisition priorities with model training
- Build deep crawlers for high value sources
- Build high throughput ingestion pipelines for partner data
- Build observability monitoring and debugging tools
- Design web crawler infrastructure
- Develop data acquisition roadmap
- Ingest and parse large scale web data
Perks/Benefits
- Company-provided equipment
- Flexible hours
- Frequent team get togethers
- Fully remote work
- Health insurance allowance
- Home-office allowance
- Parental leave
- People-first culture
- Vacation and holidays
- Well-being allowances
Skills/Tech-stack
AWS | Data Parsing | Data Privacy | Debugging | Distributed Systems | Distributed job queues | Docker | HTTP | Job queues | Kubernetes | Performance optimization | Python | Robots.txt | Web Crawling
Education
N/A
Related jobs
-
Data Science & AI Specialist GBP 28K-28KAWS Bedrock | Apache Pulsar | Apache Spark | Artificial Intelligence | CI/CDCarers leave bonus | Discounted mobile and broadband | Equalized maternity paternity and adoption leave | Holiday purchase scheme | Paid carer’s leaveMid-level Full TimeLondon, GB, E1 8EP R5h ago
-
Analytics Engineer EUR 122K-122KAirflow | Canonical Data | DBT | Data Contracts | Data ModelingQuarterly in person surges | Remote-firstMid-level Full TimeHybrid - Luxembourg R23h ago
-
CAG | DBT | Data Modeling | Elasticsearch | GrafanaEmployee discounts | Employee events | Flexible working hours | Free drinks | Health offersSenior-level Full TimeHinterm Hauptbahnhof 3-5, 76137 Karlsruhe R23h ago
-
DBT | Elasticsearch | Grafana | Looker | PostgreSQLEmployee discounts | Employee events | Flexible working hours | Health offers | Hybrid workSenior-level Full TimeRevaler Straße 28-31, 10245 Berlin R23h ago
-
Junior Data Engineer EUR 19K-20KApache Airflow | Apache Spark | Data Governance | Data Modeling | Data TransformationGrowth plan | Gym discounts | Learning resources | Mental health support | MentorshipEntry-level Full TimeMadrid R1d ago
-
Junior Data Engineer GBP 30K-40KApache Airflow | Apache Spark | Data Governance | Data Modeling | Data ProcessingGrowth plan | Gym discounts | Learning resources | Mental health support | MentorshipEntry-level Full TimeLondon R1d ago
-
Junior Data Engineer EUR 19K-20KApache Airflow | Apache Spark | Data Governance | Data Modeling | Data PipelinesGym discounts | Learning resources | Mental health support | Mentorship program | Private healthcareEntry-level Full TimeMilan R1d ago
-
DataOps Engineer GBP 50K-60KAWS | Ansible | Bash | Docker | ELTFlexible working hours | Remote work option | Team collaborationMid-level Full TimeUnited Kingdom R1d ago
-
Senior AI Engineer, AI Lab GBP 90K-131KBLEU | Bark | DVC | ElevenLabs | Fine TuningAnnual leave | Employee assistance program | Free Economist content online subscription | Moving home allowance | Parental leaveSenior-level Full TimeLondon - Commercial R1d ago
-
Batch Processing | BigQuery | CI/CD | Cloud Run | Cloud SQLAutonomy | Flexible work location | Fully remote | Inclusive workplace | Knowledge sharing cultureSenior-level Full TimeRomania R1d ago
-
AI Coding Assistants | AI coding | Batch Processing | BigQuery | CI/CDFully remote | High autonomy | Inclusive workplace | Knowledge sharing culture | Minimal bureaucracySenior-level Full TimeItaly R1d ago
-
Batch Processing | BigQuery | CI/CD | Cloud Run | Cloud SQLAutonomy | Collaborative team culture | Equal opportunity | Fully remote | Professional developmentSenior-level Full TimeSwitzerland R1d ago
-
BigQuery | CI/CD | Cloud Run | Cloud SQL | Cloud StorageAutonomy and trust | Fully remote work | Inclusive workplace | Minimal bureaucracy | Professional development networkSenior-level Full TimeSpain R1d ago
-
Data Engineer (All projects, Global) UAH 705K-1549KAPI Integration | AWS | Airflow | BigQuery | CI/CDAccess to company courses | Automated processes | Charity support projects | Flexible schedule | Health and wellness supportEntry-level Full TimeUkraine - Remote R1d ago
-
Senior Embedded Software Test Automation Engineer PLN 185K-268KAgile | Artifactory | CI/CD | Continuous integration | GitHubHybrid work model | Work from home optionSenior-level Full TimeKrakow, Poland R1d ago
-
Ingénieur Data Analyst confirmé (H/F) EUR 45K-52KAzure | Azure Data | Azure Data Factory | Data Factory | Data PipelinesCooptation bonus | Learning opportunities | Meal tickets | Mobility assistance | Paid time offSenior-level Full TimeNantes, FR R1d ago
-
Application design | Azure | CI/CD | Code Reviews | ContainerizationContinuous learning opportunities | Engineering community mentorship | Flexible working hours | Remote home office optionSenior-level Full TimePécs, Baranya, HU, 7622 R1d ago
-
Data Engineer (f/m/d) EUR 38K-38KApache Spark | Azure | Azure Synapse | Azure Synapse Analytics | CI/CDFitness programs | Flexible working hours | Remote workEntry-level Full TimeDitzingen, Germany R1d ago
-
Power BI Data Engineer EUR 18K-18KAutomation | DAX | Data Governance | Data Modeling | Data QualityContinuous training | Discounts on insurance | Employee stock purchase | Flexible working hours | Free fruit and snacksMid-level Full TimeBARCELONA, B, ES, 08014 R1d ago
-
Data Analytics Engineer (m/f/d) EUR 60K-75KAWS Glue | AWS Lambda | AWS Step Functions | Amazon Athena | Amazon Redshift30 vacation days per year | Company pension scheme | Education budget | Hybrid work | Remote work up to 3 months per yearMid-level Full TimeBerlin R1d ago
-
Senior Machine Learning Engineer EUR 66K-78KA/B | A/B Testing | AWS | B testing | CI/CDAnnual holiday allowance | Enhanced family leave | Gym membership | Hybrid work | InsuranceSenior-level Full TimeMunich R1d ago
-
Senior Data Engineer EUR 53K-70KAccess Control | Airbyte | Amazon S3 | Apache Airflow | Apache FlinkCareer advancement | Professional development opportunitiesSenior-level Contract Full TimeCroatia - Remote R2d ago
-
Lead Data Platform Engineer USD 140K-198KAI-assisted coding | Assisted coding | BigQuery | CI/CD | Cloud RunFlexibility | Friendly team | High trust autonomy | Low bureaucracy | Remote workSenior-level Full TimeEurope, Remote R2d ago
-
Machine Learning Manager, Borrowing GBP 100K-160KAI Platform | AWS | BigQuery | Cloud platform | Credit RiskEquipment provided | Flexible working hours | Learning budget | Relocation support | Visa sponsorshipMid-level Full TimeCardiff, London or Remote (UK) R2d ago
-
Apache Kafka | Apache Spark | Azure HDInsight | Batch Processing | CI/CDCareer development | Community volunteering | Conferences and tech events | Employee representative council | Health insuranceSenior-level Full TimeNantes, Pays de la Loire, France R2d ago