Member of Engineering (Pre-training / Data Acquisition)
Remote (EMEA/East Coast)
R
USD 175K-270K (estimate) Mid-level Full Time
Tasks
- Align data acquisition priorities with model training
- Build deep crawlers for high value sources
- Build high throughput ingestion pipelines for partner data
- Build observability monitoring and debugging tools
- Design web crawler infrastructure
- Develop data acquisition roadmap
- Ingest and parse large scale web data
Perks/Benefits
- Company-provided equipment
- Flexible hours
- Frequent team get togethers
- Fully remote work
- Health insurance allowance
- Home-office allowance
- Parental leave
- People-first culture
- Vacation and holidays
- Well-being allowances
Skills/Tech-stack
AWS | Data Parsing | Data Privacy | Debugging | Distributed Systems | Distributed job queues | Docker | HTTP | Job queues | Kubernetes | Performance optimization | Python | Robots.txt | Web Crawling
Education
N/A
Related jobs
-
AWS ECS | AWS EMR | AWS Glue | AWS Lambda | AirflowAsync-friendly culture | Flexible working hours | Fully remote | Minimal bureaucracy | Personal development supportSenior-level Full TimeSwitzerland R11h ago
-
AWS Glue | AWS Lambda | Airflow | Amazon Athena | Amazon ECSAsync-friendly culture | Company offsites | Conference support | Flexible working hours | Fully remoteSenior-level Full TimeFrance R11h ago
-
API Development | AWS | Airflow | Athena | Data ProcessingAsync-friendly culture | Conference support | Flexible working hours | Fully remote | Personal development supportSenior-level Full TimeGermany R11h ago
-
AWS | AWS Glue | AWS Lambda | Airflow | Amazon AthenaAsync-friendly culture | Company offsites | Conference support | Flexible working hours | Fully remoteSenior-level Full TimeSpain R11h ago
-
Software Engineer, Data Infrastructure PLN 300K-347KAWS | Apache Spark | Azure | Data Ingestion | Data LakeCareer growth budget | Dental coverage | Family forming support | Fertility healthcare support | Group life insuranceSenior-level Full TimeWarsaw R13h ago
-
Senior-level Full TimeVitrolles, Provence-Alpes-Côte d'Azur, France R14h ago
-
Machine Learning Engineer EUR 32K-37KDocker | Kubernetes | MLOps | MLflow | Machine LearningRemote workMid-level Full TimeBerlin, Germany; Helsinki, Finland R14h ago
-
Data Engineer - EY GDS Spain - Hybrid EUR 65K-65KAWS | Apache Airflow | Apache Parquet | Avro | AzureContinuous learning programs | Flexible work-life integration | Hybrid work model | Volunteering opportunities | Well-being programsMid-level Full TimeMalaga, ES, 29590 R23h ago
-
Senior AI Engineer BGN 90K-105KAPI Design | AWS | Access Control | Amazon Bedrock | Amazon OpenSearchFully distributed remote | Paid holiday | Professional development | State-of-the-art hardware | Training & CertificationsSenior-level Full TimeSofia, Sofia City Province, Bulgaria - … R23h ago
-
AWS | Airflow | Apache Flink | Apache Hadoop | Apache KafkaFully paid parental leave | Home office stipend | Inclusive, diverse culture | Manager coaching | Paid time offSenior-level Full TimeRomania R1d ago
-
AWS | Apache Airflow | Apache Flink | Apache Kafka | Apache SparkFully paid parental leave | Fully remote first work environment | Home office stipend | Inclusive workplace culture | Manager coachingSenior-level Full TimeItaly R1d ago
-
AWS | Airflow | Apache Flink | Apache Kafka | Apache SparkFully paid parental leave | Fully remote-first | Home office stipend | Inclusive diverse workplace culture | Internal knowledge sharing programsSenior-level Full TimePortugal R1d ago
-
AWS | Airflow | Apache Flink | Apache Kafka | Apache SparkFully paid parental leave | Home office stipend | Inclusive, diverse culture | Manager coaching | Paid time offSenior-level Full TimeNetherlands R1d ago
-
AWS | Apache Airflow | Apache Flink | Apache Kafka | Apache SparkFully paid parental leave | Fully remote-first | Home office stipend | Inclusive diverse workplace | Manager coachingSenior-level Full TimeIreland R1d ago
-
AWS | Airflow | Apache Flink | Apache Kafka | Apache SparkFully paid parental leave | Fully remote-first | Home office stipend | Inclusive diverse workplace culture | Internal knowledge sharing programsSenior-level Full TimeFrance R1d ago
-
AWS | Apache Airflow | Apache Flink | Apache Kafka | Apache SparkHome office stipend | Inclusive diverse workplace culture | Internal knowledge sharing | Manager coaching | Paid parental leaveSenior-level Full TimeSpain R1d ago
-
AWS | Airflow | Apache Flink | Apache Hadoop | Apache KafkaFully paid parental leave | Fully remote first working environment | Home office stipend | Manager coaching | Paid time offSenior-level Full TimeGermany R1d ago
-
Machine Learning Researcher GBP 95K-110KBacktesting | Data pipeline | Deep learning | Experiment tracking | Feature EngineeringFlexible working hours | Foosball | Pension | Private health insurance | Relocation supportMid-level Full TimeLondon R1d ago
-
Experienced Data Engineer - Streaming Platform EUR 60K-80KAmazon Kinesis | Apache Flink | Apache Kafka | Apache Pulsar | AvroChild day care | Gym membership | Healthcare coverage | Lunch vouchers | Relocation assistanceMid-level Full TimeParis R1d ago
-
Amazon Redshift | Data Modeling | Data Pipelines | Data Warehousing | DjangoSenior-level Full TimeGermany R1d ago
-
Senior Python Engineer (GenAI & LLM Orchestration) USD 150K-160KAWS Bedrock | AWS Lambda | Amazon DynamoDB | Amazon ECS | Amazon S3100% remote | Equipment provided | Flexible hours | Paid public holidays | Paid sick leaveSenior-level Full TimeBelgrade R1d ago
-
AWS | Airbyte | Amazon Redshift | Apache Airflow | CI/CDHybrid work | On-call rotation | Remote-first workMid-level Full TimeCape Town, Western Cape, South Africa R1d ago
-
AWS | Agile | Azure | CI/CD | DevOpsEnglish classes | Free office food and drinks | Internal training | Paid certifications | Paid time offSenior-level Full TimeZaragoza, Spain R1d ago
-
Apigee | Artificial Intelligence | Bash | BigQuery | Cloud HostingEnglish classes | Free Office Meals and Drinks | Free parking | Paid certifications | Paid vacationMid-level Full TimeMadrid, Spain R1d ago
-
AI Engineer (Hybrid) USD 125K-175KAgent Orchestration | Agent systems | Autogen | CI/CD | ChromaDBHybrid workMid-level Full TimeGiza, El Omraniya, Egypt R1d ago