Member of Engineering (Pre-training / Data Research)
Remote (EMEA/East Coast)
R
USD 160K-300K (estimate) Mid-level Full Time
Tasks
- Build distributed data pipelines
- Collaborate with pretraining posttraining evals and product teams
- Deduplicate datasets
- Design data curation pipelines
- Generate synthetic data
- Improve pretraining dataset quality
- Optimize data mixing
- Run training experiments and ablations
- Track model evaluation results
Perks/Benefits
- Company-provided equipment
- Flexible hours
- Frequent team get togethers
- Fully remote work
- Health insurance allowance
- Home-office allowance
- Learning allowance
- Vacation and holidays
- Well-being allowance
Skills/Tech-stack
Curriculum learning | Data Ablation | Data Curation | Data Pipelines | Data mixing | Deduplication | Distributed Systems | Distributed data | Distributed data pipelines | GPU clusters | Language Models | Large Language Models | Machine Learning | Prompt engineering | Python | Scaling Laws | Tokenization | Transformers
Education
N/A
Related jobs
-
Consultant.e AI Engineer EUR 50K-60KAI Foundry | APIs | Apache Spark | Azure AI | Azure AI FoundryHybrid work | RTT | Restaurant ticket | TrainingSenior-level Full TimeNiort, Deux-Sèvres, Nouvelle-Aquitaine, FR R17h ago
-
Consultant.e AI Engineer EUR 50K-60KAI Foundry | Apache Spark | Azure | Azure AI | Azure AI FoundryCareer growth | Human-sized company | Hybrid work | Individualized coaching | Meal cardSenior-level Full TimeNantes, Loire-Atlantique, Pays de la Loire, … R17h ago
-
Consultant.e AI Engineer EUR 60K-70KAI Foundry | API Development | Azure AI | Azure AI Foundry | Azure CognitiveHybrid work | Meal card | RTT | Training opportunitiesSenior-level Full TimeParis, Paris, Île-de-France, FR R17h ago
-
AI RMF | C++ | Container Security | Data exfiltration | FedRAMPFinancial benefits | Flexible work arrangements | Health benefits | Remote work | Well-being benefitsSenior-level Full TimePoland R17h ago
-
AI Engineer (m/f/n) PLN 282K-402KAWS | AWS Lambda | Apache Kafka | CI/CD | Cloud platformB2B contract | Flexible office or remote work | International cross functional environment | Remote work | Training opportunitiesSenior-level Full TimeWarszawa, Województwo mazowieckie, Poland R18h ago
-
Senior DevOps Engineer (Full Remote from France) EUR 55K-70KArgoCD | Bash | BigQuery | CI/CD | Cloud StorageFull remoteSenior-level Full TimeParis, IDF, France R19h ago
-
ML Engineer EUR 45K-60KCI/CD | Cloud Computing | Data Pipelines | Data Quality | Data VisualizationFlexible working hours | Hybrid work model | Work from anywhere up to 3 weeks per yearMid-level Full TimeVilnius, Vilnius City Municipality, Lithuania R1d ago
-
Data Engineer, Medical Data Foundation PLN 276K-276KAlation | Azure Data | Azure Data Factory | Azure Synapse | Azure Synapse AnalyticsAfter-work events | Hybrid work | Life insurance | Lunch card | Multisport cardMid-level Full TimePoland - Warsaw R1d ago
-
API Development | API Gateway | AWS Lambda | Agentic AI | Amazon APIHybrid work model | Remote work days per weekSenior-level Full TimeEspoo, Finland R1d ago
-
AI Engineering Team Lead GBP 70K-103KAPI Development | Backend Development | Cloud Computing | Data Pipelines | DeploymentEngaging employee programs | Flexible time off policy | Life assurance | Pension | Private health coverageSenior-level Full TimeUnited Kingdom Remote R1d ago
-
Aerodynamics | Aircraft Conceptual Design | Aircraft Performance | Aircraft Preliminary Design | C++Childcare tickets | Flexible working hours | Language training | Life and accident insurance | Meal cardMid-level Full TimeTres Cantos, Spain R1d ago
-
Data Scientist GBP 50K-58KAgile | Airflow | Cloud Computing | Cloud platform | Data GovernanceContributory pension scheme | Enhanced Adoption Pay | Enhanced maternity pay | Private healthcare | Professional development opportunitiesMid-level Full TimeBirmingham, United Kingdom R1d ago
-
Data Engineer EUR 53K-70KAPI | Agile | Artificial Intelligence | Big Data | Data ModelingCareer advancement | Employee assistance program | Health insurance | Training opportunities | Work-life balanceSenior-level Full TimeHíbrido Venda do Pinheiro/ Porto, Híbrido … R1d ago
-
Apache Spark | Azure HDInsight | Cloudera | Databricks | DataikuCareer development | Community engagement platform | Employee representatives council CSE | Health insurance | Meal vouchersSenior-level Full TimeNantes, Pays de la Loire, France R1d ago
-
Apache Hadoop | Apache Kafka | Apache Spark | CI/CD | Cloud ComputingCareer development | Citizenship platform | Cooptation bonus | Employee representative body | Health insuranceSenior-level Full TimeParis, IDF, France R1d ago
-
Data Scientist GBP 70K-80KA/B | A/B Testing | B testing | Causal Inference | Churn PredictionAuto-enrolment pension | Discounts | Employee assistance programme | Enhanced parental leave | Flexible hoursSenior-level Full TimeLondon, United Kingdom R1d ago
-
Senior Data Engineer Databricks (m/f/d) EUR 36K-45KAgile | Amazon Web Services | Apache Spark | Azure | CI/CDAccess to health wellbeing and legal advice | Childcare support | Coursera access | Experience days | Flexible scheduleSenior-level Full TimeGranada, AN, Spain R1d ago
-
Senior Data Scientist EUR 50K-72KBayesian Modeling | Causal Inference | Cloud Data | Cloud Data Pipelines | Data PipelinesHybrid workSenior-level Full TimeHelsinki Metropolitan Area R1d ago
-
AWS CDK | AWS Lambda | AWS SageMaker | Amazon S3 | Apache IcebergDog-friendly offices | Flexible working hours | Home-office allowance | Hybrid work setup | Learning daysEntry-level Part TimeBerlin, Germany; Hamburg, Germany R1d ago
-
Medior Data Engineer - DLI002392 EUR 51K-74KApache Airflow | Azure | Azure Batch | Azure Event | Azure Event HubsHybrid workMid-level Full TimeMechelen, Belgium R1d ago
-
Data & Analytics Engineer for AI Platform team (hybrid) RON 296K-396KDBT | Elasticsearch | Grafana | LLM Evaluation | LookerE-learning access | Flexible work schedule | Hybrid work option | Meal tickets | Medical services packageSenior-level Full TimeCalea Floreasca 246c, 014476 Bucharest R1d ago
-
.NET | Angular | Azure | Azure DevOps | C#Great Place to Work 10 years | Great place to work | Hybrid work | Job Mobility | Maternity return policySenior-level Full TimeToulouse, Occitanie, France R1d ago
-
AI Engineer SEK 775K-930KAWS | Agentic Workflows | Cloud infrastructure | Evaluation Pipelines | GraphQLAfterworks and team activities | Hybrid work | Ongoing learningEntry-level Full TimeStockholm, Sweden R1d ago
-
Deep Learning Scientist EUR 45K-50KAgile | Air-gapped | Air-gapped systems | CI_CD | Computer VisionEmployee savings and profit sharing | Family health insurance | Meal benefits | Paid time off | Retirement savings plan matchingSenior-level Full TimeRennes R1d ago
-
Data Scientist - UK GBP 60K-70KA/B | A/B Testing | Applied statistics | B testing | BI toolsLearning and development budget | Paid time off | Parental leave | Pension | Sick leaveMid-level Full TimeRemote, United Kingdom R1d ago