Member of Engineering (Pre-training / Data Research)
Remote (EMEA/East Coast)
R
USD 160K-300K (estimate) Mid-level Full Time
Tasks
- Build distributed data pipelines
- Collaborate with pretraining posttraining evals and product teams
- Deduplicate datasets
- Design data curation pipelines
- Generate synthetic data
- Improve pretraining dataset quality
- Optimize data mixing
- Run training experiments and ablations
- Track model evaluation results
Perks/Benefits
- Company-provided equipment
- Flexible hours
- Frequent team get togethers
- Fully remote work
- Health insurance allowance
- Home-office allowance
- Learning allowance
- Vacation and holidays
- Well-being allowance
Skills/Tech-stack
Curriculum learning | Data Ablation | Data Curation | Data Pipelines | Data mixing | Deduplication | Distributed Systems | Distributed data | Distributed data pipelines | GPU clusters | Language Models | Large Language Models | Machine Learning | Prompt engineering | Python | Scaling Laws | Tokenization | Transformers
Education
N/A
Related jobs
-
AI Engineer H/F - CDI EUR 50K-65KAI Agents | Agent systems | Cloud Computing | Deep learning | Fine TuningCooptation bonus | Equipment bonus | Flexible remote work | Health insurance | Meal vouchersMid-level Full TimeParis, IDF, France R11h ago
-
AWS | Azure | Data Governance | Data Marts | Data ModelingCSE | Career development opportunities | Cooptation bonus | Diversity initiatives | Employee representative councilSenior-level Full TimeAix-en-Provence, Provence-Alpes-Côte d'Azur, France R11h ago
-
Data Scientist Confirmé - H/F - CDI EUR 50K-65KAWS SageMaker | Apache Spark | Azure Machine Learning | Big Data | Data VisualizationEmployee stock ownership | Equipment stipend | Health insurance | Mobility | Modern office locationSenior-level Full TimeParis, IDF, France R16h ago
-
AWS | CI/CD | Dataiku | GitLab CI | HDFSEmployee representative council | Health insurance | Meal vouchers | Profit sharing | Referral bonusSenior-level Full TimeVilleneuve-d'Ascq, Hauts-de-France, France R19h ago
-
Data Scientist confirmé / AI Engineer EUR 50K-55KAzure | CI/CD | Docker | Docker Compose | GCPHealth insurance | Telework | Ticket restaurant | Works CouncilMid-level Full TimeCourbevoie, IDF, France R19h ago
-
Data Scientist Paris EUR 55K-55KApache Spark | Docker | Generative AI | Kubernetes | Language ModelsLong-term project | Partial remote workSenior-level Full TimeParis, IDF, France R19h ago
-
AI Agents | Artificial Intelligence | Backend Development | CI/CD | Cloud ComputingCoworking access | Flexible work arrangement | Fully remote | Healthcare coverage | Home-office equipmentMid-level Full TimeItaly R23h ago
-
AI Agents | Backend Development | Cloud Computing | Docker | Frontend DevelopmentCoworking space access | Flexible work from anywhere | Fully remote | Healthcare coverage | Home-office equipmentMid-level Full TimeNetherlands R23h ago
-
AI Agents | Artificial Intelligence | Backend Development | Cloud Platforms | ContainersCoworking access | Fully remote | Healthcare coverage | Home-office equipment | Ownership and autonomyMid-level Full TimeIreland R23h ago
-
AI Agents | Backend Development | Cloud Platforms | Docker | Frontend DevelopmentCoworking space access | Fully remote | Healthcare coverage | Home-office equipment | Ownership and autonomyMid-level Full TimeFrance R23h ago
-
AI Agents | API Integration | Backend Development | Cloud Platforms | ContainerizationCoworking space access | Engineering autonomy | Healthcare coverage | Home-office equipment provided | Remote workMid-level Full TimeSpain R23h ago
-
AI | AI Agents | Backend Development | Cloud Platforms | ContainersCoworking spaces | Flexible work location | Fully remote | Healthcare coverage | Home-office equipmentMid-level Full TimeGermany R23h ago
-
Lead GIS Data Engineer PLN 206K-282KArcGIS | ArcGIS Enterprise | ArcGIS Network Analyst | ArcGIS Pro | ArcpyBonus | Flexible working hours | Life insurance | Medical coverage | Paid time offSenior-level Full TimePoland Home Office, Poland R23h ago
-
Data Engineer - Flutter Functions, Hybrid RON 264K-387KAirflow | Azure Data | Azure Data Factory | Backward Compatibility | DBTAnnual leave | Career growth sessions | Dental insurance | Extended health insurance | Flexible benefitsSenior-level Full TimeCluj-Napoca, Romania R1d ago
-
Analista Mlops (Híbrido Madrid/Málaga) EUR 33K-40KAPI | Apache Spark | CI/CD | ETL | EnglishEmployee benefits club | Good work culture | Hybrid work | Technical team support | Unlimited trainingSenior-level Full TimeMADRID, MADRID R1d ago
-
Mid-level Full TimeRemote - France R1d ago
-
Staff Cyber Security Engineer – AI Data Protection PLN 284K-391KAI Security | AWS | Agile | Automation accounts | AzureFlexible working | Health and wellness coverage | Retirement and savings plans | Work-life balance supportSenior-level Full TimeKrakow, Poland R1d ago
-
Applied AI Engineer GBP 85K-110KA/B | A/B Testing | Anthropic | B testing | ExperimentationFully remote | Global engineering collaboration | High ownership culture | Learning and development budgetMid-level Full TimeUnited Kingdom R1d ago
-
Applied AI Engineer CHF 106K-158KA/B | A/B Testing | API Integration | Anthropic API | B testingFully remote | Global Engineering Organization | High ownership culture | Learning and development budgetMid-level Full TimeZurich, Switzerland R1d ago
-
Mid-level Full TimeTel Aviv-Yafo, center, Israel (Hybrid) R1d ago
-
Embedded Software Engineer (Poland) PLN 180K-338KAnalog circuits | Automated testing | BLE | Bash | Bluetooth Low EnergyEducation and training programs | English teachers | Medical insurance | Remote workSenior-level Full TimeKraków, Poland R1d ago
-
Lead AI Engineer (AI Systems & Automation) GBP 78K-109KAlerting | Anthropic | Distributed Systems | Docker | EmbeddingsFully remote | Global engineering collaboration | High ownership culture | Learning and development budgetSenior-level Full TimeUnited Kingdom R1d ago
-
Lead AI Engineer (AI Systems & Automation) CHF 129K-204KAlerting | Anthropic API | Distributed Systems | Docker | EmbeddingsFully remote | High ownership culture | Learning and development budgetSenior-level Full TimeZurich, Switzerland R1d ago
-
Data Architect (m/w/d) EUR 56K-75KAWS | Analytics | Azure | CI/CD | ClaudeAnnual bonus | Childcare support | Coaching | Company bike leasing | Fitness programMid-level Full TimeSt. Georgen im Schwarzwald, Hannover, 100% … R1d ago
-
Ultralytics LLM Engineer EUR 60K-76KAPI Development | AWS | Azure | Data Quality | DebuggingBirthday off | Home setup allowance | Hybrid work | Independent contractor eligibility | Local holidaysMid-level Full TimeMadrid, Remote EURO R1d ago