Find jobs in AI/ML, Data Science and Big Data
105 results for DPO (Skill/Tech stack)
-
Senior Machine Learning Engineer – LLMs
EUR 62K-90K
Accelerate | Axolotl | BF16 | DPO | Data Deduplication
Autonomy | Hybrid work model | Professional growth | Top-spec equipment
Senior-level | Full Time
Netherlands - Amsterdam
1d ago
-
Senior-level | Full Time
Netherlands - Amsterdam
1d ago
-
Machine Learning Research Engineer | Kilby Labs
USD 149K-258K
C++ | DPO | Deep learning | Embeddings | Few-Shot Learning
Mid-level | Full Time
United States
1d ago
-
LLM Fine-Tuning Engineer
USD 100K-150K
Attention Optimization | DPO | Direct Preference Optimization | Distributed Training | Evaluation
Mid-level | Full Time
United States - Remote R
5d ago
-
LLM Fine-Tuning Engineer
USD 100K-150K
Adapter methods | DPO | Dataset curation | Distributed Training | Efficient Attention
Mid-level | Full Time
United States - Remote R
5d ago
-
LLM Fine-Tuning Engineer
USD 100K-150K
Adapters | Attention | DPO | Dataset curation | Distributed Training
Mid-level | Full Time
United States - Remote R
5d ago
-
Gen-AI Data Scientist Lead
USD 140K-180K
Accelerate | Anthropic | Artificial Intelligence | Azure OpenAI | DPO
Flexible working options | Inclusive work culture | Opportunities to grow | Supportive team
Senior-level | Full Time
New York, United States - New …
5d ago
-
AI Researcher
PHP 219K-252K
ASR | Agentic AI | Audio signal processing | DPO | Embeddings
English communication required | Open ended research opportunities | Remote work
Mid-level | Full Time
South America, Europe, Asia R
6d ago
-
Applied Scientist 5
INR 2475K-4500K
3D Reconstruction | Adapters | CLIP | Computer Vision | ControlNet
Senior-level | Full Time
Bangalore, India R
6d ago
-
Applied Scientist 5.5
INR 2475K-4500K
3D Reconstruction | Adapters | CLIP | Computer Vision | ControlNet
Senior-level | Full Time
Bangalore, India R
6d ago
-
Data Scientist - Agentic AI Systems - Loops
USD 140K-150K
Agent coordination | Autogen | DPO | Decision Making | Decision-making models
401k match | Dental insurance | Disability benefits | Flexible paid time off | Flexible spending accounts
Mid-level | Full Time
Palo Alto, California, United States
7d ago
-
AI Engineer Intern
EUR 22K-22K
DPO | Embedding Models | Environment variables | Git | Langchain
Birthday day off | Flexible smart working | Free water and coffee at office | Physical and mental well being moments | Training & development opportunities
Entry-level | Full Time | Internship
MILAN, Italy
7d ago
-
Agentic AI Engineer
USD 86K-120K
Agent Frameworks | ArangoDB | Attention Mechanism | Autogen | Benchmarks
Entry-level | Full Time
CARY 02, United States
7d ago
-
Director, AI Enterprise Architect
USD 175K-250K
APIs | AWS | Agentic Workflows | Anomaly Detection | Azure
Senior-level | Full Time
United States
7d ago
-
DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine Learning
Mid-level | Full Time
Shanghai
8d ago
-
Mid-level | Internship
Shanghai
8d ago
-
Senior-level | Full Time
Seoul, Korea
8d ago
-
Bias Mitigation | DPO | Data Pipelines | Deep learning | Fine Tuning
Senior-level | Full Time
Seoul, Korea
8d ago
-
Cost Optimization | DPO | Data Pipelines | GPU | Inference architecture
Senior-level | Full Time
Seoul, Korea
8d ago
-
AI Institute - Multimodal Team - Multimodal Understanding Algorithm Researcher - Reinforcement Learning Track
CNY 240K-480K
DPO | Data Preprocessing | Data cleaning | DeepSpeed | Distributed Training
Senior-level | Full Time
Beijing R
9d ago
-
DPO | Data alignment | Deep learning | Language Processing | Machine Learning
Senior-level | Full Time
Menlo Park, CA
12d ago
-
A/B | A/B Testing | Attention | B testing | CI/CD
Senior-level | Full Time
India
12d ago
-
Applied Scientist II, Sponsored Products and Brands - Advertiser Growth and Strategies
USD 142K-223K
Agent systems | Automated benchmarking | Chain-of-Thought | DPO | Dataset curation
Mid-level | Full Time
New York, New York, USA
13d ago
-
AI Engineer, Agent Platform
USD 120K-220K
API Development | Caching | DPO | Database Design | Evaluation
401k matching | Commuter benefits | Dental insurance | FSA | HSA
Senior-level | Full Time
Mountain View, California, United States
13d ago
-
Senior AI & Data Engineer
INR 3000K-4000K
A/B | A/B Testing | API Design | Async Processing | Autogen
Senior-level | Full Time
Bengaluru, Karnātaka, India
14d ago
-
Principal AI Research Scientist Post-Training Alignment
CAD 123K-180K
Agentic AI | Alignment research | DPO | Deep learning | Distributed Training
Senior-level | Full Time
AMER - Canada - Ontario - …
14d ago
-
Classification | Content Moderation | DPO | Dataset Preparation | Fine Tuning
Co-working space budget | Fully remote | Health insurance support | Learning budget | Paid time off
Senior-level | Full Time
Ukraine R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Sweden)
SEK 738K-930K
Classification | DPO | Data labeling | Dataset cleaning | Evaluation Frameworks
AI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remote
Senior-level | Full Time
Sweden R
15d ago
-
Classifiers | DPO | Data labeling | Evaluation | Fine Tuning
Annual in-person meetup | Co-working space budget | Equipment provided | Full remote | Health and wellness support
Senior-level | Full Time
Latvia R
15d ago
-
Classifier Training | Content Moderation | DPO | Data cleaning | Data labeling
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health insurance allowance
Senior-level | Full Time
Slovakia R
15d ago
-
Classification | Context window | Context window management | DPO | Data cleaning
Access to mental health counseling | Co-working space budget | Company equipment provision | Fully remote | Health insurance allowance
Senior-level | Full Time
Greece R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Romania)
RON 245K-348K
DPO | Fine Tuning | Huggingface | Human Feedback | Inference Optimization
AI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remote
Senior-level | Full Time
Romania R
15d ago
-
Classifier Training | DPO | Fine Tuning | Huggingface | Human Feedback
AI tools access | Annual in-person meetup | Co-working space budget | Company laptop | Fully remote
Senior-level | Full Time
Switzerland R
15d ago
-
Classifier Training | Context window | Context window optimization | DPO | Data cleaning
AI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remote
Senior-level | Full Time
Lithuania R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Serbia)
USD 150K-225K
Classification | Context window | Context window optimization | DPO | Data cleaning
Annual in-person meetup | Co-working space budget | Company laptop | Fully remote | Health and wellness support
Senior-level | Full Time
Serbia R
15d ago
-
DPO | Fine Tuning | Huggingface | LLM | Machine Learning
Co-working space budget | Equipment provided | Fully remote | Health insurance allowance | Learning budget
Senior-level | Full Time
Luxembourg R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Malta)
EUR 79K-100K
DPO | Data labeling | Fine Tuning | Huggingface | Human Feedback
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness support
Senior-level | Full Time
Malta R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Moldova)
USD 150K-225K
DPO | Data labeling | Fine Tuning | Huggingface | Inference Optimization
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness support
Senior-level | Full Time
Moldova R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Italy)
EUR 79K-100K
Classification Models | Context Management | DPO | Data cleaning | Data labeling
AI tools access | Annual in-person gathering | Co-working space budget | Equipment provided | Fully remote
Senior-level | Full Time
Italy R
15d ago
-
DPO | Data labeling | Fine Tuning | Huggingface | Human Feedback
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness support
Senior-level | Full Time
Slovenia R
15d ago
-
Classification Algorithms | Context window | Context window optimization | DPO | Data labeling
AI tools access | Co-working space budget | Company equipment | Fully remote | Health insurance support
Senior-level | Full Time
Montenegro R
15d ago
-
DPO | Data labeling | Fine Tuning | Huggingface | LLM
Co-working space budget | Company laptop and equipment budget | Fully remote | Health insurance allowance | Learning budget
Senior-level | Full Time
Belgium R
15d ago
-
Classifier Training | DPO | Data cleaning | Data labeling | Fine Tuning
AI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remote
Senior-level | Full Time
Austria R
15d ago
-
Classification | Context window | Context window optimization | DPO | Data cleaning
AI tools access | Annual meetup | Co-working budget | Equipment budget | Fully remote
Senior-level | Full Time
Estonia R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Czech Republic)
CZK 1020K-1200K
Classifiers | Context window | Context window optimization | DPO | Data labeling
AI tools access | Co-working budget | Equipment stipend | Fully remote | Health & wellness support
Senior-level | Full Time
Czech Republic R
15d ago
-
Classification | DPO | Data labeling | Dataset cleaning | Evaluation Frameworks
AI tools access | Annual in-person meetup | Co-working space budget | Company laptop provided | Fully remote
Senior-level | Full Time
Bulgaria R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - UK)
GBP 90K-140K
DPO | Data labeling | Evaluation Frameworks | Fine Tuning | Huggingface
AI tools access | Annual in-person meetup | Co-working space budget | Equipment allowance | Fully remote
Senior-level | Full Time
United Kingdom R
15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Denmark)
DKK 516K-580K
DPO | Fine Tuning | Hugging Face | Language Models | Large Language Models
AI tools access | Annual in-person meetup | Co-working space budget | Equipment budget | Fully remote
Senior-level | Full Time
Denmark R
15d ago
-
Content Moderation | DPO | Data labeling | Fine Tuning | Huggingface
AI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remote
Senior-level | Full Time
Finland R
15d ago
-
Senior NLP/LLM Engineer
PLN 237K-326K
BERT | DPO | Deep learning | Entity recognition | Fine Tuning
English lessons discount | Health benefits | Professional training reimbursement | Remote work | Vacation
Senior-level | Full Time
Worldwide R
17d ago