Find jobs in AI/ML, Data Science and Big Data
74 results for Preference optimization (Skill/Tech stack)
-
LLM Fine-Tuning Engineer USD 100K-150KAdapter-Tuning | Automated Benchmarks | Data Curation | Direct Preference Optimization | Distributed TrainingMid-level Full TimeUnited States - Remote R2d ago
-
Mid-level Full TimeTel Aviv-Yafo, Tel Aviv District, IL3d ago
-
Senior AI Engineer - LLMs and Finetuning GBP 84K-109KAWS | Benchmarking | Data Generation | Distillation | Distributed TrainingAI experimentation budget | Enhanced parental leave | Flexible working arrangements | Group life assurance | Hybrid working modelSenior-level Full TimeLondon5d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAttention Optimization | DPO | Direct Preference Optimization | Distributed Training | EvaluationMid-level Full TimeUnited States - Remote R5d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapters | Attention | DPO | Dataset curation | Distributed TrainingMid-level Full TimeUnited States - Remote R5d ago
-
Research Engineer - LLM Training & Alignment Systems CAD 127K-225KAutomation | Benchmarking | C# | C++ | Data CurationMid-level Contract Full TimeKingston, Ontario, Canada7d ago
-
Agentic AI Engineer USD 86K-120KAgent Frameworks | ArangoDB | Attention Mechanism | Autogen | BenchmarksEntry-level Full TimeCARY 02, United States7d ago
-
DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine LearningMid-level Full TimeShanghai7d ago
-
Senior Machine Learning Engineer , AI Platform USD 150K-210KArtificial Intelligence | Batch Processing | Data Analysis | Data Pipelines | Data PrivacySenior-level Full TimeBoston, MA8d ago
-
Sr. Data Scientist INR 2500K-3380KApache Spark | Deep learning | Drift Detection | Evaluation | Experimentation platformsSenior-level Full TimeBangalore,India9d ago
-
Lead AI Research Engineer USD 200K-300KActive Learning | Deep learning | Direct Preference Optimization | Fine Tuning | Language ModelsCompute budget | Open source contribution support | Publication supportSenior-level Full TimeCalifornia11d ago
-
Applied Scientist II, Sponsored Products and Brands - Advertiser Growth and Strategies USD 142K-223KAgent systems | Automated benchmarking | Chain-of-Thought | DPO | Dataset curationMid-level Full TimeNew York, New York, USA13d ago
-
Director, Data Science, Foundation Model AI USD 194K-305KAI architecture | Biological Data | Biological foundation models | Causal analysis | Deep learning401k | Medical/Dental/Vision insurance | Paid Holidays | Retirement benefits | VacationExecutive-level Full TimeUSA - Massachusetts - Cambridge (320 …14d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Andorra) USD 150K-225KDirect Preference Optimization | Fine Tuning | Huggingface | Human Feedback | Information RetrievalCo-working space budget | Equipment provided | Fully remote | Health insurance support | Learning budgetSenior-level Full TimeAndorra R15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Turkey) TRY 840K-1080KClassifiers | Data labeling | Direct Preference Optimization | Evaluation | Fine TuningAccess to AI tools | Annual in-person meetup | Co-working space budget | Equipment budget | Fully remoteSenior-level Full TimeTurkey R15d ago
-
Classifier Training | Content Moderation | DPO | Data cleaning | Data labelingAI tools access | Co-working space budget | Equipment provided | Fully remote | Health insurance allowanceSenior-level Full TimeSlovakia R15d ago
-
Classification | Context window | Context window management | DPO | Data cleaningAccess to mental health counseling | Co-working space budget | Company equipment provision | Fully remote | Health insurance allowanceSenior-level Full TimeGreece R15d ago
-
Classification | Context window | Context window management | Data labeling | Direct Preference OptimizationCo-working space budget | Fully remote | Health insurance support | Learning budget | Paid time offSenior-level Full TimeIreland R15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Moldova) USD 150K-225KDPO | Data labeling | Fine Tuning | Huggingface | Inference OptimizationAI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness supportSenior-level Full TimeMoldova R15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Hungary) HUF 11000K-18960KClassifiers | Context window | Context window optimization | Data cleaning | Data labelingAI tools access | Co-working space budget | Company laptop | Fully remote | Health insurance allowanceSenior-level Full TimeHungary R15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Norway) NOK 1100K-1250KClassification | Data labeling | Dataset cleaning | Direct Preference Optimization | Fine TuningAI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remoteSenior-level Full TimeNorway R15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Poland) PLN 324K-450KClassification | Context window | Context window optimization | Data cleaning | Data labelingAI tools access | Co-working space budget | Equipment budget | Fully remote | Health and wellness supportSenior-level Full TimePoland R15d ago
-
Classifier Training | Context window | Context window optimization | Data cleaning | Data labelingAccess to AI tools | Co-working space budget | Equipment provided | Fully remote | Health insurance allowanceSenior-level Full TimeGermany R15d ago
-
Context window | Context window optimization | Data labeling | Dataset cleaning | Direct Preference Optimization1:1 psychologist sessions | AI tools access | Annual in-person meetup | Co-working space budget | Company-provided equipmentSenior-level Full TimeCroatia R15d ago
-
Classifier Training | DPO | Data cleaning | Data labeling | Fine TuningAI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remoteSenior-level Full TimeAustria R15d ago
-
Classifier Training | Context window | Context window optimization | Data labeling | Dataset cleaningAI tool access | Annual in-person meetup | Co-working budget | Equipment provided | Health and wellness supportSenior-level Full TimePortugal R15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Spain) EUR 80K-100KData labeling | Direct Preference Optimization | Fine Tuning | Huggingface | Human FeedbackCo-working space budget | Company equipment provided | Fully remote | Health and wellness support | Learning budgetSenior-level Full TimeSpain R15d ago
-
Classifier Training | Context window | Context window management | Data labeling | Dataset cleaningAI tools access | Co-working space budget | Equipment provided | Fully remote | Health insurance supportSenior-level Full TimeCyprus R15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Czech Republic) CZK 1020K-1200KClassifiers | Context window | Context window optimization | DPO | Data labelingAI tools access | Co-working budget | Equipment stipend | Fully remote | Health & wellness supportSenior-level Full TimeCzech Republic R15d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Gibraltar) USD 150K-225KClassifier Training | Context window | Context window optimization | Data cleaning | Data labelingAI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remoteSenior-level Full TimeGibraltar R15d ago
-
Content Moderation | DPO | Data labeling | Fine Tuning | HuggingfaceAI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remoteSenior-level Full TimeFinland R15d ago
-
Data labeling | Dataset cleaning | Direct Preference Optimization | Fine Tuning | Hugging Face1 on 1 therapy sessions | Co-working space budget | Equipment provided | Fully remote | Health insurance supportSenior-level Full TimeNetherlands R15d ago
-
Classification | Data labeling | Direct Preference Optimization | Fine Tuning | HuggingfaceAI tools access | Co-working space budget | Company laptop and equipment budget | Fully remote | Health and wellness supportSenior-level Full TimeFrance R15d ago
-
AI Feedback | Agentic Systems | Direct Preference Optimization | Distributed Training | EvaluationSenior-level Full TimeAMER - United States - California … R15d ago
-
Senior Applied Scientist USD 142K-270KAI Model Training | AI model | Fine Tuning | Generative AI | Inference OptimizationSenior-level Full TimeSan Jose, United States R15d ago
-
Data Curation | Deep learning | DeepSpeed | Direct Preference Optimization | EvaluationSenior-level Full TimeSingapore, Singapore16d ago
-
Data labeling | Direct Preference Optimization | Fine Tuning | Huggingface | Human FeedbackAI tools access | Co-working budget | Equipment provided | Fully remote | Health and wellness supportSenior-level Full TimeEurope R16d ago
-
Machine Learning Engineer 5 USD 172K-306KAWS | Algorithms | Azure | Data Structures | Direct Preference OptimizationSenior-level Full TimeSan Jose, United States R19d ago
-
Senior-level Full TimeSan Jose, United States R19d ago
-
LLM Foundation Model Algorithm Intern CNY 25K-37KBERT | CLIP | Data Synthesis | Deep learning | Direct Preference OptimizationEntry-level InternshipShenzhen, Shanghai19d ago
-
Agent Orchestration | Agent systems | Autogen | Automated Evaluation | BenchmarkingSenior-level Full TimeSeoul HQ22d ago
-
Research Scientist, Safety Post Training USD 216K-270KAdversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human FeedbackCommuter stipend | Comprehensive health insurance | Dental insurance | Learning and development stipend | Paid time offSenior-level Full TimeSan Francisco, CA; New York, NY22d ago
-
Staff Engineering Analyst Manager, Veo and Robotics USD 189K-274KCoaching | Data Analysis | Deep learning | Direct Preference Optimization | Fine TuningSenior-level Full TimeSunnyvale, CA, USA23d ago
-
Applied AI Engineer USD 99K-225KAWS | AgentOps | Azure | ChromaDB | Continued Pretraining401k retirement plan | Bike storage | Commuter benefits | Dependent care FSA | Desk setup stipendMid-level Full TimeWashington DC23d ago
-
AI Scientist GBP 46K-46KAzure | Azure OpenAI | Azure OpenAI Services | Databricks | Dataset PreparationMid-level Full TimeLondon, United Kingdom26d ago
-
Lead Data Scientist- Comp Intel INR 2040K-3500KAgent systems | Apache Spark | Deep learning | Drift Detection | Embedding systemsSenior-level Full TimeTower 02, Manyata Embassy Business Park, …26d ago
-
Applied Scientist II, Alexa International Team USD 142K-193KA/B | A/B Testing | AI Feedback | B testing | C++Entry-level Full Time InternshipBellevue, Washington, USA27d ago
-
Machine Learning Engineer, TikTok - Business Governance USD 145K-250KAI Agents | Audio Processing | Content Moderation | Deep learning | Direct Preference OptimizationMid-level Full TimeSan Jose, California, United States28d ago
-
Applied Scientist II, Alexa International Team USD 142K-193KA/B | A/B Testing | AI Feedback | B testing | Deep learningEntry-level Full Time InternshipBellevue, Washington, USA28d ago
-
Researcher, Alignment Oversight USD 250K-445KEvaluation Design | Experimentation | Human-in-the-loop | Language Models | Large Language ModelsHybrid work model | Relocation assistanceMid-level Full TimeSan Francisco29d ago