Find jobs in AI/ML, Data Science and Big Data
47 results for Direct Preference Optimization (Skill/Tech stack)
-
LLM Fine-Tuning Engineer USD 100K-150K
Adapter-Tuning | Automated Benchmarks | Data Curation | Direct Preference Optimization | Distributed Training
Mid-level Full Time | United States - Remote R | 2d ago
-
Mid-level Full Time | Tel Aviv-Yafo, Tel Aviv District, IL | 2d ago
-
LLM Fine-Tuning Engineer USD 100K-150K
Attention Optimization | DPO | Direct Preference Optimization | Distributed Training | Evaluation
Mid-level Full Time | United States - Remote R | 5d ago
-
LLM Fine-Tuning Engineer USD 100K-150K
Adapters | Attention Optimization | Cluster operations | Data Generation | DeepSpeed ZeRO
Remote work
Mid-level Full Time | United States - Remote R | 6d ago
-
LLM Fine-Tuning Engineer USD 100K-150K
Adapters | Attention Optimization | Benchmarking | Dataset curation | Direct Preference Optimization
Mid-level Full Time | United States - Remote R | 6d ago
-
Senior Machine Learning Engineer, AI Platform USD 150K-210K
Artificial Intelligence | Batch Processing | Data Analysis | Data Pipelines | Data Privacy
Senior-level Full Time | Boston, MA | 7d ago
-
Lead AI Research Engineer USD 200K-300K
Active Learning | Deep learning | Direct Preference Optimization | Fine Tuning | Language Models
Compute budget | Open source contribution support | Publication support
Senior-level Full Time | California | 11d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Andorra) USD 150K-225K
Direct Preference Optimization | Fine Tuning | Huggingface | Human Feedback | Information Retrieval
Co-working space budget | Equipment provided | Fully remote | Health insurance support | Learning budget
Senior-level Full Time | Andorra R | 14d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Turkey) TRY 840K-1080K
Classifiers | Data labeling | Direct Preference Optimization | Evaluation | Fine Tuning
Access to AI tools | Annual in-person meetup | Co-working space budget | Equipment budget | Fully remote
Senior-level Full Time | Turkey R | 14d ago
-
Classification | Context window | Context window management | Data labeling | Direct Preference Optimization
Co-working space budget | Fully remote | Health insurance support | Learning budget | Paid time off
Senior-level Full Time | Ireland R | 14d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Hungary) HUF 11000K-18960K
Classifiers | Context window | Context window optimization | Data cleaning | Data labeling
AI tools access | Co-working space budget | Company laptop | Fully remote | Health insurance allowance
Senior-level Full Time | Hungary R | 14d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Norway) NOK 1100K-1250K
Classification | Data labeling | Dataset cleaning | Direct Preference Optimization | Fine Tuning
AI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remote
Senior-level Full Time | Norway R | 14d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Poland) PLN 324K-450K
Classification | Context window | Context window optimization | Data cleaning | Data labeling
AI tools access | Co-working space budget | Equipment budget | Fully remote | Health and wellness support
Senior-level Full Time | Poland R | 14d ago
-
Classifier Training | Context window | Context window optimization | Data cleaning | Data labeling
Access to AI tools | Co-working space budget | Equipment provided | Fully remote | Health insurance allowance
Senior-level Full Time | Germany R | 14d ago
-
Context window | Context window optimization | Data labeling | Dataset cleaning | Direct Preference Optimization
1:1 psychologist sessions | AI tools access | Annual in-person meetup | Co-working space budget | Company-provided equipment
Senior-level Full Time | Croatia R | 14d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Spain) EUR 80K-100K
Data labeling | Direct Preference Optimization | Fine Tuning | Huggingface | Human Feedback
Co-working space budget | Company equipment provided | Fully remote | Health and wellness support | Learning budget
Senior-level Full Time | Spain R | 14d ago
-
Classifier Training | Context window | Context window management | Data labeling | Dataset cleaning
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health insurance support
Senior-level Full Time | Cyprus R | 14d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Gibraltar) USD 150K-225K
Classifier Training | Context window | Context window optimization | Data cleaning | Data labeling
AI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remote
Senior-level Full Time | Gibraltar R | 14d ago
-
Data labeling | Dataset cleaning | Direct Preference Optimization | Fine Tuning | Hugging Face
1 on 1 therapy sessions | Co-working space budget | Equipment provided | Fully remote | Health insurance support
Senior-level Full Time | Netherlands R | 14d ago
-
Classification | Data labeling | Direct Preference Optimization | Fine Tuning | Huggingface
AI tools access | Co-working space budget | Company laptop and equipment budget | Fully remote | Health and wellness support
Senior-level Full Time | France R | 14d ago
-
AI Feedback | Agentic Systems | Direct Preference Optimization | Distributed Training | Evaluation
Senior-level Full Time | AMER - United States - California … R | 15d ago
-
Data Curation | Deep learning | DeepSpeed | Direct Preference Optimization | Evaluation
Senior-level Full Time | Singapore, Singapore | 15d ago
-
Data labeling | Direct Preference Optimization | Fine Tuning | Huggingface | Human Feedback
AI tools access | Co-working budget | Equipment provided | Fully remote | Health and wellness support
Senior-level Full Time | Europe R | 15d ago
-
Machine Learning Engineer 5 USD 172K-306K
AWS | Algorithms | Azure | Data Structures | Direct Preference Optimization
Senior-level Full Time | San Jose, United States R | 19d ago
-
LLM Foundation Model Algorithm Intern CNY 25K-37K
BERT | CLIP | Data Synthesis | Deep learning | Direct Preference Optimization
Entry-level Internship | Shenzhen, Shanghai | 19d ago
-
Research Scientist, Safety Post Training USD 216K-270K
Adversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human Feedback
Commuter stipend | Comprehensive health insurance | Dental insurance | Learning and development stipend | Paid time off
Senior-level Full Time | San Francisco, CA; New York, NY | 22d ago
-
Staff Engineering Analyst Manager, Veo and Robotics USD 189K-274K
Coaching | Data Analysis | Deep learning | Direct Preference Optimization | Fine Tuning
Senior-level Full Time | Sunnyvale, CA, USA | 22d ago
-
Applied AI Engineer USD 99K-225K
AWS | AgentOps | Azure | ChromaDB | Continued Pretraining
401k retirement plan | Bike storage | Commuter benefits | Dependent care FSA | Desk setup stipend
Mid-level Full Time | Washington DC | 23d ago
-
AI Scientist GBP 46K-46K
Azure | Azure OpenAI | Azure OpenAI Services | Databricks | Dataset Preparation
Mid-level Full Time | London, United Kingdom | 26d ago
-
Applied Scientist II, Alexa International Team USD 142K-193K
A/B | A/B Testing | AI Feedback | B testing | C++
Entry-level Full Time Internship | Bellevue, Washington, USA | 27d ago
-
Machine Learning Engineer, TikTok - Business Governance USD 145K-250K
AI Agents | Audio Processing | Content Moderation | Deep learning | Direct Preference Optimization
Mid-level Full Time | San Jose, California, United States | 27d ago
-
Applied Scientist II, Alexa International Team USD 142K-193K
A/B | A/B Testing | AI Feedback | B testing | Deep learning
Entry-level Full Time Internship | Bellevue, Washington, USA | 28d ago
-
Senior Manager Data Scientist SGD 120K-162K
AWS | Cloud Computing | Cloud platform | Data Preprocessing | Deep learning
Senior-level Full Time | Singapore | 29d ago
-
Machine Learning Engineer, Global Public Sector GBP 100K-170K
Benchmarking | Bias Mitigation | Deep learning | Direct Preference Optimization | Distributed Training
Mid-level Full Time | Doha, Qatar; London, UK | 1mo ago
-
Alignment | Benchmark design | Constitutional AI | Continued Pretraining | Data Curation
Senior-level Full Time | Dublin, CA (HQ) | 1mo ago
-
Applied AI Researcher (India) INR 2000K-3465K
AWS | Automated testing | Azure | CI/CD | Cloud Computing
Mid-level Full Time | India/Bengaluru | 1mo ago
-
Applied AI Researcher (Dublin, CA) USD 239K-331K
CI/CD | Computer Vision | Data Preprocessing | Deep learning | Direct Preference Optimization
Mid-level Full Time | Dublin, CA (HQ) | 1mo ago
-
Staff Machine Learning Engineer GBP 90K-120K
Bias Evaluation | Data Pipelines | Direct Preference Optimization | Fine Tuning | GPU Optimization
Senior-level Full Time | United Kingdom | 1mo ago
-
Senior Principal Machine Learning Engineer (Fulfilment) SGD 182K-240K
Decision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic Models
Birthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leave
Executive-level Full Time | Singapore, Singapore | 1mo ago
-
Internship - AI Researcher - Large Language Model/Vision-Language Model Algorithms and Post-Training (PhD preferred) CNY 25K-37K
AI Feedback | Direct Preference Optimization | Efficient Fine Tuning | Fine Tuning | Flax
Entry-level Internship | Shanghai | 1mo ago
-
Causal Inference | Cross-modal fusion | Data Modeling | Direct Preference Optimization | Graph Neural Networks
Entry-level Full Time | Seattle, Washington, United States | 1mo ago
-
Data Analysis | Dataset Processing | Direct Preference Optimization | Evaluation Pipelines | Fine Tuning
Entry-level Internship | San Jose, California, United States | 1mo ago
-
Lead Data Scientist - AI SGD 140K-162K
AWS | Azure | Cloud Computing | Computer Vision | Data Preprocessing
Hybrid work
Senior-level Full Time | Singapore | 1mo ago
-
Applied Scientist II, Alexa International INR 360K-420K
A/B | A/B Testing | B testing | Data Analysis | Deep learning
Entry-level Full Time Internship | Bengaluru, Karnataka, IND | 1mo ago
-
Sr Staff AI Software Development Engineer GBP 55K-61K
AWS | Artificial Intelligence | Azure | Databricks | Direct Preference Optimization
Accrued Paid Vacation | Commuter benefits | Dental insurance | Employee assistance program | Employee resource groups
Senior-level Full Time | Cambridge, United Kingdom R | 1mo ago
-
Data Scientist Lead - LLM (Chatbot) TWD 516K-612K
Agent systems | Autogen | Bias detection | CrewAI | Direct Preference Optimization
Senior-level Full Time | Taiwan, Taipei | 1mo ago
-
Senior AI Engineer Specialist INR 2500K-3500K
Agentic AI | Apache Spark | Direct Preference Optimization | Distributed Computing | Embedding architectures
Senior-level Full Time | IND - Bengaluru - Esko-Graphics India … | 1mo ago