Find jobs in AI/ML, Data Science and Big Data
28 results for Direct Preference Optimization (Skill/Tech stack)
-
AI Feedback | Deep learning | Direct Preference Optimization | Fine Tuning | Human Feedback
Mid-level Full Time
Shanghai
3d ago
-
Junior Foundation AI Engineer EUR 30K
AWS | Accelerate | Azure | CUDA | Cloud Computing
Corporate welfare | Health insurance | Meal vouchers | Smart working | Training
Entry-level Full Time
Milano (Bassi), Italy
5d ago
-
LLM Engineer USD 100K-150K
Adapters | DeepSpeed ZeRO | Direct Preference Optimization | Efficient Attention | FSDP
Mid-level Full Time
United States - Remote R
5d ago
-
Staff Software Engineer, AI/ML USD 216K-271K
AI Feedback | Agentic AI | Data Pipelines | Direct Preference Optimization | Experimentation platforms
Conference reimbursement | Education reimbursement | Employee assistance program | Employee stock purchase program | Equity compensation
Senior-level Full Time
Seattle
6d ago
-
Senior Solutions Architect, Generative AI Research USD 184K-287K
AI Agents | AI Feedback | Agent evaluation | Artificial Intelligence | Batching
Senior-level Full Time
US, FL, Remote, United States R
6d ago
-
Senior Applied Scientist USD 142K-270K
Data Pipelines | Diffusion Models | Direct Preference Optimization | Evaluation metrics | Fine Tuning
Senior-level Full Time
Seattle, United States R
6d ago
-
Senior Applied Scientist, Alexa AI USD 167K-227K
Agentic Architectures | Automated Training | Automated training pipelines | C++ | DPO
Senior-level Full Time
Turin, Piedmont, ITA
8d ago
-
LLM Fine-Tuning Engineer USD 100K-150K
Adapter-Tuning | Dataset curation | Direct Preference Optimization | Distributed Training | Efficient Attention
Mid-level Full Time
United States - Remote R
8d ago
-
Senior Software Engineer - Model Training & AI Evals INR 3500K-5000K
AI Feedback | Ablation Studies | Benchmarking | CI/CD | Data Generation
Senior-level Full Time
Remote (India) R
12d ago
-
Staff AI research scientist USD 234K-296K
Adversarial Training | Agentic Systems | Benchmark design | Data Curation | Data Generation
Company holidays | Company offsites | Dental insurance | Dependent FSA | Fertility support
Senior-level Full Time
San Francisco, CA
12d ago
-
Director, Applied Science, Alexa for Shopping (Rufus) USD 262K-350K
Agent systems | Deep Deterministic Policy Gradient | Direct Preference Optimization | Distillation | Experimentation
401k matching | Dental insurance | Employee assistance program | Health insurance | Mental health support
Executive-level Full Time
Seattle, Washington, USA
13d ago
-
Data Analysis | Deep learning | Direct Preference Optimization | Fine Tuning | Language Models
Senior-level Full Time
Sunnyvale, CA, USA
16d ago
-
Research Scientist, LLM Evaluation & Post-Training USD 150K-300K
AI Feedback | Alignment | Benchmarking | Context evaluation | Deep learning
Mid-level Full Time
Remote Work (USA), United States R
18d ago
-
Mid-level Full Time
Tel Aviv-Yafo, Tel Aviv District, IL
22d ago
-
Senior Machine Learning Engineer, AI Platform USD 150K-210K
Artificial Intelligence | Batch Processing | Data Analysis | Data Pipelines | Data Privacy
Senior-level Full Time
Boston, MA
27d ago
-
AI Feedback | Agentic Systems | Direct Preference Optimization | Distributed Training | Evaluation
Senior-level Full Time
AMER - United States - California … R
1mo ago
-
Data Curation | Deep learning | DeepSpeed | Direct Preference Optimization | Evaluation
Senior-level Full Time
Singapore, Singapore
1mo ago
-
Machine Learning Engineer 5 USD 172K-306K
AWS | Algorithms | Azure | Data Structures | Direct Preference Optimization
Senior-level Full Time
San Jose, United States R
1mo ago
-
LLM Foundation Model Algorithm Intern CNY 25K-37K
BERT | CLIP | Data Synthesis | Deep learning | Direct Preference Optimization
Entry-level Internship
Shenzhen, Shanghai
1mo ago
-
Research Scientist, Safety Post Training USD 216K-270K
Adversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human Feedback
Commuter stipend | Comprehensive health insurance | Dental insurance | Learning and development stipend | Paid time off
Senior-level Full Time
San Francisco, CA; New York, NY
1mo ago
-
Applied AI Engineer USD 99K-225K
AWS | AgentOps | Azure | ChromaDB | Continued Pretraining
401k retirement plan | Bike storage | Commuter benefits | Dependent care FSA | Desk setup stipend
Mid-level Full Time
Washington DC
1mo ago
-
Machine Learning Engineer, TikTok - Business Governance USD 145K-250K
AI Agents | Audio Processing | Content Moderation | Deep learning | Direct Preference Optimization
Mid-level Full Time
San Jose, California, United States
1mo ago
-
Applied Scientist II, Alexa International Team USD 142K-193K
A/B | A/B Testing | AI Feedback | B testing | Deep learning
Entry-level Full Time Internship
Bellevue, Washington, USA
1mo ago
-
Senior Manager Data Scientist SGD 120K-162K
AWS | Cloud Computing | Cloud platform | Data Preprocessing | Deep learning
Senior-level Full Time
Singapore
1mo ago
-
Machine Learning Engineer, Global Public Sector GBP 100K-170K
Benchmarking | Bias Mitigation | Deep learning | Direct Preference Optimization | Distributed Training
Mid-level Full Time
Doha, Qatar; London, UK
1mo ago
-
Alignment | Benchmark design | Constitutional AI | Continued Pretraining | Data Curation
Senior-level Full Time
Dublin, CA (HQ)
1mo ago
-
Applied AI Researcher (India) INR 2000K-3465K
AWS | Automated testing | Azure | CI/CD | Cloud Computing
Mid-level Full Time
India/Bengaluru
1mo ago
-
Applied AI Researcher (Dublin, CA) USD 239K-331K
CI/CD | Computer Vision | Data Preprocessing | Deep learning | Direct Preference Optimization
Mid-level Full Time
Dublin, CA (HQ)
1mo ago