Find jobs in AI/ML, Data Science and Big Data
36 results
for Direct Preference Optimization
(Skill/Tech stack)
-
Senior Machine Learning Engineer, Agentic USD 163K-245KArtificial Intelligence | Direct Preference Optimization | Evaluation | Fine Tuning | Human-in-the-loop401k matching | Catered meals | Employee events | Employer-paid disability insurance | Employer-paid life insuranceSenior-level Full TimeBellevue, WA; Menlo Park, CA1d ago
-
Research Scientist, Safety Post Training USD 216K-270KAdversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human FeedbackCommuter stipend | Comprehensive health insurance | Dental insurance | Learning and development stipend | Paid time offSenior-level Full TimeSan Francisco, CA; New York, NY1d ago
-
Staff Engineering Analyst Manager, Veo and Robotics USD 189K-274KCoaching | Data Analysis | Deep learning | Direct Preference Optimization | Fine TuningSenior-level Full TimeSunnyvale, CA, USA2d ago
-
LLM Fine-Tuning Engineer USD 150K-270KBenchmarking | Direct Preference Optimization | Distributed Training | Efficient Attention | FSDPMid-level Full TimeUnited States - Remote R3d ago
-
AI Scientist GBP 46K-46KAzure | Azure OpenAI | Azure OpenAI Services | Databricks | Dataset PreparationMid-level Full TimeLondon, United Kingdom5d ago
-
Applied Scientist II, Alexa International Team USD 142K-193KA/B | A/B Testing | AI Feedback | B testing | C++Entry-level Full Time InternshipBellevue, Washington, USA6d ago
-
Machine Learning Engineer, TikTok - Business Governance USD 145K-250KAI Agents | Audio Processing | Content Moderation | Deep learning | Direct Preference OptimizationMid-level Full TimeSan Jose, California, United States7d ago
-
Applied Scientist II, Alexa International Team USD 142K-193KA/B | A/B Testing | AI Feedback | B testing | Deep learningEntry-level Full Time InternshipBellevue, Washington, USA7d ago
-
Senior AI/ML Engineer USD 150K-180KAWS | Azure | Azure Machine Learning | CI/CD | Code reviewHealth benefits | Mentorship and career development | Remote-first work environmentSenior-level Full TimeRemote R8d ago
-
Senior Manager Data Scientist SGD 120K-162KAWS | Cloud Computing | Cloud platform | Data Preprocessing | Deep learningSenior-level Full TimeSingapore9d ago
-
Data Analysis | Data Science | Direct Preference Optimization | Fine Tuning | Language ModelsSenior-level Full TimeSunnyvale, CA, USA11d ago
-
Machine Learning Engineer, Global Public Sector GBP 100K-170KBenchmarking | Bias Mitigation | Deep learning | Direct Preference Optimization | Distributed TrainingMid-level Full TimeDoha, Qatar; London, UK11d ago
-
Senior Applied Scientist USD 142K-270KData Pipelines | Diffusion Models | Direct Preference Optimization | Fine Tuning | Generative AISenior-level Full TimeSan Jose, United States R12d ago
-
Amazon Web Services | Apache Beam | Apache Spark | Cloud platform | Data Processing401k retirement plan | Flexible holidays | Health insurance | Meal allowance | Paid HolidaysSenior-level Full TimeNew York, NY14d ago
-
Alignment | Benchmark design | Constitutional AI | Continued Pretraining | Data CurationSenior-level Full TimeDublin, CA (HQ)14d ago
-
Applied AI Researcher (India) INR 2000K-3465KAWS | Automated testing | Azure | CI/CD | Cloud ComputingMid-level Full TimeIndia/Bengaluru14d ago
-
Applied AI Researcher (Dublin, CA) USD 239K-331KCI/CD | Computer Vision | Data Preprocessing | Deep learning | Direct Preference OptimizationMid-level Full TimeDublin, CA (HQ)14d ago
-
Staff Machine Learning Engineer GBP 90K-120KBias Evaluation | Data Pipelines | Direct Preference Optimization | Fine Tuning | GPU OptimizationSenior-level Full TimeUnited Kingdom19d ago
-
Senior Principal Machine Learning Engineer (Fulfilment) SGD 182K-240KDecision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic ModelsBirthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore20d ago
-
实习-Ai研究员-大语言模型/视觉语言模型算法与后训练(博士优先) CNY 25K-37KAI Feedback | Direct Preference Optimization | Efficient Fine Tuning | Fine Tuning | FlaxEntry-level Internship上海22d ago
-
Causal Inference | Cross-modal fusion | Data Modeling | Direct Preference Optimization | Graph Neural NetworksEntry-level Full TimeSeattle, Washington, United States23d ago
-
Data Analysis | Dataset Processing | Direct Preference Optimization | Evaluation Pipelines | Fine TuningEntry-level InternshipSan Jose, California, United States25d ago
-
Lead Data Scientist - AI SGD 140K-162KAWS | Azure | Cloud Computing | Computer Vision | Data PreprocessingHybrid workSenior-level Full TimeSingapore26d ago
-
Applied Scientist II, Alexa International INR 360K-420KA/B | A/B Testing | B testing | Data Analysis | Deep learningEntry-level Full Time InternshipBengaluru, Karnataka, IND27d ago
-
Sr Staff AI Software Development Engineer GBP 55K-61KAWS | Artificial Intelligence | Azure | Databricks | Direct Preference OptimizationAccrued Paid Vacation | Commuter benefits | Dental insurance | Employee assistance program | Employee resource groupsSenior-level Full TimeCambridge, United Kingdom R27d ago
-
Data Scientist Lead - LLM (Chatbot) TWD 516K-612KAgent systems | Autogen | Bias detection | CrewAI | Direct Preference OptimizationSenior-level Full TimeTaiwan, Taipei30d ago
-
Senior AI Engineer Specialist INR 2500K-3500KAgentic AI | Apache Spark | Direct Preference Optimization | Distributed Computing | Embedding architecturesSenior-level Full TimeIND - Bengaluru - Esko-Graphics India …30d ago
-
Applied Scientist , Amazon Customer Service USD 142K-222KAgentic AI | Artificial Intelligence | Dataset curation | Direct Preference Optimization | Embedding ModelsMid-level Full TimeSanta Clara, California, USA1mo ago
-
Senior Machine Learning Engineer, Personalization USD 184K-262KAWS | Apache Beam | Apache Spark | Cloud platform | Data Processing401k | Health insurance | Meal allowance | Paid flexible holidays | Paid parental leaveSenior-level Full TimeNew York, NY1mo ago
-
Staff Software Engineer, Generative AI, Core ML USD 207K-300KAI Feedback | Computer Vision | Data Processing | Deep learning | Digital TwinSenior-level Full TimeMountain View, CA, USA1mo ago
-
Senior Applied Scientist USD 180K-230KDirect Preference Optimization | Distributed Training | Human Feedback | LLM-as-a-Judge | Language ModelsSenior-level Full TimePalo Alto1mo ago
-
DDP | Deep learning | Direct Preference Optimization | Distributed Training | DockerSenior-level Full TimePangyo (Software Dream Center), South Korea1mo ago
-
大模型应用算法工程师/专家 CNY 240K-480KC++ | Computer Vision | Deep learning | Direct Preference Optimization | Human Computer DialogueSenior-level Full Time上海、北京1mo ago
-
Senior Applied AI Manager USD 170K-234KAgent systems | Agentic Systems | Curriculum learning | Data Deduplication | Data mixingSenior-level Full TimeSan Mateo, CA1mo ago
-
Agent RL Infra Engineer USD 224K-356KAI Feedback | Active Learning | Cluster management | Continuous Learning | Data CurationSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Applied Reinforcement Learning Engineer USD 150K-160KActor-critic | Agent systems | BCQ | Behavioral cloning | CQLEqual opportunity employer | Hybrid remote work | Research publications opportunityMid-level Full TimeRemote Work( USA), United States R1mo ago