Find jobs in AI/ML, Data Science and Big Data
26 results
for Direct Preference Optimization
(Skill/Tech stack)
-
实习-Ai研究员-大语言模型/视觉语言模型算法与后训练(博士优先) CNY 25K-37KAI Feedback | Direct Preference Optimization | Efficient Fine Tuning | Fine Tuning | FlaxEntry-level Internship上海2d ago
-
Causal Inference | Cross-modal fusion | Data Modeling | Direct Preference Optimization | Graph Neural NetworksEntry-level Full TimeSeattle, Washington, United States2d ago
-
Data Analysis | Dataset Processing | Direct Preference Optimization | Evaluation Pipelines | Fine TuningEntry-level InternshipSan Jose, California, United States4d ago
-
Lead Data Scientist - AI SGD 140K-162KAWS | Azure | Cloud Computing | Computer Vision | Data PreprocessingHybrid workSenior-level Full TimeSingapore5d ago
-
Senior Data Scientist INR 2520K-3880KChunking | Deep learning | Direct Preference Optimization | Document Embeddings | Fine TuningContinuing education program | Continuous learning | Family-friendly perks | Flexible time off | Health care coverageSenior-level Full TimeIN - AHMEDABAD, India5d ago
-
Applied Scientist II, Alexa International INR 360K-420KA/B | A/B Testing | B testing | Data Analysis | Deep learningEntry-level Full Time InternshipBengaluru, Karnataka, IND6d ago
-
Sr Staff AI Software Development Engineer GBP 55K-61KAWS | Artificial Intelligence | Azure | Databricks | Direct Preference OptimizationAccrued Paid Vacation | Commuter benefits | Dental insurance | Employee assistance program | Employee resource groupsSenior-level Full TimeCambridge, United Kingdom R7d ago
-
Senior Product Manager, LLM Post-Training & Evaluation USD 160K-170KAI Feedback | API Design | Agentic Evaluation | Benchmarking | Context evaluationSenior-level Full TimeRemote Work( USA), United States R7d ago
-
Data Scientist Lead - LLM (Chatbot) TWD 516K-612KAgent systems | Autogen | Bias detection | CrewAI | Direct Preference OptimizationSenior-level Full TimeTaiwan, Taipei9d ago
-
Senior AI Engineer Specialist INR 2500K-3500KAgentic AI | Apache Spark | Direct Preference Optimization | Distributed Computing | Embedding architecturesSenior-level Full TimeIND - Bengaluru - Esko-Graphics India …9d ago
-
Applied Scientist , Amazon Customer Service USD 142K-222KAgentic AI | Artificial Intelligence | Dataset curation | Direct Preference Optimization | Embedding ModelsMid-level Full TimeSanta Clara, California, USA13d ago
-
Senior Machine Learning Engineer, Personalization USD 184K-262KAWS | Apache Beam | Apache Spark | Cloud platform | Data Processing401k | Health insurance | Meal allowance | Paid flexible holidays | Paid parental leaveSenior-level Full TimeNew York, NY19d ago
-
AWS | Agent Orchestration | Autogen | Autonomous Agents | Direct Preference OptimizationBicycle subsidy | Corporate discounts | Corporate pension plan | Digital meal vouchers | Educational budgetSenior-level Full TimeBerlin, Germany20d ago
-
Staff Software Engineer, Generative AI, Core ML USD 207K-300KAI Feedback | Computer Vision | Data Processing | Deep learning | Digital TwinSenior-level Full TimeMountain View, CA, USA21d ago
-
Machine Learning Engineer (Post-Training) EUR 57K-84KAWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference OptimizationSenior-level Full TimeParis, France21d ago
-
Senior Applied Scientist USD 180K-230KDirect Preference Optimization | Distributed Training | Human Feedback | LLM-as-a-Judge | Language ModelsSenior-level Full TimePalo Alto23d ago
-
DDP | Deep learning | Direct Preference Optimization | Distributed Training | DockerSenior-level Full TimePangyo (Software Dream Center), South Korea28d ago
-
大模型应用算法工程师/专家 CNY 240K-480KC++ | Computer Vision | Deep learning | Direct Preference Optimization | Human Computer DialogueSenior-level Full Time上海、北京29d ago
-
Senior Applied AI Manager USD 170K-234KAgent systems | Agentic Systems | Curriculum learning | Data Deduplication | Data mixingSenior-level Full TimeSan Mateo, CA29d ago
-
Agent RL Infra Engineer USD 224K-356KAI Feedback | Active Learning | Cluster management | Continuous Learning | Data CurationSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Applied Reinforcement Learning Engineer USD 150K-160KActor-critic | Agent systems | BCQ | Behavioral cloning | CQLEqual opportunity employer | Hybrid remote work | Research publications opportunityMid-level Full TimeRemote Work( USA), United States R1mo ago
-
校招-Ai研究科学家-大语言模型/视觉语言模型算法与后训练(博士优先) CNY 500K-500KAdapters | Direct Preference Optimization | Fine Tuning | Flax | Function designNone Full Time上海1mo ago
-
Agent Orchestration | Amazon Web Services | Auto Planning | Autogen | Direct Preference OptimizationBicycle subsidy | Corporate discounts | Corporate pension plan | Digital meal vouchers | Educational budgetSenior-level Full TimeBerlin, Germany1mo ago
-
Benchmark design | Computer Vision | Deep learning | Direct Preference Optimization | Evaluation metricsCar to go subscriptions | Free parking | Learning opportunities | On site bakery | On-site restaurantsMid-level Full TimeJerusalem1mo ago
-
Staff AI Engineer, Model Post-Training and Alignment USD 196K-268KBenchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy OptimizationCompany events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowancesSenior-level Full TimeAPAC1mo ago
-
Senior AI Research Scientist (6240) USD 170K-270KAdversarial Learning | Attention Networks | Dash | Data Preprocessing | Data WranglingHybrid work schedule | Professional development programs | Travel for training and team buildingSenior-level Full TimeSan Jose, CA, US1mo ago