Find jobs in AI/ML, Data Science and Big Data
45 results for Preference optimization (Skill/Tech stack)
-
AI Feedback | Checkpointing | Cost Performance | Cost-performance tradeoffs | Data Decontamination
401k matching | Country specific visa support | Flexible work arrangements | Medical, dental, and vision options | Parental leave
Senior-level Full Time
Palo Alto, California, United States
20h ago
-
LLM Engineer USD 100K-150K
Adapter methods | DPO | Deep reinforcement learning | Distributed Training | Efficient Attention
Benefits | Career growth | Mentorship | Remote work
Mid-level Full Time
United States - Remote R
1d ago
-
AI Feedback | Deep learning | Direct Preference Optimization | Fine Tuning | Human Feedback
Mid-level Full Time
Shanghai
3d ago
-
Senior AI/ML engineer GBP 120K-150K
AWS | CI/CD | Databricks | Deep learning | Delta Lake
Accelerated professional growth | Enhanced parental leave | Female health leave | Fully paid sabbatical | Health pension wellbeing benefits
Senior-level Full Time
London R
4d ago
-
Junior Foundation AI Engineer EUR 30K
AWS | Accelerate | Azure | CUDA | Cloud Computing
Corporate welfare | Health insurance | Meal vouchers | Smart working | Training
Entry-level Full Time
Milano (Bassi), Italy
5d ago
-
Mid-level Full Time
United States - Remote R
5d ago
-
LLM Engineer USD 100K-150K
Adapters | DeepSpeed ZeRO | Direct Preference Optimization | Efficient Attention | FSDP
Mid-level Full Time
United States - Remote R
5d ago
-
LLM Engineer USD 100K-150K
Adapters | Attention Optimization | DPO | Distributed Training | Evaluation benchmarks
Mid-level Full Time
United States - Remote R
5d ago
-
Staff Software Engineer, AI/ML USD 216K-271K
AI Feedback | Agentic AI | Data Pipelines | Direct Preference Optimization | Experimentation platforms
Conference reimbursement | Education reimbursement | Employee assistance program | Employee stock purchase program | Equity compensation
Senior-level Full Time
Seattle
6d ago
-
Senior Solutions Architect, Generative AI Research USD 184K-287K
AI Agents | AI Feedback | Agent evaluation | Artificial Intelligence | Batching
Senior-level Full Time
US, FL, Remote, United States R
6d ago
-
Senior Applied Scientist USD 142K-270K
Data Pipelines | Diffusion Models | Direct Preference Optimization | Evaluation metrics | Fine Tuning
Senior-level Full Time
Seattle, United States R
6d ago
-
Director, Reinforcement Learning & Agentic Post-Training EUR 151K-200K
AI Feedback | API Integration | Distributed Training | Environment Design | Evaluation
Executive-level Full Time
Paris, France
6d ago
-
Senior Applied Scientist, Alexa AI USD 167K-227K
Agentic Architectures | Automated Training | Automated training pipelines | C++ | DPO
Senior-level Full Time
Turin, Piedmont, ITA
8d ago
-
Senior Software Engineer - Model Training & AI Evals INR 3500K-5000K
AI Feedback | Ablation Studies | Benchmarking | CI/CD | Data Generation
Senior-level Full Time
Remote (India) R
12d ago
-
Staff AI research scientist USD 234K-296K
Adversarial Training | Agentic Systems | Benchmark design | Data Curation | Data Generation
Company holidays | Company offsites | Dental insurance | Dependent FSA | Fertility support
Senior-level Full Time
San Francisco, CA
13d ago
-
Director, Applied Science, Alexa for Shopping (Rufus) USD 262K-350K
Agent systems | Deep Deterministic Policy Gradient | Direct Preference Optimization | Distillation | Experimentation
401k matching | Dental insurance | Employee assistance program | Health insurance | Mental health support
Executive-level Full Time
Seattle, Washington, USA
13d ago
-
Senior Machine Learning Engineer USD 188K-282K
Adversarial Training | Calibration monitoring | Continuous batching | DPO | Deep learning
Senior-level Full Time
Palo Alto, CA
14d ago
-
Staff/Senior AI Engineer, AI for Code GBP 81K-115K
Agentic Workflows | Benchmarking | Context engineering | Cost Optimization | Fine Tuning
Extra time off | Flexible work location | Internal events | Language classes | Learning and development opportunities
Senior-level Full Time
Amsterdam, Netherlands; Belgrade, Serbia; Berlin, Germany; … R
15d ago
-
Data Analysis | Deep learning | Direct Preference Optimization | Fine Tuning | Language Models
Senior-level Full Time
Sunnyvale, CA, USA
17d ago
-
Research Scientist, LLM Evaluation & Post-Training USD 150K-300K
AI Feedback | Alignment | Benchmarking | Context evaluation | Deep learning
Mid-level Full Time
Remote Work (USA), United States R
18d ago
-
Mid-level Full Time
Tel Aviv-Yafo, Tel Aviv District, IL
23d ago
-
Research Engineer - LLM Training & Alignment Systems CAD 127K-225K
Automation | Benchmarking | C# | C++ | Data Curation
Mid-level Contract Full Time
Kingston, Ontario, Canada
27d ago
-
Senior Machine Learning Engineer, AI Platform USD 150K-210K
Artificial Intelligence | Batch Processing | Data Analysis | Data Pipelines | Data Privacy
Senior-level Full Time
Boston, MA
28d ago
-
Sr. Data Scientist INR 2500K-3380K
Apache Spark | Deep learning | Drift Detection | Evaluation | Experimentation platforms
Senior-level Full Time
Bangalore, India
29d ago
-
Applied Scientist II, Sponsored Products and Brands - Advertiser Growth and Strategies USD 142K-223K
Agent systems | Automated benchmarking | Chain-of-Thought | DPO | Dataset curation
Mid-level Full Time
New York, New York, USA
1mo ago
-
AI Feedback | Agentic Systems | Direct Preference Optimization | Distributed Training | Evaluation
Senior-level Full Time
AMER - United States - California … R
1mo ago
-
Senior Applied Scientist USD 142K-270K
AI Model Training | AI model | Fine Tuning | Generative AI | Inference Optimization
Senior-level Full Time
San Jose, United States R
1mo ago
-
Data Curation | Deep learning | DeepSpeed | Direct Preference Optimization | Evaluation
Senior-level Full Time
Singapore, Singapore
1mo ago
-
Machine Learning Engineer 5 USD 172K-306K
AWS | Algorithms | Azure | Data Structures | Direct Preference Optimization
Senior-level Full Time
San Jose, United States R
1mo ago
-
Senior-level Full Time
San Jose, United States R
1mo ago
-
LLM Foundation Model Algorithm Intern CNY 25K-37K
BERT | CLIP | Data Synthesis | Deep learning | Direct Preference Optimization
Entry-level Internship
Shenzhen, Shanghai
1mo ago
-
Agent Orchestration | Agent systems | Autogen | Automated Evaluation | Benchmarking
Senior-level Full Time
Seoul HQ
1mo ago
-
Research Scientist, Safety Post Training USD 216K-270K
Adversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human Feedback
Commuter stipend | Comprehensive health insurance | Dental insurance | Learning and development stipend | Paid time off
Senior-level Full Time
San Francisco, CA; New York, NY
1mo ago
-
Applied AI Engineer USD 99K-225K
AWS | AgentOps | Azure | ChromaDB | Continued Pretraining
401k retirement plan | Bike storage | Commuter benefits | Dependent care FSA | Desk setup stipend
Mid-level Full Time
Washington DC
1mo ago
-
Lead Data Scientist - Comp Intel INR 2040K-3500K
Agent systems | Apache Spark | Deep learning | Drift Detection | Embedding systems
Senior-level Full Time
Tower 02, Manyata Embassy Business Park, …
1mo ago
-
Machine Learning Engineer, TikTok - Business Governance USD 145K-250K
AI Agents | Audio Processing | Content Moderation | Deep learning | Direct Preference Optimization
Mid-level Full Time
San Jose, California, United States
1mo ago
-
Applied Scientist II, Alexa International Team USD 142K-193K
A/B | A/B Testing | AI Feedback | B testing | Deep learning
Entry-level Full Time Internship
Bellevue, Washington, USA
1mo ago
-
Researcher, Alignment Oversight USD 250K-445K
Evaluation Design | Experimentation | Human-in-the-loop | Language Models | Large Language Models
Hybrid work model | Relocation assistance
Mid-level Full Time
San Francisco
1mo ago
-
Senior Manager Data Scientist SGD 120K-162K
AWS | Cloud Computing | Cloud platform | Data Preprocessing | Deep learning
Senior-level Full Time
Singapore
1mo ago
-
Machine Learning Engineer, Global Public Sector GBP 100K-170K
Benchmarking | Bias Mitigation | Deep learning | Direct Preference Optimization | Distributed Training
Mid-level Full Time
Doha, Qatar; London, UK
1mo ago
-
Alignment | Benchmark design | Constitutional AI | Continued Pretraining | Data Curation
Senior-level Full Time
Dublin, CA (HQ)
1mo ago
-
Alignment | Benchmark design | DPO | Data Curation | Data Deduplication
Senior-level Full Time
India/Bengaluru
1mo ago
-
Constitutional AI | Continued Pretraining | DPO | Data Curation | Deduplication
Senior-level Full Time
Brazil/Remote R
1mo ago
-
Applied AI Researcher (India) INR 2000K-3465K
AWS | Automated testing | Azure | CI/CD | Cloud Computing
Mid-level Full Time
India/Bengaluru
1mo ago
-
Applied AI Researcher (Dublin, CA) USD 239K-331K
CI/CD | Computer Vision | Data Preprocessing | Deep learning | Direct Preference Optimization
Mid-level Full Time
Dublin, CA (HQ)
1mo ago