Find jobs in AI/ML, Data Science and Big Data
26 results
for RLAIF
(Skill/Tech stack)
-
DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine LearningMid-level Full Time上海6h ago
-
Mid-level Internship上海6h ago
-
Agent 全栈研发工程师(前/后端)-MiMo CNY 180K-300KAPI Design | Authorization | Automation | Benchmarking | CI/CDMid-level Full Time北京3d ago
-
Machine Learning Researcher - RL and Agentic Systems USD 190K-287KAgentic Systems | Benchmarking | Data Validation | Dataset Quality Evaluation | Dataset qualityMid-level Full TimeRemote R5d ago
-
Principal AI Research Scientist Post-Training Alignment CAD 123K-180KAgentic AI | Alignment research | DPO | Deep learning | Distributed TrainingSenior-level Full TimeAMER - Canada - Ontario - …7d ago
-
Researcher, Context - Agent Post-Training USD 250K-380KData Pipelines | Deep learning | Experimentation | Grading systems | Language ModelsMid-level Full TimeSan Francisco11d ago
-
Researcher, Connectors - Agent Post-Training USD 250K-380KAPIs | Data Pipelines | Deep learning | Evals | ExperimentationSenior-level Full TimeSan Francisco11d ago
-
Researcher, Computer Use - Agent Post-Training USD 250K-380KAgent systems | Browser Automation | Computer use | Data Pipelines | Desktop AutomationSenior-level Full TimeSan Francisco11d ago
-
Researcher, Artifacts - Agent Post-Training USD 250K-380KData Pipelines | Evals | Evaluation | Experimentation | GradingMid-level Full TimeSan Francisco11d ago
-
Machine Learning Engineer 5 USD 172K-306KAWS | Algorithms | Azure | Data Structures | Direct Preference OptimizationSenior-level Full TimeSan Jose, United States R12d ago
-
Senior-level Full TimeSan Jose, United States R12d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitEntry-level Internship深圳15d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data Retrieval | Data StorageEntry-level Internship深圳15d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data pipeline | Distributed SystemsEntry-level Internship深圳15d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitFlexible work schedule | Internship opportunity | MentorshipEntry-level Internship深圳15d ago
-
Applied Scientist II, Alexa International Team USD 142K-193KA/B | A/B Testing | B testing | DPO | Deep learningEntry-level Full Time InternshipBellevue, Washington, USA20d ago
-
Software Engineer - Machine Learning USD 190K-220KAdversarial Data | Adversarial Data Generation | Adversarial Training | Content Moderation | DPOMid-level ContractMountain View, CA22d ago
-
AI Engineer (LLM & ML) CAD 100K-125KAdversarial Models | Computer Vision | Deep learning | Embeddings | Feature EngineeringEmpowerment | Professional development | Startup culture | Work with top talentMid-level Full TimeSana'a, Yemen29d ago
-
Applied Scientist II, Alexa International INR 360K-420KA/B | A/B Testing | B testing | Data Analysis | Deep learningEntry-level Full Time InternshipBengaluru, Karnataka, IND1mo ago
-
Research Scientist, LLM Evaluation & Post-Training USD 150K-160KBenchmarking | Context evaluation | DPO | Data Processing | Error AnalysisSenior-level Full TimeRemote Work( USA), United States R1mo ago
-
Senior Data Science Lead - R01551331 INR 2500K-4500KARIMA | ARIMAX | BentoML | Decision Trees | Exponential SmoothingSenior-level Full TimeChennai, Tamil Nadu, India1mo ago
-
Senior Machine Learning Engineer (Small Language Models) USD 154K-189KAWS | Adapter-Tuning | Axolotl | Cloud Computing | Data labelingFlexible remote days | Flexible work scheduleSenior-level Full TimeCanada - Remote R1mo ago
-
Principal Data Scientist - R01556906 INR 2500K-4500KAPIs | AWS | Agent systems | Agentic AI | AutogenSenior-level Full TimeBangalore, Karnataka, India1mo ago
-
Applied AI Researcher, Post-Training USD 150K-250KAgentic collaboration | Continual Learning | Continual pretraining | DPO | Data Analysis401k | Commuter benefits | In-office lunch | Medical, dental & vision coverageMid-level Full TimeSan Francisco1mo ago
-
Principal Data Scientist - R01554881 INR 2500K-4500KAWS | Agentic AI | Autogen | Azure Machine Learning | Cloud deploymentSenior-level Full TimeBangalore, Karnataka, India1mo ago
-
Data Scientist - R01551326 INR 2500K-4500KARIMA | ARIMAX | Decision Trees | Exponential Smoothing | FaissSenior-level Full TimeChennai, Tamil Nadu, India1mo ago