Find jobs in AI/ML, Data Science and Big Data
24 results
for RLAIF
(Skill/Tech stack)
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GRPOEntry-level Internship深圳9h ago
-
Mid-level Full Time上海5d ago
-
Mid-level Internship上海、北京5d ago
-
Staff AI research scientist USD 234K-296KAdversarial Training | Agentic Systems | Benchmark design | Data Curation | Data GenerationCompany holidays | Company offsites | Dental insurance | Dependent FSA | Fertility supportSenior-level Full TimeSan Francisco, CA5d ago
-
Senior Machine Learning Engineer USD 188K-282KAdversarial Training | Calibration monitoring | Continuous batching | DPO | Deep learningSenior-level Full TimePalo Alto, CA7d ago
-
Senior AI Engineer, AI Lab GBP 90K-131KBLEU | Bark | DVC | ElevenLabs | Fine TuningAnnual leave | Employee assistance program | Free Economist content online subscription | Moving home allowance | Parental leaveSenior-level Full TimeLondon - Commercial R10d ago
-
Head of AI Training Research USD 175K-280KArtificial Intelligence | Benchmarking | Data Curation | Data Quality | Data quality assessmentExecutive-level Full TimeWorldwide - Remote R12d ago
-
Senior AI/ML Engineering Specialist, Responsible AI CAD 99K-132KAIF360 | Adversarial Testing | Amazon SageMaker | Automated testing | Azure Machine LearningSenior-level Full TimeMississauga, ON, CAN - 2300 Meadowvale …13d ago
-
AI Solutions Architect, Senior Manager USD 142K-266KAI Governance | AWS Bedrock | Amazon EKS | Amazon SageMaker | Azure OpenAIDependent care | Paid leave | Professional development | Tuition assistance | Work-life programsSenior-level Full TimeUSA, DC, Washington (901 15th St …15d ago
-
Senior-level Full TimeIndia17d ago
-
Agent 全栈研发工程师(前/后端)-MiMo CNY 180K-300KAPI Design | Authorization | Automation | Benchmarking | CI/CDMid-level Full Time北京23d ago
-
Machine Learning Researcher - RL and Agentic Systems USD 190K-287KAgentic Systems | Benchmarking | Data Validation | Dataset Quality Evaluation | Dataset qualityMid-level Full TimeRemote R25d ago
-
Principal AI Research Scientist Post-Training Alignment CAD 123K-180KAgentic AI | Alignment research | DPO | Deep learning | Distributed TrainingSenior-level Full TimeAMER - Canada - Ontario - …27d ago
-
Researcher, Context - Agent Post-Training USD 250K-380KData Pipelines | Deep learning | Experimentation | Grading systems | Language ModelsMid-level Full TimeSan Francisco1mo ago
-
Researcher, Connectors - Agent Post-Training USD 250K-380KAPIs | Data Pipelines | Deep learning | Evals | ExperimentationSenior-level Full TimeSan Francisco1mo ago
-
Researcher, Computer Use - Agent Post-Training USD 250K-380KAgent systems | Browser Automation | Computer use | Data Pipelines | Desktop AutomationSenior-level Full TimeSan Francisco1mo ago
-
Researcher, Artifacts - Agent Post-Training USD 250K-380KData Pipelines | Evals | Evaluation | Experimentation | GradingMid-level Full TimeSan Francisco1mo ago
-
Machine Learning Engineer 5 USD 172K-306KAWS | Algorithms | Azure | Data Structures | Direct Preference OptimizationSenior-level Full TimeSan Jose, United States R1mo ago
-
Senior-level Full TimeSan Jose, United States R1mo ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data Retrieval | Data StorageEntry-level Internship深圳1mo ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data pipeline | Distributed SystemsEntry-level Internship深圳1mo ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitFlexible work schedule | Internship opportunity | MentorshipEntry-level Internship深圳1mo ago
-
Software Engineer - Machine Learning USD 190K-220KAdversarial Data | Adversarial Data Generation | Adversarial Training | Content Moderation | DPOMid-level ContractMountain View, CA1mo ago
-
AI Engineer (LLM & ML) CAD 100K-125KAdversarial Models | Computer Vision | Deep learning | Embeddings | Feature EngineeringEmpowerment | Professional development | Startup culture | Work with top talentMid-level Full TimeSana'a, Yemen1mo ago