Find jobs in AI/ML, Data Science and Big Data
7 results
for Preference Learning
(Skill/Tech stack)
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KAgentic RL | DAPO | Distributed Training | Function Calling | GRPOEntry-level Full Time上海、北京3h ago
-
Agentic AI Data Scientist PLN 241K-400KA/B | A/B Testing | Apache Spark | B testing | Data ValidationDental coverage | Flexible working hours | Life insurance | Meal allowance | Medical coverageMid-level Full TimeKrakow, Poland14d ago
-
Automatic Speech Recognition | Deep learning | Diffusion Models | Knowledge Distillation | Language Models401k matching | Disability insurance | Family-forming benefits | Flexible time off | Health insuranceMid-level Full TimeLos Gatos, United States14d ago
-
Data Analysis | Dataset Processing | Direct Preference Optimization | Evaluation Pipelines | Fine TuningEntry-level InternshipSan Jose, California, United States23d ago
-
Applied Scientist, AGI Customization Services USD 142K-193KAmazon SageMaker | Continued Pre Training | Dataset creation | Experiment design | Extended Post TrainingMid-level Full TimeCambridge, Massachusetts, USA1mo ago
-
Staff Applied AI Scientist CNY 200K-500KBenchmarking | Cost Optimization | DPO | Deep learning | DistillationCross-functional collaboration | Direct impact with real customer data | Remote-friendly workSenior-level Full TimeShenzhen, Guangdong Province, China1mo ago
-
LLM Post-Training Engineer, Research & Product USD 212K-389KData Pipelines | Deep learning | Distributed Training | Human preference learning | Instruction TuningSenior-level Full TimeSan Jose, California, United States1mo ago