Find jobs in AI/ML, Data Science and Big Data
12 results for Preference optimization (Skill/Tech stack)
-
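Campus Recruitment - AI Research Scientist - Large Language Model/Vision-Language Model Algorithms and Post-Training (PhD preferred)
CNY 500K-500K
Adapters | Direct Preference Optimization | Fine Tuning | Flax | Function design
None | Full Time | Shanghai | 19h ago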
-
Senior Staff Software Engineer, Model LifeCycle
USD 237K-288K
API Design | CUDA | Checkpointing | DPO | DeepSpeed
401k matching | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions
Senior-level | Full Time | Tel Aviv - IL | 21h ago
-
Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback
401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions
Senior-level | Full Time | San Francisco, CA - US | 1d ago
-
Lead AI Engineer
USD 200K-215K
A/B | A/B Testing | AWS Bedrock | Agentic LLM | Agentic LLM systems
Dental insurance | Employee discounts | Employee equity | Health insurance | Pet insurance
Senior-level | Full Time | Remote - United States | 1d ago
-
Staff Software Engineer, Model LifeCycle
USD 208K-253K
API Design | Checkpointing | Distributed Training | Failure recovery | Fine Tuning
401k match | Cell phone stipend | Commuter benefits | Dental insurance | Employer HSA contributions
Senior-level | Full Time | San Francisco, CA - US | 1d ago
-
Agent Orchestration | Amazon Web Services | Auto Planning | Autogen | Direct Preference Optimization
Bicycle subsidy | Corporate discounts | Corporate pension plan | Digital meal vouchers | Educational budget
Senior-level | Full Time | Berlin, Germany | 2d ago
-
Benchmark design | Computer Vision | Deep learning | Direct Preference Optimization | Evaluation metrics
Car to go subscriptions | Free parking | Learning opportunities | On site bakery | On-site restaurants
Mid-level | Full Time | Jerusalem | 2d ago
-
Staff AI Engineer, Model Post-Training and Alignment
USD 196K-268K
Benchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy Optimization
Company events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowances
Senior-level | Full Time | APAC | 2d ago
-
Applied Scientist 3
USD 120K-238K
Fine Tuning | Inference Optimization | Machine Learning | Model Compression | Model Distillation
Mid-level | Full Time | San Jose, United States | 2d ago
-
Automated testing | Cryptography | Direct Preference Optimization | Distributed Systems | Docker
Senior-level | Full Time | Remote | 3d ago
-
Senior AI Research Scientist (6240)
USD 170K-270K
Adversarial Learning | Attention Networks | Dash | Data Preprocessing | Data Wrangling
Hybrid work schedule | Professional development programs | Travel for training and team building
Senior-level | Full Time | San Jose, CA, US | 13d ago
-
AI Safety | Agent architectures | Agent-based | Agent-based AI | Data Curation
Mid-level | Full Time | Seattle, Washington, USA | 24d ago