Find jobs in AI/ML, Data Science and Big Data
28 results
for Preference optimization
(Skill/Tech stack)
-
Machine Learning Engineer, Chakra USD 120K-235KAgentic AI | Benchmarking | Conversational AI | Data Pipelines | Deep learningMid-level Full TimeHybrid in Santa Clara, CA R23h ago
-
AWS | Agent Orchestration | Autogen | Autonomous Agents | Direct Preference OptimizationBicycle subsidy | Corporate discounts | Corporate pension plan | Digital meal vouchers | Educational budgetSenior-level Full TimeBerlin, Germany1d ago
-
Staff Software Engineer, Generative AI, Core ML USD 207K-300KAI Feedback | Computer Vision | Data Processing | Deep learning | Digital TwinSenior-level Full TimeMountain View, CA, USA2d ago
-
Mid-level Full TimeBengaluru, Karnataka, India2d ago
-
Machine Learning Engineer (Post-Training) EUR 57K-84KAWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference OptimizationSenior-level Full TimeParis, France2d ago
-
Senior Product Manager, LLM. SGD 132K-156KAI infrastructure | Cost Optimization | Cost Performance | Cost-performance optimization | Data QualitySenior-level Full TimeCrimson House Singapore2d ago
-
AI Agents Applied Engineer - Senior Associate USD 148K-240KA/B | A/B Testing | Auditability | B testing | Bandit AlgorithmsBackup childcare | Financial coaching | Flexible benefits | Health care coverage | Mental health supportSenior-level Full TimeBrooklyn, NY, United States2d ago
-
Senior Applied Scientist USD 180K-230KDirect Preference Optimization | Distributed Training | Human Feedback | LLM-as-a-Judge | Language ModelsSenior-level Full TimePalo Alto3d ago
-
AWS | Attribution Modeling | Azure | C++ | Chain-of-Thought401k | Health benefits | Paid time off | Parental leave | Stock purchaseSenior-level Full Time(USA) Crossman Service Building CA SUNNYVALE …8d ago
-
Research Scientist (Seed-LLM) USD 244K-450KData Construction | Deep learning | Inference Optimization | Instruction Tuning | Language ModelsMid-level Full TimeSan Jose, California, United States9d ago
-
DDP | Deep learning | Direct Preference Optimization | Distributed Training | DockerSenior-level Full TimePangyo (Software Dream Center), South Korea9d ago
-
AI Research Scientist Intern (PhD), Embodied AI USD 93K-180KAction models | Deep learning | Diffusion Models | Fine Tuning | Imitation LearningIn office collaboration 5 days per week | Publication opportunitiesEntry-level InternshipMilpitas, CA9d ago
-
Senior Applied Scientist USD 142K-270KDiffusion Models | Direct Preference Optimization | Fine Tuning | Human Feedback | Inference accelerationSenior-level Full TimeSeattle, United States9d ago
-
大模型应用算法工程师/专家 CNY 240K-480KC++ | Computer Vision | Deep learning | Direct Preference Optimization | Human Computer DialogueSenior-level Full Time上海、北京10d ago
-
Senior Applied AI Manager USD 170K-234KAgent systems | Agentic Systems | Curriculum learning | Data Deduplication | Data mixingSenior-level Full TimeSan Mateo, CA10d ago
-
Agent RL Infra Engineer USD 224K-356KAI Feedback | Active Learning | Cluster management | Continuous Learning | Data CurationSenior-level Full TimeUS, CA, Santa Clara, United States12d ago
-
Applied Reinforcement Learning Engineer USD 150K-160KActor-critic | Agent systems | BCQ | Behavioral cloning | CQLEqual opportunity employer | Hybrid remote work | Research publications opportunityMid-level Full TimeRemote Work( USA), United States R15d ago
-
校招-Ai研究科学家-大语言模型/视觉语言模型算法与后训练(博士优先) CNY 500K-500KAdapters | Direct Preference Optimization | Fine Tuning | Flax | Function designNone Full Time上海21d ago
-
Senior Staff Software Engineer, Model LifeCycle USD 237K-288KAPI Design | CUDA | Checkpointing | DPO | DeepSpeed401k matching | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributionsSenior-level Full TimeTel Aviv - IL21d ago
-
Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributionsSenior-level Full TimeSan Francisco, CA - US21d ago
-
Staff Software Engineer, Model LifeCycle USD 208K-253KAPI Design | Checkpointing | Distributed Training | Failure recovery | Fine Tuning401k match | Cell phone stipend | Commuter benefits | Dental insurance | Employer HSA contributionsSenior-level Full TimeSan Francisco, CA - US22d ago
-
Agent Orchestration | Amazon Web Services | Auto Planning | Autogen | Direct Preference OptimizationBicycle subsidy | Corporate discounts | Corporate pension plan | Digital meal vouchers | Educational budgetSenior-level Full TimeBerlin, Germany22d ago
-
Benchmark design | Computer Vision | Deep learning | Direct Preference Optimization | Evaluation metricsCar to go subscriptions | Free parking | Learning opportunities | On site bakery | On-site restaurantsMid-level Full TimeJerusalem22d ago
-
Staff AI Engineer, Model Post-Training and Alignment USD 196K-268KBenchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy OptimizationCompany events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowancesSenior-level Full TimeAPAC22d ago
-
Applied Scientist 3 USD 120K-238KFine Tuning | Inference Optimization | Machine Learning | Model Compression | Model DistillationMid-level Full TimeSan Jose, United States22d ago
-
Automated testing | Cryptography | Direct Preference Optimization | Distributed Systems | DockerSenior-level Full TimeRemote R23d ago
-
Senior AI Research Scientist (6240) USD 170K-270KAdversarial Learning | Attention Networks | Dash | Data Preprocessing | Data WranglingHybrid work schedule | Professional development programs | Travel for training and team buildingSenior-level Full TimeSan Jose, CA, US1mo ago
-
AI Safety | Agent architectures | Agent-based | Agent-based AI | Data CurationMid-level Full TimeSeattle, Washington, USA1mo ago