Find jobs in AI/ML, Data Science and Big Data
16 results
for Proximal Policy Optimization
(Skill/Tech stack)
-
AI Feedback | Deep learning | Direct Preference Optimization | Fine Tuning | Human FeedbackMid-level Full Time上海3d ago
-
大模型算法工程师(开放域对话) CNY 180K-300KA/B | A/B Testing | Agentic reinforcement learning | B testing | DeepSpeedMid-level Internship上海、北京3d ago
-
Actor-critic | Computer Vision | Computer Vision Defect Detection | Data Ingestion | Defect DetectionDental insurance | Flexible spending accounts | Life and disability insurance | Medical insurance | Paid vacation and holidaysSenior-level Full TimeNorth Reading, MA, US5d ago
-
Staff Software Engineer, AI/ML USD 216K-271KAI Feedback | Agentic AI | Data Pipelines | Direct Preference Optimization | Experimentation platformsConference reimbursement | Education reimbursement | Employee assistance program | Employee stock purchase program | Equity compensationSenior-level Full TimeSeattle6d ago
-
Actor-critic | Agent systems | Birds Eye View | C++ | Computer VisionEntry-level Full TimeFR REN AMPERE S.T. - Guyancourt, …7d ago
-
Senior Software Engineer - Model Training & AI Evals INR 3500K-5000KAI Feedback | Ablation Studies | Benchmarking | CI/CD | Data GenerationSenior-level Full TimeRemote (India) R12d ago
-
Bayesian optimization | Data Generation | Debugging | DeepSpeed | Distributed SystemsAdditional time off for learning and development | Annual leave | Cycle to work scheme | Employee assistance program | Group personal pensionEntry-level ContractLondon, United Kingdom12d ago
-
Research Scientist, LLM Evaluation & Post-Training USD 150K-300KAI Feedback | Alignment | Benchmarking | Context evaluation | Deep learningMid-level Full TimeRemote Work( USA), United States R18d ago
-
Machine Learning Engineer, LLM Post-Training USD 150K-230KAttention Mechanisms | Data-parallel | DeepSpeed | Fully Sharded Data Parallel | Hugging Face401k match | Commuter benefits | Dental insurance | FSA | HSAMid-level Full TimeMountain View, California, United States19d ago
-
Software Machine Learning Engineer USD 116K-186KApplied AI | Attention Mechanism | Explainable AI | Graph Machine Learning | InterpretabilityDental insurance | Disability insurance | Discretionary bonuses | Flexible spending accounts | Life insuranceEntry-level Full TimeNorth Reading, MA, US21d ago
-
具身智能-强化学习(灵巧操作方向) 实习生 CNY 25K-37KActor-critic | Diffusion Models | Distributed Training | Embodied intelligence | Flow matchingEntry-level Full Time Internship深圳27d ago
-
Senior Staff AI Engineer USD 180K-240KA3C | Actor-critic | Adaptive computation | Benchmarks | C plus plusSenior-level Full TimeLos Altos, California,27d ago
-
AI Feedback | Agentic Systems | Direct Preference Optimization | Distributed Training | EvaluationSenior-level Full TimeAMER - United States - California … R1mo ago
-
Applied AI Engineer USD 99K-225KAWS | AgentOps | Azure | ChromaDB | Continued Pretraining401k retirement plan | Bike storage | Commuter benefits | Dependent care FSA | Desk setup stipendMid-level Full TimeWashington DC1mo ago
-
Senior Machine Learning Engineer, RL / Locomotion USD 220K-336KActor-critic | Domain Randomization | GPU Computing | Isaac Lab | Isaac-GymHealth benefits | Recovery BenefitsSenior-level Full TimeCosta Mesa, California, United States1mo ago
-
Agentic Systems | Deep learning | Diffusion Models | Fine Tuning | Generative AI401k eligibility | Annual bonus | Dental insurance | Medical insurance | Paid time offSenior-level Full TimeLos Altos, CA1mo ago