Find jobs in AI/ML, Data Science and Big Data
26 results for PPO (Skill/Tech stack)
-
Machine Learning Research Engineer | Kilby Labs; USD 149K-258K; C++ | DPO | Deep learning | Embeddings | Few-Shot Learning; Mid-level; Full Time; United States; 1d ago
-
Research Scientist - Robotics AI; USD 165K-185K; 3D Scene | 3D Scene Understanding | BEV | Behavior planning | C++; 401k matching | Financial planning support | Health insurance | Life and disability protection | Paid time off; Mid-level; Full Time; Sunnyvale, CA, United States; 6d ago
-
Senior Research Scientist - Robotics AI; USD 185K-215K; 3D Scene | 3D Scene Understanding | Autonomous Planning | BEV grid | Behavioral Planning; 401k matching | Disability insurance | Financial planning support | Health insurance | Life insurance; Senior-level; Full Time; Sunnyvale, CA, United States; 6d ago
-
Research Engineer - LLM Training & Alignment Systems; CAD 127K-225K; Automation | Benchmarking | C# | C++ | Data Curation; Mid-level; Contract Full Time; Kingston, Ontario, Canada; 6d ago
-
DPO | Deep learning | Diverse Preference Optimization | Learning algorithms | Machine Learning; Mid-level; Full Time; Shanghai; 7d ago
-
LLM Post-Training / Agentic Algorithm Engineer; CNY 180K-360K; Agentic RL | DAPO | Distributed Training | Evaluation | Function Calling; Entry-level; Full Time; Shanghai, Beijing; 7d ago
-
Senior-level; Full Time; Seoul, Korea; 7d ago
-
Principal AI Research Scientist Post-Training Alignment; CAD 123K-180K; Agentic AI | Alignment research | DPO | Deep learning | Distributed Training; Senior-level; Full Time; AMER - Canada - Ontario - …; 14d ago
-
Behavior Cloning | C++ | Cloud processing | Computer Vision | Control; Entry-level; Internship; Beijing, Shanghai R; 14d ago
-
Applied Machine Learning Engineer; USD 110K-165K; A3C | Apache Kafka | C plus plus | C# | CUDA; 401k plan | Education assistance | Flexible work schedules | Health care and wellness plans | Paid Holidays; Senior-level; Full Time; Colorado Springs, United States; 15d ago
-
Senior AI Researcher (Foundation AI); USD 190K-230K; CI/CD | Cloud Computing | Context Parallelism | DPO | Data parallelism; Senior-level; Full Time; Boston, MA; 18d ago
-
Large Model Infra R&D Intern (Agentic RL track); CNY 25K-37K; Asynchronous programming | Concurrency | Distributed Systems | Docker | Git; Entry-level; Internship; Shenzhen; 22d ago
-
Large Model Infra R&D Intern (Agentic RL track); CNY 25K-37K; Alerting | Asynchronous programming | Concurrency | Data Retrieval | Data Storage; Entry-level; Internship; Shenzhen; 22d ago
-
Large Model Infra R&D Intern (Agentic RL track); CNY 25K-37K; Alerting | Asynchronous programming | Concurrency | Data pipeline | Distributed Systems; Entry-level; Internship; Shenzhen; 22d ago
-
Large Model Infra R&D Intern (Agentic RL track); CNY 25K-37K; Asynchronous programming | Concurrency | Distributed Systems | Docker | Git; Flexible work schedule | Internship opportunity | Mentorship; Entry-level; Internship; Shenzhen; 22d ago
-
Behavior Cloning | Diffusion Models | Embodied AI | Hardware Integration | Imitation Learning; Equity | Health benefits | Lunches | Snacks | Team activities; Senior-level; Full Time; Santa Clara, CA; 1mo ago
-
Adversarial Robustness | Agent learning | Audio Processing | Computer Vision | Content Moderation; Career growth | Research mentorship; Full Time; San Jose, California, United States; 1mo ago
-
AIGC Detection | Adversarial Learning | Agentic Systems | Cross-modal alignment | GRPO; Full Time; Seattle, Washington, United States; 1mo ago
-
Adversarial Networks | Adversarial Training | Cross-modal alignment | GRPO | Generative Adversarial Networks; Entry-level; Internship; San Jose, California, United States; 1mo ago
-
Applied Reinforcement Learning Engineer 2; USD 150K-300K; ActorCritic | BCQ | BehavioralCloning | CQL | DQN; Mid-level; Full Time; Redmond, Washington, United States; 1mo ago
-
Senior Software Engineer, RL Post-Training Frameworks; USD 184K-356K; Actor Based Programming | C# | C++ | Consistency models | DPO; Comprehensive benefits | Equity; Senior-level; Full Time; US, CA, Santa Clara, United States; 1mo ago
-
Tech Lead, Robotic AI Model; USD 150K-180K; Action Chunking | Action Tokenization | Behavior Cloning | DPO | DeepSpeed; Senior-level; Full Time; El Segundo, California, United States; 1mo ago
-
Entry-level; Internship; Shanghai; 1mo ago
-
Robotics ML Expert, AI; USD 60K-60K; Agent systems | Control Theory | Dm_control | Domain Randomization | Drake; Async collaboration | Fully remote | Independent contractor 1099; Senior-level; Full Time; Miami R; 1mo ago
-
Senior Solutions Architect, Retail; USD 184K-356K; API Integration | Agent systems | Agents SDK | Benchmarking | C++; Equity | Health benefits | Paid time off; Senior-level; Full Time; US, CA, Remote, United States R; 1mo ago
-
C++ | Deep learning | GPU clusters | GRPO | High Performance; Equity | Healthcare benefits | Paid time off | Retirement benefits; Senior-level; Full Time; US, CA, Santa Clara, United States; 1mo ago