Find jobs in AI/ML, Data Science and Big Data
19 results
for PPO
(Skill/Tech stack)
-
Data Scientist (Remote) USD 140K-215KContext Management | DPO | DeepSpeed | Experiment tracking | Experimental DesignEmployee networks | Great Place to Work certification | Paid adoption leave | Paid parental leave | Professional developmentMid-level Full TimeUSA VA Remote, United States R1d ago
-
Software Dev Engineer II, Stores Foundational AI -SFAI USD 165K-223KAsync Rollouts | Batching | C++ | CUDA | Cluster computing401k matching | Adoption reimbursement | Dental insurance | Employee assistance program | Flexible spending accountsMid-level Full TimePalo Alto, California, USA1d ago
-
Software Dev Engineer II, Stores Foundational AI -SFAI USD 143K-194KCUDA | Data Pipelines | Distributed Training | Dynamo | Experiment tracking401k matching | Employee assistance program | Health insurance | Paid time off | Parental leaveMid-level Full TimeSeattle, Washington, USA1d ago
-
Software Dev Engineer II, Stores Foundational AI -SFAI USD 143K-194KAsync Rollouts | Batching | C++ | CUDA | Data Delivery401k matching | Health insurance | Paid time off | Parental leaveMid-level Full TimeSeattle, Washington, USA1d ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KDistributed Training | Function Calling | GRPO | Human Feedback | JSONEntry-level Full Time上海、北京3d ago
-
Director, Reinforcement Learning & Agentic Post-Training EUR 151K-200KAI Feedback | API Integration | Distributed Training | Environment Design | EvaluationExecutive-level Full TimeParis, France6d ago
-
具身智能算法工程师-模型 CNY 500K-500KDeep learning | Distributed Training | IQL | Inference Optimization | Isaac LabMid-level Full Time北京 R6d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GRPOEntry-level Internship深圳7d ago
-
Agent systems | DPO | Distributed Training | Fine Tuning | JAXEntry-level Full TimeUS-WA-Bellevue13d ago
-
Research Scientist- Robotics AI USD 165K-185K3D Scene | 3D Scene Understanding | BEV | Behavior planning | C++401k matching | Financial planning support | Health insurance | Life and disability protection | Paid time offMid-level Full TimeSunnyvale, CA, United States26d ago
-
Senior Research Scientist- Robotics AI USD 185K-215K3D Scene | 3D Scene Understanding | Autonomous Planning | BEV grid | Behavioral Planning401k matching | Disability insurance | Financial planning support | Health insurance | Life insuranceSenior-level Full TimeSunnyvale, CA, United States26d ago
-
Research Engineer - LLM Training & Alignment Systems CAD 127K-225KAutomation | Benchmarking | C# | C++ | Data CurationMid-level Contract Full TimeKingston, Ontario, Canada26d ago
-
Senior-level Full TimeSeoul, Korea27d ago
-
Principal AI Research Scientist Post-Training Alignment CAD 123K-180KAgentic AI | Alignment research | DPO | Deep learning | Distributed TrainingSenior-level Full TimeAMER - Canada - Ontario - …1mo ago
-
Senior AI Researcher (Foundation AI) USD 190K-230KCI/CD | Cloud Computing | Context Parallelism | DPO | Data parallelismSenior-level Full TimeBoston, MA1mo ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data Retrieval | Data StorageEntry-level Internship深圳1mo ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data pipeline | Distributed SystemsEntry-level Internship深圳1mo ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitFlexible work schedule | Internship opportunity | MentorshipEntry-level Internship深圳1mo ago
-
Behavior Cloning | Diffusion Models | Embodied AI | Hardware Integration | Imitation LearningEquity | Health benefits | Lunches | Snacks | Team activitiesSenior-level Full TimeSanta Clara, CA1mo ago