Find jobs in AI/ML, Data Science and Big Data
30 results
for GRPO
(Skill/Tech stack)
-
Computer Scientist - II INR 3000K-5000KAI orchestration | API Development | Agent systems | Data Processing | Event DrivenSenior-level Full TimeNoida, India R2d ago
-
Senior Machine Learning Engineer – LLMs EUR 62K-90KAccelerate | Axolotl | BF16 | DPO | Data DeduplicationAutonomy | Hybrid work model | Professional growth | Top-spec equipmentSenior-level Full TimeNetherlands - Amsterdam4d ago
-
Senior-level Full TimeNetherlands - Amsterdam4d ago
-
Intern Engineer – RL Post-Training for LLMs CAD 58K-104KData Generation | Deep learning | DeepSpeed | Distributed Training | GRPOInternshipEntry-level InternshipVancouver, British Columbia, Canada4d ago
-
Machine Learning Research Engineer | Kilby Labs USD 149K-258KC++ | DPO | Deep learning | Embeddings | Few-Shot LearningMid-level Full TimeUnited States4d ago
-
AI Researcher PHP 219K-252KASR | Agentic AI | Audio signal processing | DPO | EmbeddingsEnglish communication required | Open ended research opportunities | Remote workMid-level Full TimeSouth America, Europe, Asia R8d ago
-
Data Scientist - Agentic AI Systems - Loops USD 140K-150KAgent coordination | Autogen | DPO | Decision Making | Decision-making models401k match | Dental insurance | Disability benefits | Flexible paid time off | Flexible spending accountsMid-level Full TimePalo Alto, California, United States9d ago
-
Research Engineer - LLM Training & Alignment Systems CAD 127K-225KAutomation | Benchmarking | C# | C++ | Data CurationMid-level Contract Full TimeKingston, Ontario, Canada10d ago
-
Agentic AI Engineer USD 86K-120KAgent Frameworks | ArangoDB | Attention Mechanism | Autogen | BenchmarksEntry-level Full TimeCARY 02, United States10d ago
-
Mid-level Internship上海10d ago
-
大语言模型后训练/Agentic算法工程师 CNY 180K-360KAgentic RL | DAPO | Distributed Training | Evaluation | Function CallingEntry-level Full Time上海、北京10d ago
-
Senior AI Researcher (Foundation AI) USD 190K-230KCI/CD | Cloud Computing | Context Parallelism | DPO | Data parallelismSenior-level Full TimeBoston, MA21d ago
-
Senior-level Full TimeSan Jose, United States R22d ago
-
Senior AI Research Scientist USD 139K-221KDAPO | Fine Tuning | GRPO | Language Models | Large Language ModelsSenior-level Full TimeRemote - USA, United States R23d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data Retrieval | Data StorageEntry-level Internship深圳25d ago
-
Forward Deployed Engineer (Inference & Post-Training) USD 270K-300KDPO | GRPO | KV cache | LoRA | Pipeline parallelismEquity | Health insurance | Remote work flexibilitySenior-level Full TimeSan Francisco1mo ago
-
AI Engineer (m/w/d) EUR 47K-47KArgoCD | Automated testing | Clean Code | Code review | DPOCompany pension | Corporate benefits | Professional developmentSenior-level Full TimeBerlin, Berlin, DE1mo ago
-
Adversarial Networks | Computer Vision | Cross-modal alignment | GRPO | Generative Adversarial NetworksEntry-level InternshipSeattle, Washington, United States1mo ago
-
Adversarial Robustness | Agent learning | Audio Processing | Computer Vision | Content ModerationCareer growth | Research mentorshipNone Full TimeSan Jose, California, United States1mo ago
-
AIGC Detection | Adversarial Learning | Agentic Systems | Cross-modal alignment | GRPONone Full TimeSeattle, Washington, United States1mo ago
-
Adversarial Networks | Adversarial Training | Cross-modal alignment | GRPO | Generative Adversarial NetworksEntry-level InternshipSan Jose, California, United States1mo ago
-
Applied Research - Evals & Data USD 150K-300KAccelerate | Data Pipelines | Data Versioning | Distributed Systems | Distributed tracingConference attendance | Professional development budget | Relocation support | Remote work | Team offsitesSenior-level Full TimeSan Francisco1mo ago
-
Causal Inference | Cross-modal fusion | DPO | Data Modeling | Deep learningEntry-level Full TimeSan Jose, California, United States1mo ago
-
Causal Inference | Cross-modal fusion | DPO | Data Modeling | Deep learningMid-level Full TimeSeattle, Washington, United States1mo ago
-
Agent systems | Agentic AI | Artificial Intelligence | Benchmarking | Continual LearningDiversity training | Flexible work options | GPU infrastructure access | International Conference Publishing Support | Paid time offEntry-level Full TimeDresden, DE, 010691mo ago
-
Senior Software Engineer, RL Post-Training Frameworks USD 184K-356KActor Based Programming | C# | C++ | Consistency models | DPOComprehensive benefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Tech Lead, Robotic AI Model USD 150K-180KAction Chunking | Action Tokenization | Behavior Cloning | DPO | DeepSpeedSenior-level Full TimeEl Segundo, California, United States1mo ago
-
Entry-level Internship上海1mo ago
-
Agent systems | Attention Mechanism | CPU | Continuous Improvement | DPODental insurance | Employee assistance program | Flexible Paid Vacation | Flexible paid sick leave | Flexible spending accountSenior-level Full TimePalo Alto, CA1mo ago
-
C++ | Deep learning | GPU clusters | GRPO | High PerformanceEquity | Healthcare benefits | Paid time off | Retirement benefitsSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago