Find jobs in AI/ML, Data Science and Big Data
53 results for Reward Modeling (Skill/Tech stack)
-
Machine Learning Engineer - Personalization · USD 170K-212K · A/B | A/B Testing | AWS | Agile methodology | Apache Beam · 401k retirement plan | Health insurance | Meal allowance | Paid flexible holidays | Paid parental leave · Senior-level Full Time · New York, NY · 3d ago
-
Forward Deployed Engineer Lead | LLM Post-training · USD 165K-258K · Data Generation | Data Pipelines | Dataset versioning | Distributed Training | Evaluation methodology · Dental insurance | Disability insurance | Health insurance | Life insurance | Paid time off · Senior-level Full Time · New York · 4d ago
-
Lead Machine Learning Engineer, LLM Infrastructure · USD 172K-285K · AWS | Cloud platform | Debugging | Deep learning | Distributed Systems · 401k | Employee stock purchasing program | Life and disability insurance | Medical/Dental/Vision | Mental health support · Senior-level Full Time · California - San Francisco, United States · 4d ago
-
Data Analysis | Dataset Processing | Direct Preference Optimization | Evaluation Pipelines | Fine Tuning · Entry-level Internship · San Jose, California, United States · 6d ago
-
Senior Director, AI Model LifeCycle · USD 301K-355K · Checkpointing | Dataset versioning | Experiment tracking | Failure recovery | Fine Tuning · 401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA contributions · Senior-level Full Time · San Francisco, CA - US · 6d ago
-
Researcher, Agentic Post-Training · USD 295K-445K · Agent systems | Data Pipelines | Diagnostics | Evals | Function Calling · Senior-level Full Time · San Francisco · 7d ago
-
Senior Software Engineer, RL Post-Training Frameworks · USD 184K-356K · Actor Based Programming | C# | C++ | Consistency models | DPO · Comprehensive benefits | Equity · Senior-level Full Time · US, CA, Santa Clara, United States · 8d ago
-
Entry-level Internship · Shanghai (上海) · 9d ago
-
Mid-level Internship · Shanghai (上海) · 9d ago
-
AI Research Scientist - Agentic Systems · USD 220K-295K · APIs | Data Augmentation | Data Generation | Fine Tuning | Language Models · 401k | Medical, dental, and vision insurance | Mental health and wellness support | Unlimited PTO | Work-life balance · Mid-level Full Time · New York, NY · 14d ago
-
Senior Machine Learning Engineer (Small Language Models) · USD 154K-189K · AWS | Adapter-Tuning | Axolotl | Cloud Computing | Data labeling · Flexible remote days | Flexible work schedule · Senior-level Full Time · Canada - Remote R · 14d ago
-
LLM Applied Data Scientist (RAG/ NLP) · TWD 480K-612K · A Star | API Integration | C++ | Deep learning | Embeddings · Career growth | Continuous learning | Work from home · Mid-level Full Time · Taiwan, Taipei · 15d ago
-
Principal PMT-ES - AI/ML Training, Annapurna Labs · USD 181K-281K · AI/ML | Customer Requirements | DPO | Deep learning | Developer experience · Career growth resources | Flexible organization | Knowledge sharing | Mentorship | Work-life balance · Senior-level Full Time · Cupertino, California, USA · 15d ago
-
Member of technical staff - Research - Model - London · GBP 230K-340K · Data Pipelines | Deep learning | Distributed Training | Evaluation | Git · Career development | Continuous learning | Hybrid work | Professional growth · Senior-level Full Time · London · 17d ago
-
Applied AI Researcher, Post-Training · USD 150K-250K · Agentic collaboration | Continual Learning | Continual pretraining | DPO | Data Analysis · 401k | Commuter benefits | In-office lunch | Medical, dental & vision coverage · Mid-level Full Time · San Francisco · 17d ago
-
Applied AI Researcher, System Self-Improvement · USD 150K-250K · Agentic collaboration | Data Analysis | Ensembling | Evaluation | Graph-of-Thoughts · 401k | Commuter benefits | Equity | In-office lunch | Medical, dental & vision coverage · Mid-level Full Time · San Francisco · 17d ago
-
Autoregressive models | CPU acceleration | Deep learning | Diffusion Models | Distributed Training · Entry-level Full Time · Singapore-CapitaSky · 17d ago
-
Senior Machine Learning Engineer - Firefly · USD 151K-265K · Autoregressive models | Data Pipelines | Data Quality | Diffusion Models | Language Models · Senior-level Full Time · San Jose, United States R · 17d ago
-
Software Engineer - AI Native Development · USD 200K-300K · Angular | C++ | Data Engineering | Domain Adaptation | Fine Tuning · Mid-level Full Time · Palo Alto · 17d ago
-
Deep learning | Language Processing | Large-scale | Large-scale experimentation | Machine Learning · Company-sponsored medical plan | Paid Holidays | Paid sick leave · Entry-level Full Time Internship · US-Washington-Bellevue, United States · 18d ago
-
Deep learning | Language Processing | Natural Language | Natural Language Processing | PyTorch · Medical plan enrollment | Paid Holidays | Paid sick leave · Entry-level Full Time Internship · US-Washington-Bellevue, United States · 18d ago
-
Helix AI Engineer, Reinforcement Learning · USD 150K-350K · Credit Assignment | Distributed Training | Experiment Management | Exploration | Model-based reinforcement learning · In-office collaboration · Senior-level Full Time · San Jose, CA · 21d ago
-
Helix AI Engineer, Pretraining · USD 175K-400K · Computer Vision | Data Mixture Optimization | Deep learning | Distributed Training | Language Processing · Senior-level Full Time · San Jose, CA · 21d ago
-
Mid-level Full Time · Medellín, Antioquia, Colombia · 22d ago
-
AI Engineer - Reinforcement Learning · EUR 60K-84K · Data Pipelines | Evaluation Frameworks | Fine Tuning | Human-in-the-loop | Language model fine-tuning · Senior-level Full Time · Paris, France · 22d ago
-
Advisor - AI-Guided Optimization for Biologics · USD 166K-244K · Active Learning | Antibody Design | Bayesian optimization | Diffusion Models | Distributed Training · Company 401K | Employee assistance program | Fitness benefits | Flexible spending accounts | Life insurance · Mid-level Full Time · US: San Diego CA Lilly Biotechnology … · 22d ago
-
Staff Software Engineer, Generative AI, Core ML · USD 207K-300K · AI Feedback | Computer Vision | Data Processing | Deep learning | Digital Twin · Senior-level Full Time · Mountain View, CA, USA · 23d ago
-
Machine Learning Engineer (Post-Training) · EUR 57K-84K · AWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference Optimization · Senior-level Full Time · Paris, France · 23d ago
-
Machine Learning Engineer I · USD 151K-189K · AWS | Azure | Classification | Cloud Computing | Code review · 401k match | Equity | Flexible PTO | Learning stipend | Medical/Dental/Vision insurance · Mid-level Full Time · San Francisco, CA · 25d ago
-
AI Research Scientist - Safety Alignment Team · USD 213K-293K · Adversarial prompts | Automation | Computer Vision | DPO | Dataset curation · Senior-level Full Time · Menlo Park, CA · 25d ago
-
Mid-level Full Time · Taiwan, Taipei R · 26d ago
-
AWS | Amazon S3 | Annotation tools | Batching | Contamination detection · Equity packages | Flexible leave options | Inclusive parental leave | Vibe and Thrive allowance · Senior-level Full Time · Vienna, Vienna, Austria · 29d ago
-
Computer Vision | Deep learning | Generative AI | Language Models | Language Processing · Senior-level Full Time · Bellevue, Washington, USA · 29d ago
-
Automated testing | Deep learning | Distributed Training | Evaluation systems | Information Retrieval · Mid-level Full Time · Bellevue, Washington, USA · 29d ago
-
Senior Machine Learning Engineer (Spain) · GBP 70K-100K · API Integration | Agile methodologies | Bias detection | Data Governance | Data Quality · Equal pay guaranteed | Flexible working hours | Hybrid work | International exposure | Multicultural environment · Senior-level Full Time · Cambourne, United Kingdom of Great Britain … · 29d ago
-
Research Intern – Reinforcement Learning (RL) · INR 300K-420K · Agent systems | Fine Tuning | LLM Fine-tuning | Language Processing | Learning environments · Entry-level Internship · Noida · 1mo ago
-
Automated Evaluation | Information Retrieval | Language Models | Language Processing | Large Language Models · Mid-level Full Time · Bellevue, Washington, USA · 1mo ago
-
AI Research - Scientist/ Engineer · USD 245K-350K · Benchmarking | Evaluation Frameworks | Fine Tuning | Language Models | Language Processing · Periodic in-person meetings | Work from home · Mid-level Full Time · Global · 1mo ago
-
Staff Applied AI Scientist · CNY 200K-500K · Benchmarking | Cost Optimization | DPO | Deep learning | Distillation · Cross-functional collaboration | Direct impact with real customer data | Remote-friendly work · Senior-level Full Time · Shenzhen, Guangdong Province, China · 1mo ago
-
Agent RL Infra Engineer · USD 224K-356K · AI Feedback | Active Learning | Cluster management | Continuous Learning | Data Curation · Senior-level Full Time · US, CA, Santa Clara, United States · 1mo ago
-
Chain-of-Thought | Data Compliance | Knowledge Distillation | Language Models | Language Processing · Entry-level Full Time · Singapore, Singapore · 1mo ago
-
Sr Machine Learning Engineer · USD 159K-236K · AWS | Alignment Tuning | Anomaly Detection | Azure | BERT · Financial security support | Flexible hybrid work model | Healthcare coverage | Mental health resources | Paid time off · Senior-level Full Time · USA - California - San Jose … · 1mo ago
-
Applied Reinforcement Learning Engineer · USD 150K-160K · Actor-critic | Agent systems | BCQ | Behavioral cloning | CQL · Equal opportunity employer | Hybrid remote work | Research publications opportunity · Mid-level Full Time · Remote Work (USA), United States R · 1mo ago
-
AI Engineer (PhD Required) · USD 300K-405K · Architecture Search | Attention Mechanisms | Autogen | Automl | Computer Vision · Autonomy in research | Opportunity to deploy research at scale | Remote work · Mid-level Full Time · Lahore, Punjab · 1mo ago
-
AI Engineer (PhD Required) · SGD 96K-138K · Attention Mechanisms | Autogen | Chunking | Constitutional AI | Distributed Training · Annual team events | Casual team environment | Flexible hours | Internet reimbursement | Opportunity for advancement · Mid-level Full Time · Singapore, Singapore · 1mo ago
-
Research Engineer · BRL 200K-220K · Ablation Studies | Code review | Data Curation | Data Validation | Data denoising · 5 days per week working | Collaborative team | Flexible working hours | Remote work | Supportive work environment · Mid-level Full Time · Brazil · 1mo ago
-
Principal Applied Scientist, Agentic AI · USD 181K-305K · AI Feedback | DPO | Fine Tuning | Human Feedback | Learning from Human Feedback · Mentorship and technical leadership | Remote-first work environment · Senior-level Full Time · Remote-USA, United States R · 1mo ago
-
LLM Post-Training Engineer, Research & Product · USD 212K-389K · Data Pipelines | Deep learning | Distributed Training | Human preference learning | Instruction Tuning · Senior-level Full Time · San Jose, California, United States · 1mo ago
-
Member of Technical Staff - Imagine Model · USD 180K-440K · Audio Processing | C++ | Computer Vision | Data Annotation | Data Augmentation · 401k | Dental insurance | Disability insurance | Employee discounts | Health insurance · Senior-level Full Time · Palo Alto, CA; Seattle, WA · 1mo ago
-
Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback · 401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions · Senior-level Full Time · San Francisco, CA - US · 1mo ago