Find jobs in AI/ML, Data Science and Big Data
44 results for Reward Modeling (Skill/Tech stack)
-
Data Science / ML Engineer (AcS) · USD 69K-140K · API Design | Data Pipelines | Evaluation | Langchain | Langgraph · 100 percent remote · Mid-level Full Time · Remote Latam R · 1d ago
-
Helix AI Engineer, Reinforcement Learning · USD 150K-350K · Credit Assignment | Distributed Training | Experiment Management | Exploration | Model-based reinforcement learning · In-office collaboration · Senior-level Full Time · San Jose, CA · 1d ago
-
Helix AI Engineer, Pretraining · USD 175K-400K · Computer Vision | Data Mixture Optimization | Deep learning | Distributed Training | Language Processing · Senior-level Full Time · San Jose, CA · 1d ago
-
Mid-level Full Time · Medellín, Medellín, Antioquia, Colombia, Antioquia, Colombia · 2d ago
-
AI Engineer - Reinforcement Learning · EUR 60K-84K · Data Pipelines | Evaluation Frameworks | Fine Tuning | Human-in-the-loop | Language model fine-tuning · Senior-level Full Time · Paris, France · 2d ago
-
Advisor - AI-Guided Optimization for Biologics · USD 166K-244K · Active Learning | Antibody Design | Bayesian optimization | Diffusion Models | Distributed Training · Company 401K | Employee assistance program | Fitness benefits | Flexible spending accounts | Life insurance · Mid-level Full Time · US: San Diego CA Lilly Biotechnology … · 2d ago
-
Staff Software Engineer, Generative AI, Core ML · USD 207K-300K · AI Feedback | Computer Vision | Data Processing | Deep learning | Digital Twin · Senior-level Full Time · Mountain View, CA, USA · 3d ago
-
Machine Learning Engineer (Post-Training) · EUR 57K-84K · AWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference Optimization · Senior-level Full Time · Paris, France · 3d ago
-
Machine Learning Engineer I · USD 151K-189K · AWS | Azure | Classification | Cloud Computing | Code review · 401k match | Equity | Flexible PTO | Learning stipend | Medical/Dental/Vision insurance · Mid-level Full Time · San Francisco, CA · 4d ago
-
AI Research Scientist - Safety Alignment Team · USD 213K-293K · Adversarial prompts | Automation | Computer Vision | DPO | Dataset curation · Senior-level Full Time · Menlo Park, CA · 5d ago
-
Mid-level Full Time · Taiwan, Taipei R · 6d ago
-
Machine Learning Engineer II · USD 170K-212K · A/B | A/B Testing | AWS | Agile | Apache Beam · 401k retirement plan | Health insurance | Meal allowance | Paid flexible holidays | Paid parental leave · Senior-level Full Time · New York, NY · 8d ago
-
AWS | Amazon S3 | Annotation tools | Batching | Contamination detection · Equity packages | Flexible leave options | Inclusive parental leave | Vibe and Thrive allowance · Senior-level Full Time · Vienna, Vienna, Austria · 9d ago
-
Computer Vision | Deep learning | Generative AI | Language Models | Language Processing · Senior-level Full Time · Bellevue, Washington, USA · 9d ago
-
Automated testing | Deep learning | Distributed Training | Evaluation systems | Information Retrieval · Mid-level Full Time · Bellevue, Washington, USA · 9d ago
-
Senior Machine Learning Engineer (Spain) · GBP 70K-100K · API Integration | Agile methodologies | Bias detection | Data Governance | Data Quality · Equal pay guaranteed | Flexible working hours | Hybrid work | International exposure | Multicultural environment · Senior-level Full Time · Cambourne, United Kingdom of Great Britain … · 9d ago
-
Senior Applied Scientist · USD 142K-270K · Diffusion Models | Direct Preference Optimization | Fine Tuning | Human Feedback | Inference acceleration · Senior-level Full Time · Seattle, United States · 10d ago
-
Research Intern – Reinforcement Learning (RL) · INR 300K-420K · Agent systems | Fine Tuning | LLM Fine-tuning | Language Processing | Learning environments · Entry-level Internship · Noida · 11d ago
-
Automated Evaluation | Information Retrieval | Language Models | Language Processing | Large Language Models · Mid-level Full Time · Bellevue, Washington, USA · 11d ago
-
AI Research - Scientist/Engineer · USD 245K-350K · Benchmarking | Evaluation Frameworks | Fine Tuning | Language Models | Language Processing · Periodic in-person meetings | Work from home · Mid-level Full Time · Global · 12d ago
-
Staff Applied AI Scientist · CNY 200K-500K · Benchmarking | Cost Optimization | DPO | Deep learning | Distillation · Cross-functional collaboration | Direct impact with real customer data | Remote-friendly work · Senior-level Full Time · Shenzhen, Guangdong Province, China · 12d ago
-
Agent RL Infra Engineer · USD 224K-356K · AI Feedback | Active Learning | Cluster management | Continuous Learning | Data Curation · Senior-level Full Time · US, CA, Santa Clara, United States · 13d ago
-
Chain-of-Thought | Data Compliance | Knowledge Distillation | Language Models | Language Processing · Entry-level Full Time · Singapore, Singapore · 14d ago
-
Sr Machine Learning Engineer · USD 159K-236K · AWS | Alignment Tuning | Anomaly Detection | Azure | BERT · Financial security support | Flexible hybrid work model | Healthcare coverage | Mental health resources | Paid time off · Senior-level Full Time · USA - California - San Jose … · 15d ago
-
Agent systems | DPO | Deep learning | Evaluation | Fine Tuning · Mid-level Full Time · Bellevue, Washington, USA · 16d ago
-
Applied Reinforcement Learning Engineer · USD 150K-160K · Actor-critic | Agent systems | BCQ | Behavioral cloning | CQL · Equal opportunity employer | Hybrid remote work | Research publications opportunity · Mid-level Full Time · Remote Work (USA), United States R · 16d ago
-
AI Engineer (PhD Required) · USD 300K-405K · Architecture Search | Attention Mechanisms | Autogen | Automl | Computer Vision · Autonomy in research | Opportunity to deploy research at scale | Remote work · Mid-level Full Time · Lahore, Punjab · 18d ago
-
AI Engineer (PhD Required) · SGD 96K-138K · Attention Mechanisms | Autogen | Chunking | Constitutional AI | Distributed Training · Annual team events | Casual team environment | Flexible hours | Internet reimbursement | Opportunity for advancement · Mid-level Full Time · Singapore, Singapore · 18d ago
-
Senior AI Engineer · USD 150K-291K · API Development | Content Moderation | Content Safety | Data pipeline | Distributed Systems · 401k | Dental insurance | Flexible vacation policy | Flexible working hours | Health insurance · Senior-level Full Time · Los Altos · 18d ago
-
Research Engineer · BRL 200K-220K · Ablation Studies | Code review | Data Curation | Data Validation | Data denoising · 5 days per week working | Collaborative team | Flexible working hours | Remote work | Supportive work environment · Mid-level Full Time · Brazil · 18d ago
-
Ablation Studies | Chart Reading | Code review | Continuous integration | Data Augmentation · Collaborative team | Flexible working hours | Remote work | Supportive work environment · Mid-level Full Time · Colombia, Huila, Colombia · 18d ago
-
Principal Applied Scientist, Agentic AI · USD 181K-305K · AI Feedback | DPO | Fine Tuning | Human Feedback | Learning from Human Feedback · Mentorship and technical leadership | Remote-first work environment · Senior-level Full Time · Remote-USA, United States R · 19d ago
-
LLM Post-Training Engineer, Research & Product · USD 212K-389K · Data Pipelines | Deep learning | Distributed Training | Human preference learning | Instruction Tuning · Senior-level Full Time · San Jose, California, United States · 21d ago
-
Member of Technical Staff - Imagine Model · USD 180K-440K · Audio Processing | C++ | Computer Vision | Data Annotation | Data Augmentation · 401k | Dental insurance | Disability insurance | Employee discounts | Health insurance · Senior-level Full Time · Palo Alto, CA; Seattle, WA · 21d ago
-
Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback · 401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions · Senior-level Full Time · San Francisco, CA - US · 22d ago
-
Principal Engineer, AI Model LifeCycle · USD 260K-326K · Adapters | Checkpointing | DPO | DeepSpeed | Distributed Training · Cell phone stipend | Commuter benefits | Dental insurance | Health insurance | Mental health wellness support · Senior-level Full Time · San Francisco, CA - US · 22d ago
-
Audio generation | Benchmarking | Computer Vision | DPO | Deep learning · Mid-level Full Time · Haifa · 23d ago
-
Benchmark design | Computer Vision | Deep learning | Direct Preference Optimization | Evaluation metrics · Car to go subscriptions | Free parking | Learning opportunities | On site bakery | On-site restaurants · Mid-level Full Time · Jerusalem · 23d ago
-
Staff AI Engineer, Model Post-Training and Alignment · USD 196K-268K · Benchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy Optimization · Company events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowances · Senior-level Full Time · APAC · 23d ago
-
Research Scientist - LLM Foundation Models · TWD 1200K-1500K · A Star | C plus plus | Data Augmentation | Deep learning | Fine Tuning · Entry-level Full Time · Taiwan, Taipei R · 23d ago
-
AI Platform | AI platform development | Agent Framework | Agent Orchestration | Agent framework architectures · Equity | Full benefits · Senior-level Full Time · Palo Alto, CA · 30d ago
-
Applied Research - Forward-Deployed · USD 280K-350K · Agent Frameworks | Ambiguity tolerance | Artifact Development | Communication skills | Customer Engagement · Competitive pay | Conference attendance | Development budget | Equity | Flexible work · Senior-level Full Time · San Francisco · 1mo ago
-
API Design | Applied Machine Learning | Evaluation Pipelines | Fine Tuning | GenAI · Disability programs | Family-forming benefits | Flexible spending accounts | Health plans | Health savings accounts · Mid-level Full Time · USA - Remote, United States R · 1mo ago
-
Staff Research Engineer, MetaAI Assistant Measurement · USD 213K-293K · A/B | A/B Testing | AI interaction | AI systems | B testing · Senior-level Full Time · Bellevue, WA | New York, NY · 1mo ago