Find jobs in AI/ML, Data Science and Big Data
13 results
for Reward Modeling
(Skill/Tech stack)
-
LLM Post-Training Engineer, Research & Product USD 212K-389KData Pipelines | Deep learning | Distributed Training | Human preference learning | Instruction TuningSenior-level Full TimeSan Jose, California, United States9h ago
-
Member of Technical Staff - Imagine Model USD 180K-440KAudio Processing | C++ | Computer Vision | Data Annotation | Data Augmentation401k | Dental insurance | Disability insurance | Employee discounts | Health insuranceSenior-level Full TimePalo Alto, CA; Seattle, WA1d ago
-
Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributionsSenior-level Full TimeSan Francisco, CA - US1d ago
-
Principal Engineer, AI Model LifeCycle USD 260K-326KAdapters | Checkpointing | DPO | DeepSpeed | Distributed TrainingCell phone stipend | Commuter benefits | Dental insurance | Health insurance | Mental health wellness supportSenior-level Full TimeSan Francisco, CA - US1d ago
-
Audio generation | Benchmarking | Computer Vision | DPO | Deep learningMid-level Full TimeHaifa2d ago
-
Benchmark design | Computer Vision | Deep learning | Direct Preference Optimization | Evaluation metricsCar to go subscriptions | Free parking | Learning opportunities | On site bakery | On-site restaurantsMid-level Full TimeJerusalem2d ago
-
Staff AI Engineer, Model Post-Training and Alignment USD 196K-268KBenchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy OptimizationCompany events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowancesSenior-level Full TimeAPAC2d ago
-
Research Scientist - LLM Foundation Models TWD 1200K-1500KA Star | C plus plus | Data Augmentation | Deep learning | Fine TuningEntry-level Full TimeTaiwan, Taipei R2d ago
-
AI Platform | AI platform development | Agent Framework | Agent Orchestration | Agent framework architecturesEquity | Full benefitsSenior-level Full TimePalo Alto, CA9d ago
-
Applied Research - Forward-Deployed USD 280K-350KAgent Frameworks | Ambiguity tolerance | Artifact Development | Communication skills | Customer EngagementCompetitive pay | Conference attendance | Development budget | Equity | Flexible workSenior-level Full TimeSan Francisco29d ago
-
API Design | Applied Machine Learning | Evaluation Pipelines | Fine Tuning | GenAIDisability programs | Family-forming benefits | Flexible spending accounts | Health plans | Health savings accountsMid-level Full TimeUSA - Remote, United States R29d ago
-
Staff Software Engineer, Generative AI, Core ML USD 197K-291KBenchmarking | Data Processing | Debugging | Embodied Agents | Evaluation harnessesBenefits | Bonus | EquitySenior-level Full TimeMountain View, CA, USA30d ago
-
Staff Research Engineer, MetaAI Assistant Measurement USD 213K-293KA/B | A/B Testing | AI interaction | AI systems | B testingSenior-level Full TimeBellevue, WA | New York, NY1mo ago