Find jobs in AI/ML, Data Science and Big Data
109 results
for Learning from Human Feedback
(Skill/Tech stack)
-
Engineering Expert - AI & Data Products, Runtime Platform INR 3000K-4000KAI Agents | Apache Spark | Argo CD | CI/CD | Data ContractsSenior-level Full TimeBangalore, KA, IN, 56006620h ago
-
AI Research Engineer - Applied AI INR 2000K-3000KAPI Design | AWS SageMaker | Anomaly Detection | Azure Machine Learning | Bias auditingAsynchronous culture | Distributed team | Remote workMid-level Full TimeRemote - REMOTE, India, India R22h ago
-
Benchmarking | Data Balancing | Data Filtering | Dataset curation | Distributed TrainingRemote work worldwideSenior-level Full TimeRemote job R1d ago
-
Benchmarking | Data Pipelines | Distributed Training | Function Calling | Human FeedbackSenior-level Full TimeRemote job R1d ago
-
Research Scientist, Safety Post Training USD 216K-270KAdversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human FeedbackCommuter stipend | Comprehensive health insurance | Dental insurance | Learning and development stipend | Paid time offSenior-level Full TimeSan Francisco, CA; New York, NY2d ago
-
Agent systems | Cloud APIs | Cloud platform | Fine Tuning | Google CloudSenior-level Full TimeTokyo, Japan2d ago
-
LLM Fine-Tuning Engineer USD 150K-270KBenchmarking | Direct Preference Optimization | Distributed Training | Efficient Attention | FSDPMid-level Full TimeUnited States - Remote R3d ago
-
Applied Scientist, Wayve Labs GBP 80K-96KAutoregressive models | Depth Estimation | Diffusion Models | Foundation Models | Human FeedbackDaily yoga | Enhanced parental leave | Flexible working hours | Onsite bar | Onsite chefMid-level Full TimeLondon5d ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | Data Annotation | Fine Tuning | Human Feedback | LLM AgentEntry-level Internship深圳5d ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KEmbodied AI | Fine Tuning | Human Feedback | LLM Agents | Language ModelsEntry-level Internship深圳5d ago
-
Llm算法实习生(具身大脑方向) CNY 25K-37KAgentic RL | Fine Tuning | Human Feedback | LLM Agent | Language ModelsEntry-level Internship深圳5d ago
-
Lead AI Research Scientist USD 357K-357KCUDA | Human Feedback | JAX | Language Models | Large Language ModelsSenior-level Full TimeSan Francisco, California5d ago
-
Lead Machine Learning Engineer USD 171K-230KAWS | AWS Kinesis | Agentic Systems | Apache Kafka | AzureSenior-level Full TimeUSA - FL - Kirkman Point …5d ago
-
Applied Scientist II, Alexa International Team USD 142K-193KA/B | A/B Testing | AI Feedback | B testing | C++Entry-level Full Time InternshipBellevue, Washington, USA6d ago
-
Machine Learning Engineer, TikTok - Business Governance USD 145K-250KAI Agents | Audio Processing | Content Moderation | Deep learning | Direct Preference OptimizationMid-level Full TimeSan Jose, California, United States7d ago
-
Machine Learning Engineer, LLM Evals & Observability USD 200K-300KData Pipelines | Distributed Systems | Go | Human Feedback | LLM Evaluation401k contribution | Education stipend | Generous time off | Healthy lunches daily | Home office improvement stipendMid-level Full TimeMountain View, CA7d ago
-
Applied Researcher I (AI Foundations, LLM Customization, Finetuning, Reinforcement Learning) USD 218K-272KAWS | Artificial Intelligence | Data Curation | Dataset labeling | Deep learningHealth insuranceEntry-level Full TimeMcLean, VA, United States7d ago
-
Applied Scientist II, Alexa International Team USD 142K-193KA/B | A/B Testing | AI Feedback | B testing | Deep learningEntry-level Full Time InternshipBellevue, Washington, USA7d ago
-
Software Engineering Manager, LLM Training USD 170K-277KCUDA | Containerization | Context Parallelism | Data I/O | Data parallelismEntry-level Full TimeMountain View, CA, United States7d ago
-
DPO | Data Curation | Data Quality | DeepSpeed | Distributed TrainingCareer growth | Collaborative culture | Continuous learning | Hybrid work | Inclusive workplaceSenior-level Full TimeIndia8d ago
-
AI Architect CZK 1098K-1248KActive Learning | Artificial Intelligence | Evaluation Frameworks | Human Feedback | Language ProcessingSenior-level Full TimePrague, Czech, Czechia8d ago
-
Data Processing | Deep learning | Distributed Training | Generative Models | Human FeedbackFamily leave | Free food and snacks | Health care plan | Life insurance | Long-term disabilitySenior-level Full Time费利蒙9d ago
-
Senior Manager Data Scientist SGD 120K-162KAWS | Cloud Computing | Cloud platform | Data Preprocessing | Deep learningSenior-level Full TimeSingapore9d ago
-
Data Annotation | Data Augmentation | Data Quality | Data cleaning | Data labelingMid-level Full TimeCrimson House Singapore9d ago
-
AI Research Engineer USD 152K-258KContrastive Learning | Deep learning | Distributed Computing | Fine Tuning | Generative AIDental insurance | Flexible-hybrid work | Health insurance | Relocation assistance | Retirement planMid-level Full TimePalo Alto, California, United States10d ago
-
A/B | A/B Testing | B testing | Continual Learning | Dataset ConstructionSenior-level Full TimeSingapore, Singapore11d ago
-
Generative AI Analyst INR 2500K-3000KAssurance testing | Data labeling | Dataset curation | Human Feedback | Language ModelsMid-level Full TimeAsia (Remote), India R11d ago
-
Generative AI Analyst INR 2500K-3000KData labeling | Human Feedback | Language Models | Large Language Models | Learning from Human FeedbackMid-level Full TimeAsia (Remote), India R11d ago
-
Generative AI Analyst INR 2500K-3000KData Tagging | Data labeling | Human Feedback | Language Models | Large Language ModelsMid-level Full TimeAsia (Remote), India R11d ago
-
Generative AI Analyst INR 2500K-3000KHuman Feedback | Labeling | Language Models | Large Language Models | Learning from Human FeedbackEntry-level Full TimeAsia (Remote), India R11d ago
-
Generative AI Analyst INR 2500K-3000KDataset development | Human Feedback | Labeling | Language Models | Large Language ModelsMid-level Full TimeAsia (Remote), India R11d ago
-
Generative AI Analyst INR 2500K-3000KData labeling | Human Feedback | Language Models | Large Language Models | Learning from Human FeedbackMid-level Full TimeAsia (Remote), India R11d ago
-
Technical Program Manager, Discovery USD 365K-435KCompute Planning | Data Infrastructure | Data pipeline | Data pipeline debugging | ForecastingFlexible working hours | Generous vacation | Health and wellness benefits | Hybrid work policy | Optional equity donation matchingMid-level Full TimeSan Francisco, CA | New York …11d ago
-
Machine Learning Engineer, Global Public Sector GBP 100K-170KBenchmarking | Bias Mitigation | Deep learning | Direct Preference Optimization | Distributed TrainingMid-level Full TimeDoha, Qatar; London, UK12d ago
-
Senior Applied Scientist USD 142K-270KData Pipelines | Diffusion Models | Direct Preference Optimization | Fine Tuning | Generative AISenior-level Full TimeSan Jose, United States R12d ago
-
Staff AI Scientist USD 190K-300KAI Feedback | AIBERT Family | Adversarial Machine Learning | Agentic Systems | BERT401k plan | Fastrak reimbursement | Free annual Caltrain pass | Free lunch | Health, dental and vision coverageSenior-level Full TimePalo Alto13d ago
-
Alignment | Benchmark design | Constitutional AI | Continued Pretraining | Data CurationSenior-level Full TimeDublin, CA (HQ)15d ago
-
Senior Applied AI Researcher (India) INR 2500K-4500KArtificial Intelligence | DPO | Data parallelism | DataLoader | DeepSpeedSenior-level Full TimeIndia/Bengaluru15d ago
-
Senior Applied AI Researcher (Brazil) BRL 271K-370KCI/CD | DPO | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeBrazil/Remote R15d ago
-
Senior Applied AI Researcher (Dublin, CA) USD 190K-300KAutomated testing | Continuous Evaluation | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeDublin, CA (HQ)15d ago
-
Applied AI Researcher (India) INR 2000K-3465KAWS | Automated testing | Azure | CI/CD | Cloud ComputingMid-level Full TimeIndia/Bengaluru15d ago
-
Applied AI Researcher (Dublin, CA) USD 239K-331KCI/CD | Computer Vision | Data Preprocessing | Deep learning | Direct Preference OptimizationMid-level Full TimeDublin, CA (HQ)15d ago
-
Senior GenAI Specialist – Finance – Vice President INR 2000K-4000KAWS | Azure | CI/CD | Chroma | DjangoSenior-level Full TimePLOT NO-1, S.NO. 77, India15d ago
-
Sr. Engineer USD 122K-200KAgentic design | Amazon S3 | Apache Spark | Artifactory | Artificial IntelligenceBenefits eligible | Employee wellness support | Paid time offSenior-level Full TimeCharlotte, United States15d ago
-
Agentic Systems | Deep learning | Diffusion Models | Fine Tuning | Generative AI401k eligibility | Annual bonus | Dental insurance | Medical insurance | Paid time offSenior-level Full TimeLos Altos, CA16d ago
-
API Integration | Anthropic | Embeddings | Faiss | Hugging FaceFlexible work programs | Inclusive benefits | Mentorship | Wellbeing supportMid-level Full TimeBengaluru Millenia, India19d ago
-
APIs | Anthropic APIs | Data Privacy | Embeddings | FaissFlexibility programs | Inclusive benefits | Mentorship | Wellbeing supportMid-level Full TimeBengaluru Millenia, India19d ago
-
Senior Principal Machine Learning Engineer (Fulfilment) SGD 182K-240KDecision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic ModelsBirthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore20d ago
-
Senior Principal Data Scientist (Fulfilment) SGD 224K-252KDecision Processes | DeepSpeed | Distributed Training | Dynamic Models | FSDPBirthday leave | Flexible work arrangements | Life insurance | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore20d ago
-
Data Science - AI USD 245K-295KA/B | A/B Testing | Annotation Guidelines | B testing | Data labeling401k match | Commuter benefits | Compassionate leave | Family support | Hybrid workMid-level Full TimeSan Francisco22d ago