Find jobs in AI/ML, Data Science and Big Data
129 results for Supervised Fine Tuning (Skill/Tech stack)
-
LLM Research Scientist · HKD 150K-220K
Agents | Amazon Web Services | Anomaly Detection | Big Data | Classification
Career growth | High autonomy | Knowledge sharing | Technical leadership
Senior-level · Full Time · Hong Kong, Hong Kong · 12h ago
-
Mid-level · Full Time · Herndon, VA · 1d ago
-
LLM Fine-Tuning Engineer · USD 100K-150K
Adapter methods | Benchmarking | DPO | DeepSpeed ZeRO | Distributed Training
Career growth | Mentorship
Mid-level · Full Time · United States - Remote · 1d ago
-
Large Model Algorithm Researcher - MiMo · CNY 500K-500K
AI Feedback | Active Learning | C++ | Curriculum learning | Deep learning
Mid-level · Full Time · Beijing · 3d ago
-
Student Researcher (LLM Post Training – Agent & Reinforcement Learning) - 2026 Start (PhD) · USD 202K-368K
Coding | Data Construction | Fine Tuning | Instruction Tuning | Language Models
Internship
Entry-level · Full Time · San Jose, California, United States · 5d ago
-
Benchmarking | Data Preprocessing | Dataset curation | Deep learning | Fine Tuning
Mid-level · Full Time · Santa Clara, CA, United States · 5d ago
-
LLM Fine-Tuning Engineer · USD 100K-150K
Adapter methods | DPO | Dataset curation | Distributed Training | Efficient Attention
Career growth | Mentorship | Remote work
Mid-level · Full Time · United States - Remote · 5d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Andorra) · USD 150K-225K
Direct Preference Optimization | Fine Tuning | Huggingface | Human Feedback | Information Retrieval
Co-working space budget | Equipment provided | Fully remote | Health insurance support | Learning budget
Senior-level · Full Time · Andorra (Remote) · 7d ago
-
Classification | Content Moderation | DPO | Dataset Preparation | Fine Tuning
Co-working space budget | Fully remote | Health insurance support | Learning budget | Paid time off
Senior-level · Full Time · Ukraine (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Turkey) · TRY 840K-1080K
Classifiers | Data labeling | Direct Preference Optimization | Evaluation | Fine Tuning
Access to AI tools | Annual in-person meetup | Co-working space budget | Equipment budget | Fully remote
Senior-level · Full Time · Turkey (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Sweden) · SEK 738K-930K
Classification | DPO | Data labeling | Dataset cleaning | Evaluation Frameworks
AI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remote
Senior-level · Full Time · Sweden (Remote) · 7d ago
-
Classifiers | DPO | Data labeling | Evaluation | Fine Tuning
Annual in-person meetup | Co-working space budget | Equipment provided | Full remote | Health and wellness support
Senior-level · Full Time · Latvia (Remote) · 7d ago
-
Classifier Training | Content Moderation | DPO | Data cleaning | Data labeling
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health insurance allowance
Senior-level · Full Time · Slovakia (Remote) · 7d ago
-
Classification | Context window | Context window management | DPO | Data cleaning
Access to mental health counseling | Co-working space budget | Company equipment provision | Fully remote | Health insurance allowance
Senior-level · Full Time · Greece (Remote) · 7d ago
-
Classification | Context window | Context window management | Data labeling | Direct Preference Optimization
Co-working space budget | Fully remote | Health insurance support | Learning budget | Paid time off
Senior-level · Full Time · Ireland (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Romania) · RON 245K-348K
DPO | Fine Tuning | Huggingface | Human Feedback | Inference Optimization
AI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remote
Senior-level · Full Time · Romania (Remote) · 7d ago
-
Classifier Training | DPO | Fine Tuning | Huggingface | Human Feedback
AI tools access | Annual in-person meetup | Co-working space budget | Company laptop | Fully remote
Senior-level · Full Time · Switzerland (Remote) · 7d ago
-
Classifier Training | Context window | Context window optimization | DPO | Data cleaning
AI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remote
Senior-level · Full Time · Lithuania (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Serbia) · USD 150K-225K
Classification | Context window | Context window optimization | DPO | Data cleaning
Annual in-person meetup | Co-working space budget | Company laptop | Fully remote | Health and wellness support
Senior-level · Full Time · Serbia (Remote) · 7d ago
-
DPO | Fine Tuning | Huggingface | LLM | Machine Learning
Co-working space budget | Equipment provided | Fully remote | Health insurance allowance | Learning budget
Senior-level · Full Time · Luxembourg (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Malta) · EUR 79K-100K
DPO | Data labeling | Fine Tuning | Huggingface | Human Feedback
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness support
Senior-level · Full Time · Malta (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Moldova) · USD 150K-225K
DPO | Data labeling | Fine Tuning | Huggingface | Inference Optimization
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness support
Senior-level · Full Time · Moldova (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Italy) · EUR 79K-100K
Classification Models | Context Management | DPO | Data cleaning | Data labeling
AI tools access | Annual in-person gathering | Co-working space budget | Equipment provided | Fully remote
Senior-level · Full Time · Italy (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Hungary) · HUF 11000K-18960K
Classifiers | Context window | Context window optimization | Data cleaning | Data labeling
AI tools access | Co-working space budget | Company laptop | Fully remote | Health insurance allowance
Senior-level · Full Time · Hungary (Remote) · 7d ago
-
DPO | Data labeling | Fine Tuning | Huggingface | Human Feedback
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness support
Senior-level · Full Time · Slovenia (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Norway) · NOK 1100K-1250K
Classification | Data labeling | Dataset cleaning | Direct Preference Optimization | Fine Tuning
AI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remote
Senior-level · Full Time · Norway (Remote) · 7d ago
-
Classification Algorithms | Context window | Context window optimization | DPO | Data labeling
AI tools access | Co-working space budget | Company equipment | Fully remote | Health insurance support
Senior-level · Full Time · Montenegro (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Poland) · PLN 324K-450K
Classification | Context window | Context window optimization | Data cleaning | Data labeling
AI tools access | Co-working space budget | Equipment budget | Fully remote | Health and wellness support
Senior-level · Full Time · Poland (Remote) · 7d ago
-
DPO | Data labeling | Fine Tuning | Huggingface | LLM
Co-working space budget | Company laptop and equipment budget | Fully remote | Health insurance allowance | Learning budget
Senior-level · Full Time · Belgium (Remote) · 7d ago
-
Classifier Training | Context window | Context window optimization | Data cleaning | Data labeling
Access to AI tools | Co-working space budget | Equipment provided | Fully remote | Health insurance allowance
Senior-level · Full Time · Germany (Remote) · 7d ago
-
Context window | Context window optimization | Data labeling | Dataset cleaning | Direct Preference Optimization
1:1 psychologist sessions | AI tools access | Annual in-person meetup | Co-working space budget | Company-provided equipment
Senior-level · Full Time · Croatia (Remote) · 7d ago
-
Classifier Training | DPO | Data cleaning | Data labeling | Fine Tuning
AI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remote
Senior-level · Full Time · Austria (Remote) · 7d ago
-
Classifier Training | Context window | Context window optimization | Data labeling | Dataset cleaning
AI tool access | Annual in-person meetup | Co-working budget | Equipment provided | Health and wellness support
Senior-level · Full Time · Portugal (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Spain) · EUR 80K-100K
Data labeling | Direct Preference Optimization | Fine Tuning | Huggingface | Human Feedback
Co-working space budget | Company equipment provided | Fully remote | Health and wellness support | Learning budget
Senior-level · Full Time · Spain (Remote) · 7d ago
-
Classification | Context window | Context window optimization | DPO | Data cleaning
AI tools access | Annual meetup | Co-working budget | Equipment budget | Fully remote
Senior-level · Full Time · Estonia (Remote) · 7d ago
-
Classifier Training | Context window | Context window management | Data labeling | Dataset cleaning
AI tools access | Co-working space budget | Equipment provided | Fully remote | Health insurance support
Senior-level · Full Time · Cyprus (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Czech Republic) · CZK 1020K-1200K
Classifiers | Context window | Context window optimization | DPO | Data labeling
AI tools access | Co-working budget | Equipment stipend | Fully remote | Health & wellness support
Senior-level · Full Time · Czech Republic (Remote) · 7d ago
-
Classification | DPO | Data labeling | Dataset cleaning | Evaluation Frameworks
AI tools access | Annual in-person meetup | Co-working space budget | Company laptop provided | Fully remote
Senior-level · Full Time · Bulgaria (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Gibraltar) · USD 150K-225K
Classifier Training | Context window | Context window optimization | Data cleaning | Data labeling
AI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remote
Senior-level · Full Time · Gibraltar (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - UK) · GBP 90K-140K
DPO | Data labeling | Evaluation Frameworks | Fine Tuning | Huggingface
AI tools access | Annual in-person meetup | Co-working space budget | Equipment allowance | Fully remote
Senior-level · Full Time · United Kingdom (Remote) · 7d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Denmark) · DKK 516K-580K
DPO | Fine Tuning | Hugging Face | Language Models | Large Language Models
AI tools access | Annual in-person meetup | Co-working space budget | Equipment budget | Fully remote
Senior-level · Full Time · Denmark (Remote) · 7d ago
-
Content Moderation | DPO | Data labeling | Fine Tuning | Huggingface
AI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remote
Senior-level · Full Time · Finland (Remote) · 7d ago
-
Data labeling | Dataset cleaning | Direct Preference Optimization | Fine Tuning | Hugging Face
1 on 1 therapy sessions | Co-working space budget | Equipment provided | Fully remote | Health insurance support
Senior-level · Full Time · Netherlands (Remote) · 7d ago
-
Classification | Data labeling | Direct Preference Optimization | Fine Tuning | Huggingface
AI tools access | Co-working space budget | Company laptop and equipment budget | Fully remote | Health and wellness support
Senior-level · Full Time · France (Remote) · 7d ago
-
Research Engineer Intern (Multimodal LLM) · USD 180K-254K
Computer Vision | Distributed Training | Fine Tuning | GPU Computing | Language Models
Collaboration with industry experts | Internship community events | Remote work
Senior-level · Internship · Remote job · 7d ago
-
Senior Applied Scientist · USD 142K-270K
AI Model Training | AI model | Fine Tuning | Generative AI | Inference Optimization
Senior-level · Full Time · San Jose, United States (Remote) · 7d ago
-
Data Curation | Deep learning | DeepSpeed | Direct Preference Optimization | Evaluation
Senior-level · Full Time · Singapore, Singapore · 8d ago
-
Data labeling | Direct Preference Optimization | Fine Tuning | Huggingface | Human Feedback
AI tools access | Co-working budget | Equipment provided | Fully remote | Health and wellness support
Senior-level · Full Time · Europe (Remote) · 8d ago
-
Senior Data Scientist (LLM Post-Training) · INR 2800K-4000K
AI Agents | Data Augmentation | Data Generation | Data cleaning | Data labeling
Birthday leave | Confidential Employee Assistance Program | FlexWork | Medical insurance | Parental leave
Senior-level · Full Time · Bangalore, India · 8d ago
-
Data Scientist (Generative AI) · USD 125K-160K
APIs | AWS Bedrock | AWS Kendra | AWS SageMaker | Adversarial Networks
Entry-level · Full Time · McLean, VA, United States · 9d ago