Find jobs in AI/ML, Data Science and Big Data
112 results
for Supervised Fine Tuning
(Skill/Tech stack)
-
AI Research Scientist (Multimodal post-training) USD 166K-258KComputer Vision | Cross-modal fusion | Deep learning | Fine Tuning | Human FeedbackDiscretionary vacation | Equity shares | Flexible working hours | Health, dental, and vision insurance | Meal allowanceMid-level Full TimeEurope4d ago
-
Senior Software Engineer - Model Training & AI Evals INR 3500K-5000KAI Feedback | Ablation Studies | Benchmarking | CI/CD | Data GenerationSenior-level Full TimeRemote (India) R5d ago
-
Mid-level Full Time上海5d ago
-
Mid-level Internship上海、北京5d ago
-
Staff AI research scientist USD 234K-296KAdversarial Training | Agentic Systems | Benchmark design | Data Curation | Data GenerationCompany holidays | Company offsites | Dental insurance | Dependent FSA | Fertility supportSenior-level Full TimeSan Francisco, CA5d ago
-
AI Research Engineer (Multi-Modal & Vision) USD 100K-150KComputer Vision | Data Balancing | Data Filtering | Dataset curation | Distributed TrainingGlobal team collaboration | Professional publication support | Remote workSenior-level Full TimeRemote job R6d ago
-
AI Research Engineer (Multi-Modal & Vision) USD 100K-150KBenchmarking | Data Balancing | Data Filtering | Dataset curation | Distributed GPURemote workSenior-level Full TimeRemote job R6d ago
-
AI Research Engineer (Multi-Modal & Vision) USD 100K-150KData Curation | Dataset Filtering | Distributed Training | Efficient Fine Tuning | Evaluation benchmarksRemote workSenior-level Full TimeRemote job R6d ago
-
Computer Vision | Content Moderation | Data Curation | Deep learning | Few-Shot LearningMid-level Full TimeSingapore, Singapore6d ago
-
Staff Machine Learning Engineer INR 2817K-4401KA/B | A/B Testing | AWS | Agent Orchestration | Agentic AIFinancial security support | Healthcare coverage | Mental health resources | Paid time offSenior-level Full TimeIND - Karnataka - Bangalore - …7d ago
-
Senior Machine Learning Engineer USD 188K-282KAdversarial Training | Calibration monitoring | Continuous batching | DPO | Deep learningSenior-level Full TimePalo Alto, CA7d ago
-
Senior-level Full TimeTaipei, Taiwan8d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapters | DPO | Dataset curation | Efficient Attention | Evaluation benchmarksBenefits package | H1B transfer support | Remote workMid-level Full TimeUnited States - Remote R8d ago
-
Senior Applied Scientist, AGI Customization USD 167K-260KAmazon SageMaker | C++ | Deep learning | Distributed Systems | Experiment designHealth insurance | Paid time off | Parental leaveSenior-level Full TimeSunnyvale, California, USA8d ago
-
Cloud AI Engineer INR 2500K-4000KAWS | Agentic Systems | Azure | Data Engineering | ETLElder care | Health checks | Insurance with top-ups | New parent support | Partner coverageMid-level Full TimeMumbai (ex Bombay), IN8d ago
-
Machine Learning Engineer USD 170K-315KData Preprocessing | Deep learning | Evaluation benchmarks | Fine Tuning | GPU ProfilingHealth benefits | Hybrid work model | Retirement benefits | Vacation timeMid-level Full TimeUSA - CA - Santa Clara, …8d ago
-
Data Analysis | Deep learning | Direct Preference Optimization | Fine Tuning | Language ModelsSenior-level Full TimeSunnyvale, CA, USA9d ago
-
Senior AI Engineer, AI Lab GBP 90K-131KBLEU | Bark | DVC | ElevenLabs | Fine TuningAnnual leave | Employee assistance program | Free Economist content online subscription | Moving home allowance | Parental leaveSenior-level Full TimeLondon - Commercial R10d ago
-
Research Scientist, LLM Evaluation & Post-Training USD 150K-300KAI Feedback | Alignment | Benchmarking | Context evaluation | Deep learningMid-level Full TimeRemote Work( USA), United States R11d ago
-
Mid-level Full TimeTel Aviv-Yafo, Tel Aviv District, IL15d ago
-
大语言模型后训练算法工程师 CNY 240K-480KDistributed Training | Docker | Fine Tuning | Human Feedback | KubernetesMid-level Full Time深圳、上海17d ago
-
Senior AI Engineer - LLMs and Finetuning GBP 84K-109KAWS | Benchmarking | Data Generation | Distillation | Distributed TrainingAI experimentation budget | Enhanced parental leave | Flexible working arrangements | Group life assurance | Hybrid working modelSenior-level Full TimeLondon17d ago
-
Research Engineer - LLM Training & Alignment Systems CAD 127K-225KAutomation | Benchmarking | C# | C++ | Data CurationMid-level Contract Full TimeKingston, Ontario, Canada19d ago
-
Senior Machine Learning Engineer , AI Platform USD 150K-210KArtificial Intelligence | Batch Processing | Data Analysis | Data Pipelines | Data PrivacySenior-level Full TimeBoston, MA20d ago
-
Student Researcher (LLM Post Training – Agent & Reinforcement Learning) - 2026 Start (PhD) USD 202K-368KCoding | Data Construction | Fine Tuning | Instruction Tuning | Language ModelsInternshipEntry-level Full TimeSan Jose, California, United States25d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Andorra) USD 150K-225KDirect Preference Optimization | Fine Tuning | Huggingface | Human Feedback | Information RetrievalCo-working space budget | Equipment provided | Fully remote | Health insurance support | Learning budgetSenior-level Full TimeAndorra R27d ago
-
Classification | Content Moderation | DPO | Dataset Preparation | Fine TuningCo-working space budget | Fully remote | Health insurance support | Learning budget | Paid time offSenior-level Full TimeUkraine R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Turkey) TRY 840K-1080KClassifiers | Data labeling | Direct Preference Optimization | Evaluation | Fine TuningAccess to AI tools | Annual in-person meetup | Co-working space budget | Equipment budget | Fully remoteSenior-level Full TimeTurkey R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Sweden) SEK 738K-930KClassification | DPO | Data labeling | Dataset cleaning | Evaluation FrameworksAI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remoteSenior-level Full TimeSweden R27d ago
-
Classifiers | DPO | Data labeling | Evaluation | Fine TuningAnnual in-person meetup | Co-working space budget | Equipment provided | Full remote | Health and wellness supportSenior-level Full TimeLatvia R27d ago
-
Classifier Training | Content Moderation | DPO | Data cleaning | Data labelingAI tools access | Co-working space budget | Equipment provided | Fully remote | Health insurance allowanceSenior-level Full TimeSlovakia R27d ago
-
Classification | Context window | Context window management | DPO | Data cleaningAccess to mental health counseling | Co-working space budget | Company equipment provision | Fully remote | Health insurance allowanceSenior-level Full TimeGreece R27d ago
-
Classification | Context window | Context window management | Data labeling | Direct Preference OptimizationCo-working space budget | Fully remote | Health insurance support | Learning budget | Paid time offSenior-level Full TimeIreland R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Romania) RON 245K-348KDPO | Fine Tuning | Huggingface | Human Feedback | Inference OptimizationAI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remoteSenior-level Full TimeRomania R27d ago
-
Classifier Training | DPO | Fine Tuning | Huggingface | Human FeedbackAI tools access | Annual in-person meetup | Co-working space budget | Company laptop | Fully remoteSenior-level Full TimeSwitzerland R27d ago
-
Classifier Training | Context window | Context window optimization | DPO | Data cleaningAI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remoteSenior-level Full TimeLithuania R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Serbia) USD 150K-225KClassification | Context window | Context window optimization | DPO | Data cleaningAnnual in-person meetup | Co-working space budget | Company laptop | Fully remote | Health and wellness supportSenior-level Full TimeSerbia R27d ago
-
DPO | Fine Tuning | Huggingface | LLM | Machine LearningCo-working space budget | Equipment provided | Fully remote | Health insurance allowance | Learning budgetSenior-level Full TimeLuxembourg R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Malta) EUR 79K-100KDPO | Data labeling | Fine Tuning | Huggingface | Human FeedbackAI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness supportSenior-level Full TimeMalta R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Moldova) USD 150K-225KDPO | Data labeling | Fine Tuning | Huggingface | Inference OptimizationAI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness supportSenior-level Full TimeMoldova R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Italy) EUR 79K-100KClassification Models | Context Management | DPO | Data cleaning | Data labelingAI tools access | Annual in-person gathering | Co-working space budget | Equipment provided | Fully remoteSenior-level Full TimeItaly R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Hungary) HUF 11000K-18960KClassifiers | Context window | Context window optimization | Data cleaning | Data labelingAI tools access | Co-working space budget | Company laptop | Fully remote | Health insurance allowanceSenior-level Full TimeHungary R27d ago
-
DPO | Data labeling | Fine Tuning | Huggingface | Human FeedbackAI tools access | Co-working space budget | Equipment provided | Fully remote | Health and wellness supportSenior-level Full TimeSlovenia R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Norway) NOK 1100K-1250KClassification | Data labeling | Dataset cleaning | Direct Preference Optimization | Fine TuningAI tools access | Annual in-person meetup | Co-working space budget | Company equipment provided | Fully remoteSenior-level Full TimeNorway R27d ago
-
Classification Algorithms | Context window | Context window optimization | DPO | Data labelingAI tools access | Co-working space budget | Company equipment | Fully remote | Health insurance supportSenior-level Full TimeMontenegro R27d ago
-
Tech Lead, LLM & Generative AI (Full Remote - Poland) PLN 324K-450KClassification | Context window | Context window optimization | Data cleaning | Data labelingAI tools access | Co-working space budget | Equipment budget | Fully remote | Health and wellness supportSenior-level Full TimePoland R27d ago
-
DPO | Data labeling | Fine Tuning | Huggingface | LLMCo-working space budget | Company laptop and equipment budget | Fully remote | Health insurance allowance | Learning budgetSenior-level Full TimeBelgium R27d ago
-
Classifier Training | Context window | Context window optimization | Data cleaning | Data labelingAccess to AI tools | Co-working space budget | Equipment provided | Fully remote | Health insurance allowanceSenior-level Full TimeGermany R27d ago
-
Context window | Context window optimization | Data labeling | Dataset cleaning | Direct Preference Optimization1 1 psychologist sessions | AI tools access | Annual in-person meetup | Co-working space budget | Company-provided equipmentSenior-level Full TimeCroatia R27d ago
-
Classifier Training | DPO | Data cleaning | Data labeling | Fine TuningAI tools access | Annual in-person meetup | Co-working space budget | Equipment provided | Fully remoteSenior-level Full TimeAustria R27d ago