Applied AI Researcher, Post-Training
Tasks
- Adapt foundation models for real world performance
- Align large models with human and system objectives
- Build continual adaptation pipelines
- Conduct model evaluation experiments
- Curate training data and run reward modeling
- Develop preference optimization techniques
- Develop supervised fine tuning techniques
- Prototype intelligent model based systems
- Tune and adapt LLMs SLMs to specialized domains
Perks/Benefits
Skills/Tech-stack
Agentic collaboration | Continual Learning | Continual pretraining | DPO | Data Analysis | Data Curation | Ensembling | Fine Tuning | Graph-of-Thoughts | Instruction Tuning | LLM | LoRA | Model Evaluation | PEFT | Preference optimization | RLAIF | RLHF | React | Reward Modeling | SLM | Supervised Fine Tuning
Education
N/A
Roles
Regions
Countries
States
Related jobs
-
Research Scientist, Frontier Health, DeepMind USD 174K-252KClinical Reasoning | Evaluation | Experimentation | GRPO | Human evaluationMid-level Full TimeMountain View, CA, USA3d ago
-
Senior UX Quantitative Researcher, Human Factors USD 159K-231KC++ | Data Analysis | Data Manipulation | Econometrics | GoInternal tools | Mentorship | Regular meetupsSenior-level Full TimeSan Jose, CA, USA; Miami, FL, …4d ago
-
Student Researcher (LLM Post Training – Agent & Reinforcement Learning) - 2026 Start (PhD) USD 202K-368KCoding | Data Construction | Fine Tuning | Instruction Tuning | Language ModelsInternshipEntry-level Full TimeSan Jose, California, United States5d ago
-
Staff AI Researcher USD 148K-210KData Preprocessing | Deep learning | Distributed Systems | Feature Engineering | Fine Tuning401k match | Dental insurance | Educational reimbursement | Flexible work schedule | Health insuranceSenior-level Full TimeRemote, United States R7d ago
-
Algorithmic trading | Automated Execution | Data Analysis | Econometrics | Execution Strategy OptimizationAnnual discretionary bonus | Flexible time off | Healthcare benefits | Hybrid work model | Retirement benefitsSenior-level Full TimeNY7 - 50 Hudson Yards, New … R8d ago
-
AI Governance | Adjudication | Artificial Intelligence | Calibration | Data labelingSenior-level Full TimeSan Francisco, California, United States8d ago
-
Researcher, Context - Agent Post-Training USD 250K-380KData Pipelines | Deep learning | Experimentation | Grading systems | Language ModelsMid-level Full TimeSan Francisco11d ago
-
Senior AI Researcher (Foundation AI) USD 190K-230KCI/CD | Cloud Computing | Context Parallelism | DPO | Data parallelismSenior-level Full TimeBoston, MA11d ago
-
Equity Index Quantitative Researcher- USD 155K-285KData Analysis | Equity Index | Equity Index Research | Factor investing | Index research401k match | Dental insurance | Life insurance | Long-term disability | Medical insuranceSenior-level Full TimeNew York11d ago
-
Applied Researcher, Vision Language Models/VLM - TikTok USD 145K-355KData Processing | Deep learning | Human Feedback | Image Captioning | Language ModelsMid-level Full TimeSan Jose, California, United States11d ago
-
Data Analysis | MAXQDA | Machine Learning | Project coordination | PythonCareer growth based on performance | Direct customer contact | High responsibility | MacBook | Modern toolsEntry-level Full TimeHomeoffice R11d ago
-
UX Researcher, Quantitative Analysis USD 90K-130KA/B | A/B Testing | AI Tooling | B testing | Behavioral DataDisability benefits | Equity awards | Health insurance | Life insurance | Paid time offEntry-level Full TimeSan Jose, California11d ago
-
Agent systems | Air gapped deployment | Air-gapped | Artificial Intelligence | Cost Optimization401k | Dental insurance | Equity incentives | FSA | Health insuranceSenior-level Full TimeSeattle, WA or McLean, VA or … R12d ago
-
Agent systems | Agentic Systems | Air gapped deployments | Air-gapped | Artificial Intelligence401k | Career advancement | Employer paid health care | Equity incentives | FSASenior-level Full TimeSeattle, WA or McLean, VA or … R12d ago
-
Principal AI Researcher USD 139K-304KAWS Bedrock | AWS SageMaker | Agent systems | Artificial Intelligence | Autogen401k match | Dental insurance | Medical insurance | Time off | Training and developmentSenior-level Full TimeRemote, Europe; Remote, UK; Remote, US R12d ago
-
Data Analysis | Data Retrieval | Data entry | Database search | ExcelMid-level Full TimeRockville, MD, USA13d ago
-
Data Curation | Data Generation | Deep learning | Distributed Training | Fine TuningInternship benefitsEntry-level Full Time InternshipUS, CA, Santa Clara, United States13d ago
-
Senior Staff Research Scientist, Agentic AI & RL USD 150K-200KDocker | Fine Tuning | LLM Fine-tuning | Language Models | Language ProcessingHigh autonomy | MentorshipSenior-level Full TimeRemote Work( USA), United States R14d ago
-
Postdoctoral Fellow In Statistical Genetics USD 60K-75KAdmixed Population Studies | Association analysis | Bioinformatics | Computational Methods | Data AnalysisMentorship opportunities | Student supervisionNone Full TimeCenter for Life Sciences, Boston, United …15d ago
-
Applied AI Researcher, Multi-Agent Systems USD 150K-250KAgent Orchestration | Agent systems | Communication Protocols | Data Analysis | Graph Neural Networks401k | Commuter benefits | Dental insurance | Health insurance | Hybrid workExecutive-level Full TimeSan Francisco15d ago
-
UX Quantitative Researcher USD 157K-235KBehavioral analytics | Data Analysis | Data Visualization | Data dashboards | Descriptive Statistics401k | Disability insurance | Life insurance | Medical/Dental/Vision insurance | Paid HolidaysSenior-level Full Time388 GREENWICH STREET - TOWER, United …16d ago
-
Principal AI Researcher USD 190K-342KAgent systems | Context | DPO | Data Privacy | Enterprise DataFlexible work schedule | In person work at least 50 percent of time per quarterSenior-level Full TimeUSA, CA, Pleasanton, United States18d ago
-
AI Researcher, LLMs USD 200K-300KDataset curation | Distributed Training | Distributed inference | Fine Tuning | GPU ComputingEntry-level Full TimeLondon, United Kingdom; New York, NY, …18d ago
-
AI Researcher Intern USD 60KAsynchronous programming | Autogen | Concurrency Control | Containerization | CrewAIBilingual Mandarin Chinese and English | Remote workEntry-level InternshipUnited States18d ago
-
Experimental Dynamic Materials - Postdoctoral Researcher USD 123K-123KCondensed Matter Physics | Condensed matter | Data Analysis | EXAFS | High pressure physics401 K | Education reimbursement program | Flexible work schedule | Hybrid schedule | Relocation assistanceEntry-level Full TimeLivermore, CA, United States R19d ago