AI Research Scientist, Text Data Research - MSL FAIR
Bellevue, WA | Menlo Park, CA | Seattle, WA
USD 147K-208K (estimate) | Entry-level | Full Time
Tasks
- Advance data research for synthetic data
- Apply expertise in agentic data and synthetic data
- Architect scalable data curation systems and pipelines
- Collaborate with cross-functional teams to develop foundational models
- Execute pre-training, mid-training, and post-training data curation projects
- Improve data velocity using data tooling
- Lead complex technical projects end-to-end
Perks/Benefits
- N/A
Skills/Tech-stack
Agentic data | Apache Hive | Apache Spark | Data Curation | Data Generation | Data Optimization | Data Scaling Laws | Data scaling | Language Models | Language Processing | Large Language Models | Machine Learning | Natural Language | Natural Language Processing | PyTorch | Reasoning data | SQL | Scaling Laws | Synthetic Data Generation | Synthetic data