Research Scientist - Data
Tasks
- Build data synthesis pipelines for code and mathematics
- Contribute to research papers
- Create automated data quality assessment frameworks
- Curate web scale training data for foundation models
- Design data recipes for diverse domains
- Develop data collection and curation methodologies for LLMs and multi modal models
- Evaluate data impact from pre training to model capabilities
- Optimize data model co design for training dynamics
- Represent organization at industry conferences
Perks/Benefits
- 401k plan
- Dental insurance
- Disability insurance
- Employee assistance program
- Life insurance
- Medical insurance
- Paid Holidays
- Paid parental leave
- Paid sick leave
- Paid time off
- Vision insurance
Skills/Tech-stack
Data Curation | Data Pipelines | Data Synthesis | Data collection | Dataset evaluation | Deep learning | Fine Tuning | LLM-as-a-Judge | Language Models | Language Processing | Large Language Models | Machine Learning | Model Evaluation | Multi-Modal | Multi-Modal Learning | Natural Language | Natural Language Processing | Prompt engineering | Python | Tokenization | Web data | Web data collection
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R3d ago
-
Data Scientist - Inference, Community Support USD 151K-175KBayesian Modeling | Causal Inference | Experimental Design | Feature Engineering | Machine LearningEmployee travel credits | Inclusion and Belonging Culture | Remote eligibleSenior-level Full TimeRemote - USA R22h ago
-
Data Scientist - Algorithms, Community Support USD 151K-175KCausal Inference | Language Models | Language Processing | Large Language Models | Machine LearningSenior-level Full TimeRemote - USA R22h ago
-
Data Scientist, RNA Biology USD 135K-180KAlphafold | Data Analysis | Data Curation | Machine Learning | Next-Generation SequencingHybrid work modelMid-level Full TimeSouth San Francisco, California, United States1d ago
-
Senior Applied Scientist - Search USD 200K-200KData Science | Fine Tuning | Information Retrieval | Knowledge graphs | Language Models401k retirement | Dental insurance | Equity package | Health insurance | Hybrid work scheduleSenior-level Full TimeNew York City R1d ago
-
AI Research Scientist USD 240K-350KBenchmarking | Convolutional Neural Networks | Diffusion Models | Distributed Training | Federated Learning401k match | Continuing education support | Equity options | Flexible time off | Free parkingMid-level Full TimeAustin, TX1d ago
-
Research Scientist - NLP USD 137K-258KAlgorithm Design | Data Processing | Deep learning | Language Modeling | Language Processing401k plan | Disability insurance | Employee assistance program | Life insurance | Medical/Dental/Vision insuranceMid-level Full TimeSunnyvale, CA1d ago
-
Research Scientist USD 150K-260KAPIs | Data Pipelines | Data Processing | Dataset curation | Debugging401k plan | Disability insurance | Employee assistance program | Holidays | Life insuranceMid-level Full TimeSunnyvale, CA1d ago
-
Forward Deployed AI Engineer, Operations USD 112K-300KAnalytics | C++ | Data Processing | Data Processing Pipelines | JavaDental insurance | Equity compensation | Medical insurance | Paid time off | Travel opportunitiesSenior-level Full TimeSouth San Francisco, California, USA1d ago
-
Associate Data Scientist USD 141K-202KAPIs | BigQuery | Cloud Run | Cloud Storage | Data GovernanceMid-level Full TimeUS - NJ - BIRLASOFT OFFICE, …1d ago
-
Computational Biologist I, Internal Medicine USD 42K-42KBioinformatics | Clinical Research | Containerization | Data Management | Data VisualizationOn-site childcare | Paid Parental Leave Benefit | Paid time off | Public Service Loan Forgiveness Qualified Employer | Tuition reimbursementEntry-level Full TimeTexas-Dallas-5323 Harry Hines Blvd2d ago
-
Data Scientist (Technical Leadership) USD 173K-242KAgent Orchestration | Bias Mitigation | Causal Inference | Data Analysis | Data MiningCareer growthSenior-level Full TimeSunnyvale, CA2d ago
-
AI Research Scientist, Text Data Research - MSL FAIR USD 147K-208KAgentic data | Apache Hive | Apache Spark | Data Curation | Data Scaling LawsEntry-level Full TimeMenlo Park, CA2d ago
-
Senior Business Data Scientist, AI/ML, Google Cloud USD 163K-237KAI Agents | Deep learning | Generative AI | Hugging Face | Language ModelsSenior-level Full TimeSunnyvale, CA, USA2d ago
-
Business Data Scientist, Customer Voice, Analytics USD 138K-198KClustering Algorithms | Confidence Intervals | Embedding Models | Fine Tuning | Language ModelsMid-level Full TimeMountain View, CA, USA; Chicago, IL, …2d ago
-
Staff Data Scientist, Search AI Overview USD 207K-301KData Analysis | GenAI | Machine Learning | Python | RSenior-level Full TimeMountain View, CA, USA2d ago
-
Senior-level Full TimeHouston, TX, US2d ago
-
Senior-level Full TimeNew York, NY2d ago
-
Senior Applied AI Scientist USD 182K-220KCausal Inference | Data Pipelines | Experiment design | LLM | Machine LearningSenior-level Full TimeNew York, NY2d ago
-
AI Scientist, Computational Protein Design USD 120K-240KArtificial Intelligence | Deep learning | Distributed Training | GPU Computing | Generative AIMid-level Full TimeSouth San Francisco, California, United States2d ago
-
Junior Data Scientist / Sr Analyst (0030) USD 80K-100KAI ethics | Data Analysis | Data Governance | Data Visualization | Databricks401k | Dental insurance | Life insurance | Long-term disability | Medical insuranceMid-level Full TimeMcLean, Virginia, United States2d ago
-
Senior-level Full TimeFort Meade, MD2d ago
-
Sr. Data Scientist, Performance Marketing USD 139K-287KCausal Inference | Dashboarding | ETL | Experimentation | ForecastingSenior-level Full TimeSan Francisco, CA, US; Remote, US R2d ago
-
Applied Scientist 3 USD 114K-234KAnomaly Detection | Architecture Search | Automatic Speech Recognition | C# | C++401k match | Adoption Assistance | Dental insurance | Health insurance | Life insuranceMid-level Full TimeAustin, TX, United States2d ago
-
Senior Data Scientist USD 170K-210KA/B | A/B Testing | Apache Spark | B testing | Causal Inference401k plan | Fitness and wellness programs | Flexible time off | Hybrid work | Paid parental leaveSenior-level Full TimeSan Mateo, CA, United States2d ago