Research Engineer, Post-Training - Meta Superintelligence Labs
Tasks
- Build data processing pipelines
- Design data collection pipelines
- Develop agentic environment systems
- Ensure dataset safety
- Filter synthetic data at scale
- Generate synthetic training data
- Ingest and prepare high quality datasets
- Measure dataset quality
- Perform data deduplication
- Run contamination checking
- Scale post training data workflows
- Securely source datasets from vendors
Perks/Benefits
- N/A
Skills/Tech-stack
Code review | Contamination Checking | Data Generation | Data Ingestion | Data Pipelines | Data Processing | Data Quality | Data Quality Filtering | Deduplication | Distributed Systems | Language Models | Language Processing | Large Scale Data | Large-scale | Large-scale Data Processing | Machine Learning | Natural Language | Natural Language Processing | PyTorch | Python | RLHF | SFT | Synthetic Data Generation | Synthetic data | Testing | Version control
Education
N/A
Roles
Regions
Countries
States
Cities
Related jobs
-
Software Engineer/Researcher, AI-Native Database Systems USD 156K-387KDatabase Storage | Database Storage Engine | Distributed Systems | Embedding Ingestion | IndexingSenior-level Full TimeSan Jose, California, United States2h ago
-
Partner Engineering GenAI - US USD 140K-203KAPIs | Agent Orchestration | Bias Mitigation | C++ | Cloud ComputingSenior-level Full TimeMenlo Park, CA | Seattle, WA …2h ago
-
Software Engineer - Language (Technical Leadership) USD 213K-293KASR | Automatic Speech Recognition | C++ | Conversational AI | Deep learningSenior-level Full TimeMenlo Park, CA | Seattle, WA …2h ago
-
Machine Learning Hardware Research Scientist USD 91K-145KASIC design | Architecture Search | Bias Mitigation | C# | C++Entry-level Full TimeSunnyvale, CA2h ago
-
Code review | Data Deduplication | Data Quality | Data Quality Filtering | Data pipelineEntry-level Full TimeMenlo Park, CA2h ago
-
Data Engineer, BPE USD 195K-235KData Modeling | Data Visualization | Data Warehousing | Data integration | Dimensional ModelingMid-level Full TimeMenlo Park, CA2h ago
-
Data Engineer USD 199K-235KCompliance | Data Modeling | Data Quality | Data Visualization | Data WarehousingSenior-level Full TimeMenlo Park, CA2h ago
-
Data Engineer, Product Analytics USD 192K-235KBig Data | Data Store | Data Warehouse | Dimensional Modeling | ETLEntry-level Full TimeNew York, NY2h ago
-
Data Engineer, Analytics USD 205K-235KBig Data | Data Governance | Data Modeling | Data Quality | Data SecurityEntry-level Full TimeSunnyvale, CA2h ago
-
Software Engineer III, AI/ML, Search Ads USD 147K-211KC++ | Data Storage | Data Structures | Data Structures and Algorithms | Deep learningSenior-level Full TimeMountain View, CA, USA; New York, …3h ago
-
Site Reliability Engineer, ML Compute SRE USD 147K-211KAlgorithms | Automation | Cause analysis | Data Structures | DebuggingMid-level Full TimeRaleigh, NC, USA; Durham, NC, USA3h ago
-
Software Engineer III, Flume ML USD 147K-211KBackend Development | C++ | Colossus | Data Processing | Data StorageSenior-level Full TimeSunnyvale, CA, USA3h ago
-
Algorithms | Artificial Intelligence | Code review | Data Storage | Data StructuresSenior-level Full TimeMountain View, CA, USA3h ago
-
Software Engineer III, AI/ML, Search USD 147K-211KAI content | AI content safety | Application Security | C++ | Content SafetySenior-level Full TimeMountain View, CA, USA; Chicago, IL, …3h ago
-
Research Software Engineer, Gen AI/LLM Development USD 174K-252K3D Modeling | Agent reasoning | Data Structures | Data Structures and Algorithms | Diffusion ModelsMid-level Full TimeMountain View, CA, USA3h ago
-
Staff Site Reliability Engineer, DeepMind SRE USD 207K-300KAutomation | C# | C++ | Capacity Planning | Distributed SystemsSenior-level Full TimePittsburgh, PA, USA3h ago
-
Algorithms | C++ | Data Structures | Distributed Systems | GoSenior-level Full TimeMountain View, CA, USA3h ago
-
Senior Software Engineer, AI/ML, Search USD 174K-252KAI content | AI content safety | Accessibility | Algorithms | Artificial IntelligenceSenior-level Full TimeMountain View, CA, USA; Cambridge, MA, …3h ago
-
Autocuration | Data Processing | Data Storage | Debugging | Distributed ComputingSenior-level Full TimeMountain View, CA, USA3h ago
-
RTL Design Engineer, Machine Learning USD 163K-237KASIC | Code review | High Performance | High-Performance Design | Low powerSenior-level Full TimeSunnyvale, CA, USA3h ago
-
C++ | CSS | Dashboards | Data Infrastructure | Data WarehousingMid-level Full TimeBoulder, CO, USA; Atlanta, GA, USA3h ago
-
Cloudflare | Docker | Event Processing | Go | JavaScriptHigh ownership culture | Remote work flexibility | Startup environmentSenior-level Full TimeRemote, US R7h ago
-
Staff Machine Learning Engineer USD 154K-231KData Pipelines | Feature Engineering | Machine Learning | Model Evaluation | NumPyOccasional travelSenior-level Full Time#, CA, US, #8h ago
-
Sr. Solutions Engineer USD 152K-209KAccount Management | Amazon Web Services | Artificial Intelligence | Azure | Big DataSenior-level Full TimeBoston, Massachusetts9h ago
-
Senior Data Warehouse Engineer USD 121K-170KData Ingestion | Data Modeling | Data Quality | Data Warehousing | Data orchestration401k | Dental insurance | Disability insurance | Employee assistance program | Employee recognitionSenior-level Full TimeDraper, Utah, United States; San Jose, …9h ago