Member of Technical Staff, Pre-training Data
Tasks
- Build large-scale web crawling pipelines
- Collaborate on corpus strategy
- Design filtering and deduplication systems
- Improve data pipeline observability and reliability
- Manage data quality and versioning
- Optimize distributed data processing
- Run data ablation experiments
Perks/Benefits
- 401k match
- Health, dental, vision insurance
- Relocation stipend
- Unlimited paid time off
- Visa sponsorship
Skills/Tech-stack
Data Deduplication | Data Filtering | Data Processing | Data Systems | Data pipeline | Data pipeline optimization | Distributed data | Distributed data systems | Experiment design | Pipeline Optimization | Scalability | Software Engineering | System Reliability
Education
Regions
Countries
States
Related jobs
-
Ad Ranking | Algorithms | C++ | Data Processing | Data StructuresSenior-level Full TimeMountain View, CA, USA16h ago
-
Senior Software Engineer, AI/ML GenAI, Core USD 174K-252KAlgorithms | C++ | Computer Vision | Data Processing | Data StructuresSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA16h ago
-
Software Engineer III, AI/ML, Core USD 147K-211KAlgorithms | Data Processing | Data Storage | Data Structures | DebuggingSenior-level Full TimeSunnyvale, CA, USA16h ago
-
Software Engineer III, AI/ML, Google Workspace USD 147K-211KC++ | Data Processing | Debugging | Language Processing | ML InfrastructureSenior-level Full TimeBoulder, CO, USA16h ago
-
Principal Software Engineer - Robotics & Drones USD 170K-200KAPIs | Accelerators | CPU | Camera Signal Processing | Cloud DataSenior-level Full TimeBoston, MA - USA, United States1d ago
-
Distinguished Software Engineer, Data Infrastructure USD 248K-406KAI | Batch Processing | Data Infrastructure | Data Privacy | Data ProcessingExecutive-level Full TimeMountain View, CA, United States1d ago
-
Sr. Staff Software Engineer, Data Product Platform USD 208K-429KAI coding | AI coding tools | AI-assisted analytics | Apache Airflow | Apache SupersetEquity | Flexible work modelSenior-level Full TimeSan Francisco, CA, US; Remote, US R1d ago
-
Software Engineer III, AI/ML, Google Cloud AI USD 147K-211KC++ | Data Processing | Debugging | GenAI | Google CloudSenior-level Full TimeSunnyvale, CA, USA1d ago
-
Staff Software Engineer, On-Device Machine Learning USD 207K-300KAndroid | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingSenior-level Full TimeSunnyvale, CA, USA1d ago
-
Software Engineer, Next Generation AI/ML Infrastructure USD 147K-211KC++ | Data Processing | Data Storage | Distributed Processing | Feature StoresMid-level Full TimeSunnyvale, CA, USA1d ago
-
Senior Software Engineer, Connect Sales, CRM, GenAI, Ads USD 174K-252KComputer Vision | Data Processing | Debugging | Distributed Computing | Generative AISenior-level Full TimeMountain View, CA, USA1d ago
-
Analytics | Concurrency | Containerization | Core Java | Data pipeline401k plan | Commuter benefits | Disability benefits | Life insurance | Paid time offSenior-level Full Time112265-NJ-MetroPark, Iselin, United States2d ago
-
Benchmarking | C plus plus | CUDA | Code optimization | Data ProcessingSenior-level Full TimeUS, CA, Santa Clara, United States2d ago
-
Information Technologist I USD 119K-175KBusiness Process | Business process automation | Cloud Data | Cloud Data Preparation | Data PreparationFlexible work environment | Remote-friendly work environmentSenior-level Full TimeMichigan, East Lansing2d ago
-
Senior Software Engineer, AI/ML GenAI USD 174K-252KC++ | Capacity Management | Cloud platform | Computer Vision | Data ProcessingSenior-level Full TimeSunnyvale, CA, USA2d ago
-
C++ | Data Processing | Debugging | Generative AI | Language ModelsSenior-level Full TimeMountain View, CA, USA2d ago
-
C++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingSenior-level Full TimeMountain View, CA, USA2d ago
-
Staff Software Engineer, Generative AI, Core ML USD 207K-300KAI Feedback | Computer Vision | Data Processing | Deep learning | Digital TwinSenior-level Full TimeMountain View, CA, USA2d ago
-
Software Engineer III, AI/ML GenAI, YouTube USD 147K-211KC++ | Computer Vision | Data Processing | Debugging | Distributed ComputingSenior-level Full TimeMountain View, CA, USA2d ago
-
Senior Software Engineer, Generative AI, Search Health USD 174K-252KA/B | A/B Testing | B testing | Data Analysis | Data MiningSenior-level Full TimeMountain View, CA, USA2d ago
-
Software Engineer III, AI/ML, Google Ads USD 147K-211KAlgorithms | C++ | Data Processing | Data Structures | DebuggingSenior-level Full TimeMountain View, CA, USA; Los Angeles, …2d ago
-
Staff Software Engineer, AI/ML GenAI, Google Cloud USD 207K-300KComputer Vision | Data Processing | Debugging | Distributed Computing | Fine TuningSenior-level Full TimeSunnyvale, CA, USA; San Francisco, CA, …2d ago
-
Cloud Computing | Cloud TPU | Cloud platform | Data Processing | DebuggingSenior-level Full TimeKirkland, WA, USA; Seattle, WA, USA2d ago
-
Lead Software Engineer – Development for AI Applications USD 128K-215KAWS | Auditing | Azure | Azure Cosmos | Azure Cosmos DB401k plan | Adoption reimbursement | Disability benefits | Employee assistance programs | Employee discountsSenior-level Full TimeUSA:TX:Dallas / Two AT&T Plaza (211 …3d ago
-
Software Engineer, Data Infrastructure USD 153K-376KAI systems | Access Control | Apache Airflow | Apache Flink | Apache KafkaCell phone reimbursement | Company recharge days | Generous PTO | Learning and development stipend | Mental health and wellness benefitsMid-level Full TimeSan Francisco, CA • New York, … R3d ago