Data Engineer (ML Platform & Data Foundations)
San Francisco, CA, United States
USD 156K-220K (estimate) Mid-level Full Time
Tasks
- Build data pipelines for image and video datasets
- Contribute to data and ML platform architecture decisions
- Debug and fix data pipeline and dataset issues
- Define and maintain schemas metadata and data documentation
- Enable workflow handoff from experimentation to production
- Establish best practices for data modeling and pipeline design
- Extract and organize media files and metadata
- Implement data quality checks validation and monitoring
- Ingest process validate and store visual data
- Maintain data versioning lineage and reproducibility
- Optimize data systems for performance and reliability
- Partner with ML scientists on data requirements
- Prepare datasets for model training and testing
Perks/Benefits
- N/A
Skills/Tech-stack
AWS | Data Governance | Data Lineage | Data Modeling | Data Pipelines | Data Quality | Data Versioning | Data quality monitoring | Docker | GCP | Image Processing | Linux | Python | Quality monitoring | Video Processing
Education
Roles
Regions
Countries
States
Related jobs
-
C++ | Cloud Storage | Data Analysis | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeSunnyvale, CA, USA; Seattle, WA, USA1h ago
-
Senior Software Engineer, Data USD 126K-189KAPI Development | AWS | Airflow | Docker | ETL401k plan | Commuting Stipend | Dental insurance | Dependent Care Flexible Spending Account | Employee assistance programSenior-level Full TimeSeattle, WA8h ago
-
Senior-level Full TimeNew York11h ago
-
Staff software engineer, data products USD 202K-255KBackend Development | Data Integrity | Data Modeling | Data Pipelines | Data TransformationIn office 4 days per week | Remote-friendlySenior-level Full TimeSan Francisco12h ago
-
Agentic AI Developer USD 140K-200KAgent Framework | Asynchronous programming | Autonomous Agents | Azure Cognitive | Azure Cognitive SearchMid-level Full TimeSan Francisco, CA, United States12h ago
-
Data Mining | Data labeling | Dataset creation | Deep learning | Knowledge DistillationSenior-level Full TimeFoster City, CA12h ago
-
Senior Data Engineer USD 82K-193KAgile | BigQuery | CI/CD | Data Engineering | Data PipelinesCompany-Paid Holidays | Dental insurance | Disability insurance | Employee assistance program | Life insuranceSenior-level Full TimeBridgewater, NJ, US12h ago
-
Senior-level Full TimeNashville, Tennessee, United States12h ago
-
Senior Data Engineer USD 155K-190KAmazon Redshift | Apache Airflow | Dagster | Data Compliance | Data GovernanceSenior-level Full TimeNortheast13h ago
-
Senior Data Engineer USD 155K-190KAmazon Redshift | Apache Airflow | Dagster | Data Governance | Data QualitySenior-level Full TimeSan Francisco13h ago
-
Machine Learning Engineer - ETA Team USD 137K-299KAirflow | Apache Spark | Deep learning | Experimentation | Feature Engineering401k plan employer matching | Basic life insurance | Commuter benefits match | Dental insurance | Disability insuranceMid-level Full TimeSunnyvale, CA; San Francisco, CA; Seattle, …14h ago
-
Analytics Engineer, Sentry USD 146K-194KDashboarding | Data Engineering | Data Modeling | Data Pipelines | Data QualityCompetitive benefits | Health insurance | Paid time offMid-level Full TimeIrvine, California, United States14h ago
-
Manufacturing Engineer, Analytics USD 129K-171KLean Manufacturing | Microsoft Excel | PLSQL | PowerBI | PythonHealth benefits | Recovery BenefitsMid-level Full TimeCosta Mesa, California, United States14h ago
-
Senior-level Full TimeSeattle, Washington, United States14h ago
-
Sr Engineer II, Machine Learning USD 139K-183KAWS | Bioinformatics | Cell analysis | Cloud Computing | Computer Vision401k match | Discretionary bonus | Flexible paid time off | Health insurance | Stock optionsSenior-level Full TimeSan Diego - Headquarters14h ago
-
Data Engineer USD 84K-110KAPI Development | AWS | Apache Airflow | Apache Spark | AzureHybrid work scheduleEntry-level Full TimeSan Diego, California15h ago
-
Senior Principal AI/ML Engineer - AI Program USD 172K-225KCI/CD | Cloud Computing | Computer Vision | Data Engineering | Data ScienceSenior-level Full TimeRochester, MN, United States15h ago
-
Machine Learning and AI Developer USD 85K-192KAI experiments | Agent Builder | Agentic AI | Cloud Orchestration | Cloud platformAdoption and surrogacy expense reimbursement | Community service time off | Employee resource groups | Fertility treatments | Flexible family care daysMid-level Full TimeDearborn, MI, United States15h ago
-
Senior Data Engineer, Solutions Architecture USD 110K-145KAPIs | AWS | Access Control | Amazon S3 | AnthropicAutonomous work | Company-provided meals and events | Hybrid work environmentSenior-level Full TimeSan Francisco, California, United States15h ago
-
Senior Data Engineer, Solutions Architecture USD 110K-145KAPI | AWS | Access Control | Airflow | AzureCompany-provided meals | Events | Hybrid work environmentSenior-level Full TimeSan Diego, California, United States15h ago
-
Senior Data Engineer, Solutions Architecture USD 110K-145KAPI | AWS | Access Control | Airflow | AzureAutonomy | Hybrid work environment | Opportunity for impactSenior-level Full TimeScottsdale, Arizona, United States15h ago
-
Senior Data Engineer USD 128K-161KAPI Integration | Apache Spark | Azure Data | Azure Data Factory | Azure Data Lake401k match | Hybrid work schedule | Paid time off | Parental leaveSenior-level Full TimeBloomington, MN, United States15h ago
-
Senior AI Data Engineer USD 160K-200KAWS | AWS Athena | AWS Glue | AWS Lambda | Amazon Redshift401k matching | Dental insurance | Disability insurance | Life insurance | Medical insuranceSenior-level Full TimeSan Diego, California, United States R15h ago
-
Causal Inference | Classification | Clustering | Data Warehousing | Experiment designFlexible PTO | Home office stipend | Learning budget | Paid health, dental, vision | Parental leaveSenior-level Full TimeBoston or Remote R16h ago
-
Senior Partner Engineer, Databricks Alliance USD 170K-240KBusiness Intelligence | Cloud Data | Cloud Data Platforms | Data Engineering | Data Warehousing401k | Commuter benefits | Dog-friendly office | Equity | FSA benefitsSenior-level Full TimeSan Francisco, CA16h ago