Applied ML Engineer, Data
Tasks
- Build data pipelines for video generation models
- Build internal tools and automation for dataset preparation and monitoring
- Collect relevant video data
- Design annotation tasks and quality control
- Develop preprocessing, filtering, and parsing workflows
- Drive pipeline projects end-to-end
- Improve data quality across the pipeline
- Optimize preprocessing within time and cost constraints
- Orchestrate annotation workflows
- Prepare and deliver datasets to training clusters
- Prepare training samples
- Profile and optimize inference scripts for preprocessing
- Train and evaluate supporting models for data filtering and quality assessment
- Validate labels
Perks/Benefits
- 401k retirement plan
- Company equity
- Company holidays
- Dental insurance
- Fertility support
- Lifestyle spending account
- Lunch and snacks
- Medical insurance
- One Medical membership
- Paid time off
- Parental leave
- Sick days
- Vision insurance
Skills/Tech-stack
AWS S3 | Amazon DynamoDB | Annotation Workflows | Data Filtering | Data Parsing | Data Pipelines | Data Preprocessing | Data Processing | Dataset curation | Deep learning | Distributed Systems | Distributed data | Distributed data processing | Inference Optimization | Kubernetes | Label Quality Assurance | Machine Learning | PyTorch | Python | Quality Assurance