Senior Research Data Engineer (US)
Tasks
- Automate data quality filtering and synthesis
- Bridge semantics with AI research needs
- Build reusable silver to gold pipelines
- Curate datasets across modalities
- Maintain dataset versioning and lineage
- Own gold data layer
- Reverse engineer data semantics
- Support point in time correct features for ML and AI
- Transform silver tables into curated gold datasets
Perks/Benefits
- N/A
Skills/Tech-stack
Airflow | Dagster | Data Drift | Data Generation | Data Lake | Data Wrangling | Databricks | Deidentification | Delta Lake | Duplicate Detection | Embeddings | Feature Engineering | Feature Store | Generative AI | Git | HIPAA | Hugging Face | Hugging Face Datasets | LLM | LSH | MLflow | MinHash | Near-duplicate detection | Parquet | Prefect | PySpark | Python | RAG | SQL | Spark | Synthetic Data Generation | Synthetic data | Tokenization | Train Validation Test | Train validation test split | Unity Catalog | Weak Supervision
Education
N/A
Related jobs
-
Data Scientist / ML Engineer USD 170K-210KAWS | Azure | Bias Evaluation | Cloud Computing | Cloud platformFlexible working hours | Remote workSenior-level Full TimeNew York, NY, US, Remote R9h ago
-
Associate Data Solutions Engineer USD 95K-162KAI workflows | Apache Iceberg | DBT | Data Governance | Data Modeling401k with employer match | Advancement opportunities | Flexible spending account | Health benefits package | Long-term disability insuranceMid-level Full TimeUS-Remote R16h ago
-
Early-Career Network Engineer (RAN Optimization) USD 82K-128K4G | 5G | Automation | C Band | CBRSEducational assistance | Matching gifts | Paid sick time | Paid vacation | Parental leaveMid-level Full TimePlano,Texas,United States R16h ago
-
Artificial Intelligence/Machine Learning Engineer USD 119K-200KAKS | Anomaly Detection | Artificial Intelligence | Azure Data | Azure Data FactoryHybrid work | Remote workSenior-level Full TimeAustin, TX, United States R17h ago
-
Applied AI Engineer - AI Solutions USD 172K-300KAgentic Workflows | Airflow | Apache Spark | Chroma | CrewAIAnnual travel up to 25% | Employee stock options | Hybrid work | Professional developmentMid-level Full TimeNew York City, NY (Hybrid); Redwood … R1d ago
-
Product Analytics Engineer USD 130K-140KA/B | A/B Testing | Airflow | B testing | DBT401k retirement savings plan | Employer-sponsored healthcare | Flexible spending account | Health savings account | Paid parental leaveSenior-level Full TimeRemote, USA R1d ago
-
Edge AI Engineer USD 100K-150KC++ | Core ML | DSP | Embedded Systems | Federated LearningCareer growth | H1B transfer support | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Senior-level Full TimeUnited States - Remote R1d ago
-
Senior-level Full TimeUnited States - Remote R1d ago
-
A2A protocols | API Integration | Agent Orchestration | Agentic Systems | AuthenticationRemote work | Training and support opportunitiesSenior-level Full TimeRemote - USA, United States R1d ago
-
AI Research Engineer USD 100K-150KAccelerator hardware | Agentic Systems | Computer Vision | Data Quality | Data quality monitoringMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Computer Vision | Data Quality | Data quality monitoringCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer USD 100K-150KAccelerator hardware | Computer Vision | Data Quality | Deep learning | Distributed TrainingBenefits package | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Hadoop Big Data Developer USD 100K-150KAWS EMR | Airflow | Apache Atlas | Apache Flink | Apache HBaseBenefits | Full-time W2 employment | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Hadoop Big Data Developer USD 100K-150KAirflow | Apache Atlas | Apache Flink | Apache Hive | Apache HudiCareer growth | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Mid-level Full TimeUnited States - Remote R1d ago
-
Principal Data Engineer USD 151K-220KAWS | Cloud Computing | Data Governance | Data Management | Data Modeling401k matching | Business resource groups | Dental insurance | Family and medical leave | Health insuranceSenior-level Full TimeKS Remote, United States R1d ago
-
Mid-level Full TimeUnited States - Remote R1d ago
-
LLM Engineer USD 100K-150KAdapter-Tuning | Direct Preference Optimization | Efficient Attention | Evaluation methodology | FSDPMid-level Full TimeUnited States - Remote R1d ago
-
LLM Engineer USD 100K-150KAdapters | DeepSpeed ZeRO | Direct Preference Optimization | Efficient Attention | FSDPMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineer USD 100K-150KAgent architecture | Chunking | Embeddings | Evaluation Frameworks | Fine TuningMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering USD 100K-150KAgentic Workflows | Chunking | Design Patterns | Deterministic systems | EmbeddingsRemote workMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineer USD 100K-150KAgent architecture | Agent systems | Chunking | Embeddings | EvaluationMid-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 100K-150KBehavior Trees | C++ | Computer Vision | Concurrent programming | Embedded SystemsCareer growth | Mentorship | Remote work | Technical documentation supportMid-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 100K-150KBehavior Trees | C++ | Computer Vision | Concurrent programming | Control SystemsMid-level Full TimeUnited States - Remote R1d ago