AI Data Engineer
United States - Remote
R
USD 100K-150K (estimate) Mid-level Full Time
Tasks
- Build evaluation dataset construction pipelines with integrity and contamination controls
- Build high throughput data loading systems for GPU utilization
- Build ingestion systems for text image audio video and structured signals
- Design and operate large scale data pipelines for AI training and evaluation
- Design storage architectures balancing cost throughput and latency
- Develop dataset versioning lineage and provenance tracking
- Document data systems schemas and operational procedures
- Drive observability of data quality drift and pipeline health
- Implement data cleaning deduplication filtering and quality assurance
- Implement data privacy redaction and consent enforcement
- Implement labeling workflows active learning and human in the loop improvement
- Optimize cost and performance with compression format selection and caching
Perks/Benefits
Skills/Tech-stack
Active Learning | Apache Beam | Apache Spark | CI/CD | Caching | Code review | Compression | Data Lineage | Data Modeling | Data Privacy | Data Quality | Data redaction | Dataset versioning | Distributed Systems | GPU Utilization | Human-in-the-loop | JVM | Observability | Provenance tracking | Python | Ray | Storage Formats | Testing | The Loop
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Roles
Related jobs
-
IT Data Engineer USD 117K-124KANSI X12 | ARM Templates | Automl | Azure | Azure DataBackground screening provided | Flexible work hours | Mentorship | Professional development opportunitiesSenior-level Full TimeUS - Remote R12h ago
-
Analytics Engineer USD 147K-225KApache Airflow | BigQuery | DBT | Data Modeling | Data Visualization401k | Comprehensive benefits | Equity | Flexible time offSenior-level Full TimeUS Remote, Los Angeles, CA; San … R21h ago
-
Autonomy | C++ | CPU GPU | CPU GPU Debugging | Critical Systems401k | Health insurance | Paid Company Holidays | Paid time off | Phone stipendSenior-level Full TimeSan Carlos - Hybrid R23h ago
-
Autonomy | C++ | Data Ingestion | Data Ingestion Pipelines | Deployment401k | Health insurance | Paid Holidays | Paid time off | Phone stipendMid-level Full TimeSan Carlos - Hybrid R23h ago
-
Agent Orchestration | Airflow | Argo Workflows | Artifact versioning | Autonomous workflowsRemote work flexibilitySenior-level Full TimeRemote - United States R1d ago
-
Senior Databricks Engineer USD 180K-247KAWS | Autoscaling | Azure | CI/CD | CachingVisa sponsorshipSenior-level Full TimeCanada R1d ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Agentic Systems | Data Quality | Data labeling100 percent remote work | Career growth opportunities | Visa transfer support for qualified candidatesMid-level Full TimeUnited States - Remote R1d ago
-
Hadoop Big Data Developer USD 100K-150KAWS EMR | Airflow | Apache Atlas | Apache Flink | Apache HBaseLong-term career growth | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Hadoop Big Data Developer USD 100K-150KAWS EMR | Airflow | Apache Atlas | Apache Flink | Apache HiveCareer growth | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Mid-level Full TimeUnited States - Remote R1d ago
-
Software Engineer (Remote) USD 50K-130KAgile | ETL | Git | Google BigQuery | Google DataformRemote workEntry-level Full TimeTEXAS - VIRTUAL - TX01, United … R1d ago
-
Sr Staff - Data Platform Engineer USD 220K-255KAWS EMR | AWS Lambda | AWS S3 | Airflow | Apache HudiDental insurance | Disability insurance | Flexible spending account | Health insurance | Health savings accountSenior-level Full TimeCalifornia - Remote Office, United States R1d ago
-
LLM Engineer USD 100K-150KAdapter-Tuning | Automated benchmarking | DPO | Dataset curation | Direct Preference OptimizationCareer growth potential | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Data & Integrations Engineer USD 100K-100KAPI Gateway | API Management | AWS API | AWS API Gateway | AWS LambdaCareer development | Fully remoteSenior-level Full TimeRemote Ohio, United States R1d ago
-
Senior-level Full TimeCanada R1d ago
-
Sr. AI Engineer (Applied AI & ML Systems) USD 132K-165KAgentic AI | Context engineering | Continuous Improvement | Data Engineering | Data PipelinesE learning license | Hackathons | Healthcare benefits | Home office setup allowance | Identity theft protectionSenior-level Full TimeUnited States R1d ago
-
Data Engineer USD 95K-140KApache Spark | Automated testing | Azure Databricks | CI/CD | Data ModelingMid-level Full TimeUS Remote R1d ago
-
Senior Analytics Engineer USD 180K-208KBigQuery | Cube | Dashboards | Data Modeling | Data orchestration401k with payroll match | Dental vision and mental health care | Employer sponsored medical care | Equity | Flexible PTOSenior-level Full TimeSan Francisco R1d ago
-
A/B | A/B Testing | AWS | Airflow | Amazon Redshift401k matching | Employee assistance program | Flexible time off | Flexible work arrangement | Paid HolidaysMid-level Full TimeRemote, US R1d ago
-
Associate AI Engineer USD 144K-180K.NET | APIs | ASPNet | AWS | Azure401k matching | Dental insurance | Hybrid work model | Medical insurance | Paid time offMid-level Full TimeIrving, TX R1d ago
-
Agentic AI Engineer USD 130K-170KAgentic AI | Concurrency | Context engineering | Data Compression | Data IngestionCareer growth | Health and well-being programs | Remote work | Supportive teamMid-level Full TimeRemote - United States R1d ago
-
Data Engineer-Secret Clearance Required USD 100K-127KAWS | AWS Glue | AWS Redshift | Azure | Azure Data401k match | Bereavement leave | Disability insurance | Employee assistance program | Employee discount programSenior-level Full TimeRemote - Nationwide, United States R1d ago
-
AI Foundry | AKS | ARM | Agent 365 | Agentic AI401k plan with company matching | Bereavement leave | Employee assistance program | Employee discount program | Health, dental, and vision careSenior-level Full TimeNew York, NY, United States R1d ago
-
Senior Healthcare Data Engineer USD 104K-199KConditional Aggregation | Data Modeling | Data Quality | Data Reconciliation | Data Validation401k matching | Employee assistance program | Family building benefits | Flexible spending accounts | HolidaysSenior-level Full TimeSeattle, Washington, United States R1d ago
-
Machine Learning Engineer, Data Mining USD 144K-192KActive Learning | Batch inference | CI/CD | Data Augmentation | Data Curation401k match | Dental insurance | Health savings account | Life insurance | Medical insuranceSenior-level Full TimePittsburgh, Pennsylvania, United States; Remote U.S. R1d ago