AI Data Engineer
Tasks
- Build evaluation dataset construction pipelines with integrity controls
- Build high-throughput data loading systems for GPU utilization
- Build ingestion systems for text, image, audio, video, and structured signals
- Design and operate large-scale AI data pipelines
- Design storage architectures for cost, throughput, and latency
- Develop dataset versioning, lineage, and provenance tracking
- Document data systems, schemas, and operational procedures
- Drive observability of data quality, drift, and pipeline health
- Implement data cleaning, deduplication, filtering, and quality assurance
- Implement data privacy, redaction, and consent enforcement
- Implement labeling workflows and active learning pipelines
- Optimize cost and performance using compression, format selection, and caching
Perks/Benefits
Skills/Tech-stack
Apache Beam | CI/CD | Code review | Data Lineage | Data Modeling | Data Privacy | Data Quality | Dataset versioning | Distributed Systems | Java | Python | Ray | Redaction | Scala | Spark | Testing
Education
Roles