AI Data Infrastructure Engineer
United States - Remote
USD 146K-189K (estimate) | Mid-level | Full Time
Tasks
- Build evaluation dataset construction pipelines with integrity controls
- Build high-throughput data loading to maximize GPU utilization
- Build ingestion systems for multimodal data
- Design large-scale data pipelines for AI training and evaluation
- Design storage architectures across data tiers
- Develop dataset versioning, lineage, and provenance tracking
- Document data systems, schemas, and operational procedures
- Drive observability for data quality, drift, and pipeline health
- Implement data cleaning, deduplication, filtering, and quality assurance
- Implement data privacy, redaction, and consent enforcement
- Implement labeling workflows, active learning, and human-in-the-loop processes
- Optimize cost and performance with compression, caching, and format selection
Skills/Tech-stack
Apache Beam | CI/CD | Code Review | Data Lineage | Data Modeling | Data Privacy | Data Quality | Data Versioning | Data Redaction | Dataset Integrity | Distributed Systems | ETL | GPU Utilization | Java | Observability | Python | Ray | Scala | Spark | Storage Architecture | Testing
Education
Bachelor of Engineering | Bachelor of Science | Master of Science