AI Data Infrastructure Engineer
Tasks
- Build evaluation dataset construction pipelines
- Build high-throughput data loading for GPU utilization
- Build ingestion systems for multimodal data
- Design and operate large-scale AI data pipelines
- Design storage architectures balancing cost, throughput, and latency
- Develop dataset versioning, lineage, and provenance tracking
- Document data systems, schemas, and operational procedures
- Drive observability of data quality, drift, and pipeline health
- Implement data cleaning, deduplication, filtering, and quality assurance
- Implement data privacy, redaction, and consent enforcement
- Implement labeling workflows and active learning pipelines
- Optimize cost and performance with compression formats and caching
Skills/Tech-stack
Apache Beam | CI/CD | Code review | Data Lineage | Data Modeling | Data Privacy | Data Quality | Data Storage | Dataset versioning | Distributed Systems | Java | Machine Learning | Observability | Python | Ray | Scala | Spark | Testing