AI Data Infrastructure Engineer
Tasks
- Build evaluation dataset construction pipelines with integrity controls
- Build high-throughput data loading systems for GPU utilization
- Build ingestion systems for multimodal data
- Design and operate large-scale data pipelines for AI training and evaluation
- Design storage architectures for cost, throughput, and latency
- Develop dataset versioning, lineage, and provenance tracking
- Document data systems, schemas, and operational procedures
- Drive observability for data quality, drift, and pipeline health
- Implement data cleaning, deduplication, filtering, and quality assurance
- Implement data privacy, redaction, and consent enforcement
- Implement labeling workflows, active learning, and human-in-the-loop systems
- Optimize cost and performance through compression, format selection, and caching
Perks/Benefits
- N/A
Skills/Tech-stack
Apache Beam | CI/CD | Caching | Code review | Compression | Data Governance | Data Lineage | Data Modeling | Data Privacy | Data Quality | Data Storage | Data Versioning | Distributed Systems | GPU Utilization | Java | Python | Ray | Redaction | Scala | Spark | Testing
Education
Bachelor of Engineering | Bachelor of Science | Master of Science