AI Data Infrastructure Engineer
Tasks
- Build high throughput data loading for GPU utilization
- Build ingestion systems for multimodal data
- Collaborate with ML researchers and engineers
- Construct evaluation datasets with integrity controls
- Design data pipelines for AI training and evaluation
- Design storage architectures for data tiers
- Develop dataset versioning and lineage tracking
- Document data systems schemas and procedures
- Drive data observability for quality and drift
- Implement data cleaning and quality assurance
- Implement data privacy redaction and consent enforcement
- Implement labeling workflows and active learning
- Optimize data cost and performance with compression and caching
Perks/Benefits
- N/A
Skills/Tech-stack
Active Learning | Apache Beam | CI/CD | Caching | Code review | Compression | Data Drift | Data Lineage | Data Modeling | Data Observability | Data Privacy | Data Quality | Data Storage | Data loading | Data provenance | Data redaction | Dataset versioning | Distributed Systems | GPU Utilization | High Throughput | High Throughput Data Loading | High-throughput data | Human-in-the-loop | Java | Machine Learning | Multimodal Data | Python | Ray | Scala | Spark | Testing | The Loop
Education
Related jobs
-
Sr. Embedded Software Engineer USD 100K-130KARM Cortex | ARM Cortex A | Assembly | Bash | Buildroot401k match | Career growth and professional development opportunities | Employee assistance program | Low-cost medical dental vision | Paid HolidaysSenior-level Full TimeRemote (United States) R17h ago
-
Data & AI Platform Engineer USD 95K-155KAI Search | APIs | AWS | Airflow | ArcGIS401k matching | Dental insurance | Health insurance | Life insurance | Paid HolidaysSenior-level Full TimeRemote, United States R19h ago
-
Sr Data Engineer USD 100K-120KAPIs | AWS | AWS Glue | Airflow | Amazon RedshiftFully remote | Mentorship | On-call supportSenior-level Full TimeOrlando, FL, United States R19h ago
-
Director, AI Architect USD 230K-287KAWS | Agentic Workflows | Artificial Intelligence | Azure | Bias MitigationHealthcare coverage | Lifelong membership | Parental leave | Retirement savings match | Wellness stipendSenior-level Full TimeSan Francisco - Hybrid R20h ago
-
Staff Machine Learning Systems Engineer (MLOps) USD 210K-250KAWS EKS | Alerting | Autoscaling | CI/CD | ClickHouseFlexible remote work | Healthcare industry domain experienceSenior-level Full TimeUS Remote R22h ago
-
Senior Data Engineer USD 140K-170KData Governance | Data Management | Data Modeling | Data Quality | Data sharesComprehensive benefits package | Flexible hybrid schedule | Opportunities for promotion | Supportive work community | Training and developmentSenior-level Full TimeNew York, New York, United States; … R22h ago
-
Senior Applied AI Engineer / Forward Deployed Engineer USD 150K-170KAI Foundry | AI Search | API Integration | Azure AI | Azure AI Foundry401k matching | Career growth | Dental insurance | Disability insurance | Fully remote workSenior-level Full TimeMinneapolis, MN, United States R1d ago
-
Edge AI Engineer USD 100K-150KC++ | Core ML | Device deployment | Embedded Systems | Federated LearningRemote workSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAccelerators | Computer Vision | Data Quality | Data labeling | Data quality monitoringRemote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KApache Beam | Apache Spark | CI/CD | Caching | Code review100 percent remote work | Career growth opportunities | H1B transfer support for qualified candidates | Long term multi year engagementMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapter | Attention Optimization | DPO | Distributed Training | Evaluation benchmarksMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering Architect USD 100K-150KAPIs | Agentic Workflows | Embeddings | Evaluation Frameworks | Fine TuningSenior-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 100K-150KBehavior Trees | C++ | Computer Vision | Concurrent programming | Control SystemsCareer growth potential | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Senior Data Engineer USD 72K-156KDAX | Data Governance | Data Quality | Databricks | Databricks Lakehouse401k company match | Associate discounts | Dental insurance | Health insurance | Life insuranceSenior-level Full TimeRemote, United States R1d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Compiler optimization | Continuous batching | Deep learningBenefits | Career growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Storage Engineer (NetApp / Pure / Ceph) USD 100K-150KAnsible | CRUSH maps | CSI | Capacity Planning | CephRemote workSenior-level Full TimeUnited States - Remote R1d ago
-
Storage Engineer (NetApp / Pure / Ceph) USD 100K-150KAnsible | Automation | CRUSH maps | CSI | Capacity PlanningRemote workSenior-level Full TimeUnited States - Remote R1d ago
-
Senior AI/ML Engineer USD 125K-188KAWS | AWS Architecture Patterns | AWS CDK | AWS Lambda | AWS architecture401k matching | Dental insurance | Health savings account | Medical insurance | Online trainingSenior-level Full TimeHerndon, Virginia, United States R1d ago
-
Sr Software Engineer, MLOps USD 150K-180KCI/CD | Cloud Monitoring | DVC | Dataset versioning | Deployment Automation24/7 medical hotline | 401k employer match | Employee discounts | Employee resource groups | Flexible paid time awaySenior-level Full TimeVIRTUAL, WA, US, 00000 R1d ago
-
Analytics Engineer USD 147K-225KApache Airflow | BigQuery | DBT | Databricks | Python401k | Comprehensive benefits | Equity | Flexible time offSenior-level Full TimeUS Remote, San Francisco, CA; New … R1d ago
-
Staff Data & Machine Learning Engineer USD 118K-136KDBT | Data Architecture | Data Governance | Data Quality | Data Streaming401k match | Dental insurance | Family planning resources | Flexible vacation | Fully remoteSenior-level Full TimeRemote - USA R1d ago
-
Senior AI Engineer, Real-World Data USD 125K-175KAI orchestration | AWS | AWS Fargate | AWS Lambda | Agile deliverySenior-level Full TimeUS Remote R1d ago
-
Staff Data Platform Engineer USD 210K-240KAuditing | Azure Event | Azure Event Hubs | Batch Processing | CI/CDHealth plan subsidies | Paid global offsites | Remote-first work culture | WFH office reimbursementSenior-level Full TimeRemote - US R1d ago
-
A/B | A/B Testing | B testing | C++ | Cloud Computing401k employer match | Family planning support | Flexible vacation | Gender-affirming care | Healthcare benefitsSenior-level Full TimeRemote - United States R1d ago
-
Computer Vision | Data collection | Deep learning | Fine Tuning | Generative ModelingEntry-level Full TimeSan Francisco, CA, US; Remote, US R1d ago