AI Data Infrastructure Engineer
Tasks
- Build evaluation dataset construction with integrity and contamination controls
- Build high throughput data loading systems for GPU utilization
- Build ingestion systems for multimodal data
- Collaborate with ML researchers and engineers on data needs
- Design and operate large scale AI data pipelines
- Design storage architectures for latency throughput and cost
- Develop dataset versioning lineage and provenance tracking
- Document data systems schemas and operational procedures
- Drive observability of data quality drift and pipeline health
- Implement data cleaning deduplication filtering and quality assurance
- Implement data privacy redaction and consent enforcement
- Implement labeling workflows active learning and human in the loop improvement
- Optimize cost and performance with compression caching and format selection
Perks/Benefits
- N/A
Skills/Tech-stack
Active Learning | Apache Beam | CI/CD | Caching | Code review | Compression | Data Lineage | Data Modeling | Data Privacy | Data Quality | Data Storage | Data Versioning | Data provenance | Data redaction | Dataset evaluation | Distributed Systems | GPU Utilization | Human-in-the-loop | Java | Python | Ray | Scala | Spark | Testing | The Loop
Education
Related jobs
-
Principal Software Engineer (AI/ML Architect-Engineer) USD 163K-270KAWS | Adversarial Networks | CI/CD | Diffusion Models | Generative AISenior-level Full TimeSeattle,WA,United States R14h ago
-
Senior Data Engineers USD 123K-215KAnsible | Cassandra | Couchbase | Data Modeling | Data QualityCareer development and training | Company Matched Retirement Savings Plan | Confidential counseling | Financial coaching | Free medical dental vision life insurance disability benefitsSenior-level Full TimeNew York, NY, United States R23h ago
-
Analytics Engineer USD 164K-229KAirflow | Apache Spark | Data Governance | Data Modeling | Data Visualization401k match | Caregiving support | Family planning support | Flexible vacation | Gender-affirming careSenior-level Full TimeRemote - United States R1d ago
-
Senior Analytics Engineer USD 190K-267KAirflow | Apache Spark | D3.js | Data Governance | Data Modeling401k employer match | Flexible vacation | Healthcare benefits | Mental health and coaching | Paid parental leaveSenior-level Full TimeRemote - United States R1d ago
-
GenAI Architect - Agentic USD 152K-185KAPI Gateway | AWS Bedrock | AWS Lambda | AWS Step Functions | AirflowSenior-level Full TimeUSA - Remote, United States R1d ago
-
Data Modeling | Data Quality | Data Validation | Data Warehousing | Data integrationSenior-level Full TimeGreenville Memorial Hospital, United States R1d ago
-
Senior Applied Scientist USD 142K-270KDiffusion Models | Direct Preference Optimization | Fine Tuning | Human Feedback | Inference accelerationSenior-level Full TimeSeattle, United States R1d ago
-
Staff Software Engineer, Data Ingestion - Slack USD 197K-344KAI Assisted Development | AWS ECS | AWS EKS | Airflow | Amazon EMR401k | Employee stock purchase program | Insurance | Life and disability insurance | Medical, dental, and vision insuranceSenior-level Full TimeVirginia - Washington DC Metro - … R1d ago
-
Senior AI/ML Engineer, epocrates USD 124K-210KAWS | AWS SageMaker | Data Privacy | Deep learning | ExplainabilityBook clubs | Collaborative workspaces | Commuter support | Employee assistance program | Employee resource groupsSenior-level Full TimeRemote - TX, United States R1d ago
-
AVP, AI Engineering (Remote - EST) USD 185K-235KA2A | API Development | AWS | Agent Frameworks | Agent systems401k matching | Backup Child Care | Backup elder care | Life insurance | Long-term disabilityExecutive-level Full TimeRaleigh, NC, United States R1d ago
-
Sr. Software Engineer - Applied AI (Hybrid) USD 140K-215KBenchmarking | Constrained decoding | Continual Learning | Embeddings | Fine TuningAdoption leave | Employee networks | Great Place to Work certification | Paid parental leave | Paid time offSenior-level Full TimeAustin, United States R1d ago
-
Senior-level Full TimeUnited States - Remote R1d ago
-
Edge AI Engineer USD 100K-150KBenchmarking | C++ | Core ML | Edge inference | Efficiency optimizationSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAgentic Systems | Data Quality | Data labeling | Data quality monitoring | Deep learningMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapters | DPO | Dataset curation | Efficient Attention | Efficient Fine TuningCareer growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmark Regression Testing | Benchmarking | C++ | CUDA | Compiler optimizationMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering Architect USD 100K-150KAgentic Workflows | Cost Optimization | Embeddings | Evaluation Frameworks | Fine TuningCareer growth | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 100K-150KBehavior Trees | C++ | Computer Vision | Concurrent Systems | ControlMid-level Full TimeUnited States - Remote R1d ago
-
Machine Learning Systems Engineer USD 144K-192KCUDA | GPU Kernels | Kernel Fusion | Machine Learning | Nsight401k match | Dental insurance | Health savings account | Life insurance | Medical insuranceSenior-level Full TimeRemote U.S. R1d ago
-
Machine Learning Systems Engineer USD 144K-192KCUDA | Kernel Fusion | Nsight | PyTorch | PyTorch Profiler401k match | Dental insurance | Health Accounts | Health savings account | Life insuranceSenior-level Full TimeLas Vegas, Nevada, United States R1d ago
-
Machine Learning Systems Engineer USD 144K-192KCUDA | Distributed Training | GPU Performance | GPU performance profiling | Kernel Fusion401k match | Dental insurance | Health savings account | Life insurance | Medical insuranceSenior-level Full TimePittsburgh, Pennsylvania, United States R1d ago
-
Machine Learning Systems Engineer USD 144K-192KCUDA | Distributed Training | Kernel Fusion | Nsight | PyTorch401k match | Dental insurance | Health savings account | Life insurance | Medical insuranceSenior-level Full TimeBoston, Massachusetts, United States R1d ago
-
AWS Amazon Connect Agentic AI Engineer USD 153K-227KAWS CDK | AWS Lambda | Amazon Bedrock | Amazon CloudWatch | Amazon ConnectMid-level Full TimeUnited States - Remote R1d ago
-
Senior Platform AI Engineer USD 192K-259KA/B | A/B Testing | API Design | AWS | Amazon BedrockFlexible schedule | Hybrid work model | In-office collaboration days | Stock equity | Work-life balanceSenior-level Full TimeHybrid - San Francisco R1d ago
-
Senior Applied Research Engineer USD 166K-225KA/B | A/B Testing | B testing | Cross-Encoders | Embeddings401k plan | Flexible vacation policy | Flexible work schedule | Health and wellness benefits | Life and disability insuranceSenior-level Full TimeHybrid - San Francisco R1d ago