AI Data Infrastructure Engineer
Tasks
- Build high throughput data loading systems for training
- Build ingestion systems for multimodal data
- Construct evaluation datasets with integrity and contamination controls
- Design and operate large scale AI data pipelines
- Design storage architectures with cost throughput and latency tradeoffs
- Develop dataset versioning lineage and provenance tracking
- Document data systems schemas and operational procedures
- Drive observability of data quality drift and pipeline health
- Implement data cleaning deduplication filtering and quality assurance
- Implement data privacy redaction and consent enforcement
- Implement labeling workflows active learning and human in the loop improvement
- Optimize cost and performance with compression format selection and caching
Perks/Benefits
Skills/Tech-stack
Active Learning | Apache Beam | CI/CD | Caching | Code review | Compression | Data Lineage | Data Modeling | Data Privacy | Data Quality | Data Storage | Data Versioning | Data redaction | Dataset evaluation | Distributed Systems | Human-in-the-loop | Java | Python | Ray | Scala | Spark | Testing | The Loop
Education
Related jobs
-
Senior GenAI Software Engineer (North America) USD 165K-230KA/B | A/B Testing | B testing | Debugging | EvaluationEquity | Health, dental, and vision benefits | In person team gatherings quarterly | Remote-first work | Wellness stipendsSenior-level Full TimeUnited States R17h ago
-
Senior Software Engineer, AI Developer Experience USD 202K-230KAPI Integration | Agentic Workflows | Artificial Intelligence | Code review | Command LineCareer coaching and support | In-office culinary options | Inclusive family building benefits | Long term savings or retirement plans | Mental health wellness and fitness benefitsSenior-level Full TimeNew York City R17h ago
-
Machine Learning Platform Engineer USD 135K-160KAmazon SageMaker | Apache Flink | C++ | CI/CD | Cloud PubSub401k match | Annual bonus | Company equipment provided | Company medical dental vision plans | Disability benefitsMid-level Full TimeAtlanta, GA preferred, Remote R19h ago
-
Senior Developer Advocate - Modern App Development USD 194K-237KAPI Integrations | AWS | Cloud platform | Code Quality | Google CloudCommunity groups | Employee stock purchase plan | Inclusion talks | Mental health benefits | Mentor/Buddy programSenior-level Full TimeCalifornia, USA, Remote; Nevada, USA, Remote; … R20h ago
-
Staff Software Engineer, AI Developer Tools USD 180K-245KAPI Design | Agent systems | CI/CD | Compliance | Data PrivacySenior-level Full TimeDenver, CO;San Francisco, CA;New York, NY;Seattle, … R20h ago
-
Staff Software Engineer, Big Data Storage USD 177K-364KApache Flink | Apache Hive | Apache Iceberg | Apache Spark | Column BackfillSenior-level Full TimePalo Alto, CA, US; Remote, US R20h ago
-
Senior AI Data Engineer USD 160K-200KAWS Glue | AWS Lambda | Amazon Kinesis | Amazon Redshift | Amazon S3401k matching | Dental insurance | Disability insurance | Life insurance | Medical insuranceSenior-level Full TimeSan Diego, California, United States R21h ago
-
Senior Embedded Software Engineer - Future Forward USD 153K-201KAuthentication | Board Bring-up | Bring-up | C# | C++Senior-level Full TimeSunnyvale, CA, United States R21h ago
-
Senior Software Engineer, Data Authoring Platform USD 196K-230KAPI Design | Anomaly Detection | Automated testing | DSL | Data GovernanceEmployee travel credits | Remote eligibleSenior-level Full TimeRemote USA R21h ago
-
Lead AI Engineer, Business Operations (Hybrid or Remote USD 150K-220KAPI Design | Backend Development | Cloud Platforms | Evaluation Frameworks | Fine Tuning401k company match | Career advancement opportunities | Dental insurance | Flexible time off policy | Life insuranceSenior-level Full TimeDallas, Texas, United States; United States R21h ago
-
AWS | Airflow | Apache Spark | Azure Synapse | Azure Synapse Analytics401k matching | Disability insurance | Employee assistance program | Life insurance | Medical/Dental/Vision insuranceMid-level Full TimeRemote, USA ; Remote, Canada R22h ago
-
Principal Data Engineer/ Technical Lead USD 219K-298KAWS | Access Layer | Aggregation pipelines | Apache Kafka | Apache Spark401k match | Employer paid medical/dental/vision | Flexible spending account | Paid parental leave | Remote first work from homeSenior-level Full TimeUnited States (Remote) R23h ago
-
Senior Software Engineer II - (AI Core Platform) USD 100K-177KAPI Development | API Gateway | AWS | Agile | AlertingMid-level Full TimeRemote, United States R1d ago
-
Senior Software Engineer I - AI/ML USD 145K-190KAPI Development | Agile | Alerting | CI/CD | Data ModelingSenior-level Full TimeRemote, United States R1d ago
-
AI Expert USD 148K-175KAWS | Agile | Batch Processing | Data Mapping | Data ModelingHybrid work | Public Trust Clearance | Remote workSenior-level Full TimeMemphis, TN, United States R1d ago
-
Machine Learning Engineer II USD 160K-210KAirflow | Apache_Spark | Autoscaling | C++ | CI_CDDental insurance | Disability insurance | Flexible vacation | Health insurance | Life insuranceSenior-level Full TimeRemote, USA R1d ago
-
People Analytics AI Engineer USD 146K-221KAPI Integration | AWS | Amazon Redshift | Automation | Data ModelingFlexible working | Health benefits | Parental leave plans | Professional development stipend | Remote ModelSenior-level Full TimeRemote - Seattle R1d ago
-
Senior AI Integration Engineer USD 190K-190KAWS | AgenticAI | AmazonS3 | Bash | BedrockPart-time remote workSenior-level Full TimeNew York, New York, United States R1d ago
-
Sr . SAP Datasphere + Databricks Engineer - Hybrid USD 180K-248KAPI | Access Control | CI/CD | Data Classification | Data GovernanceHybrid workSenior-level ContractDurham or Philadelphia, United States R1d ago
-
VP II AI, Machine Learning|US Remote* USD 202K-210KApache Spark | Azure | Databricks | ETL | Flume401k match | Medical/Dental/Vision insurance | Paid time offMid-level Full TimeRemote, United States R1d ago
-
Data Engineer - AWS/Databricks - Mid Level USD 121K-170KAWS | AWS Glue | Adaptive query execution | Amazon CloudWatch | Amazon RDSComprehensive benefits | Mentorship | Mentorship and personalized development plans | Training budget | Work/life balance focusMid-level Full TimeReston, VA, United States R1d ago
-
Assistant Director, Data Science (STP) USD 139K-226KData Visualization | GLM | Insurance pricing | MLOps | Machine LearningDomestic travel | TelecommutingExecutive-level Full TimeBoston, MA, United States R1d ago
-
Inference Engineer USD 180K-250KCUDA | Continuous batching | Distributed Systems | Generative Models | Machine Learning401k | Commuter allowance | Dental insurance | Flexible PTO | Health insuranceMid-level Full Time*HQ - San Francisco, CA R1d ago
-
Principal Machine Learning Engineer USD 205K-230KAWS Lambda | BigQuery | C# | CI/CD | Cloud Functions401k | Dental insurance | Health insurance | Life insurance | Paid HolidaysSenior-level Full TimeUnited States of America - Remote … R1d ago
-
Generative AI Engineering Intern (Graduate) USD 70K-70KAWS | Agile | Azure OpenAI | Azure OpenAI Service | CI/CDDedicated mentorship | Flexible scheduling | Networking opportunities | Potential full-time employment | Remote friendly schedulingEntry-level Full Time InternshipUnited States R1d ago