AI Data Engineer
Tasks
- Build high throughput data loading for GPU utilization
- Build ingestion systems for multimodal data
- Construct evaluation datasets with contamination controls
- Design large scale AI data pipelines
- Design storage architectures for cost and performance
- Develop dataset versioning and lineage
- Document data systems schemas and operational procedures
- Drive observability for data quality and pipeline health
- Implement data cleaning and quality assurance
- Implement data privacy redaction and consent enforcement
- Implement labeling workflows and active learning
- Optimize cost and performance with compression and caching
Perks/Benefits
- N/A
Skills/Tech-stack
Apache Beam | Apache Spark | CI/CD | Code review | Data Compression | Data Lineage | Data Modeling | Data Privacy | Data Quality | Data Storage | Data caching | Data loading | Data quality assurance | Dataset versioning | Distributed Systems | GPU Utilization | High Throughput | High Throughput Data Loading | High-throughput data | Machine Learning | Python | Quality Assurance | Ray | Testing
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Roles
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R4d ago
-
APIs | AWS Glue | AWS Redshift | Amazon Web Services | Apache AirflowFully remote | Health insurance | On call production support | Paid time off | Retirement plansSenior-level Full TimeOrlando, FL, United States R8h ago
-
Data Engineer, Engineering & Operations USD 115K-145KAccess Control | Aggregation Thresholds | Airflow | Alerting | Anonymization401k | Dental insurance | Discounts | Fully remote | Medical insuranceMid-level Full TimeNew York, NEW YORK, United States R12h ago
-
Sr. Databricks Consultant USD 175K-250KAWS | Amazon S3 | Apache Spark | Azure | Azure DataRemote work | Travel as neededSenior-level Full TimeWork from home, VA, United States R12h ago
-
Ansible | C plus plus | C# | CMake | CUDAMid-level Full TimeBaltimore, MD, United States R14h ago
-
GCP/Linux Data Engineer (Remote) USD 95K-170KAPIs | Agile | CI/CD | Cloud automation | Cloud platformFully remote | W2 Candidates OnlyMid-level Full TimeRochester, MN R15h ago
-
Hiring: Senior Principal AI Software Engineer – Agentic & Industry Solutions | Full-Time | Remote USD 138K-208KAWS | AWS Bedrock | Agent systems | Anthropic API | Autogen401k matching | Adoption Assistance | Dental insurance | Fertility treatments | Flexible work scheduleSenior-level Contract Full TimeRemote, OR, United States R16h ago
-
A/B | A/B Testing | AI Model Deployment | AI model | App ServiceRemote workMid-level ContractHartford, United States R16h ago
-
Senior Software Engineer, DeepMind USD 221K-253KAlgorithms | Audio Processing | C++ | Cause analysis | Data StructuresBonus | Equity | Hybrid scheduleSenior-level Full TimeMountain View, CA, USA R18h ago
-
Senior Data Engineer USD 90K-110KAgile | Amazon Web Services | Apache NiFi | Data Modeling | Data WarehousingAutonomy | Employee assistance program | Flexible working hours | Inclusive community | Online training videosSenior-level Full TimeNew York, United States R21h ago
-
Senior Software Engineer, Storage USD 217K-303KC++ | Caching | Cassandra | Data Storage | Distributed Systems401k employer match | Dental insurance | Equity compensation | Generous time off | Medical insuranceSenior-level Full TimeRemote - United States R1d ago
-
Data Engineer, Go To Market (Remote) USD 100K-155KAmazon Redshift | Apache Airflow | CI/CD | CRM | DBTSenior-level Full TimeUSA CA Remote, United States R1d ago
-
Senior Data Engineer USD 97K-185KAWS Batch | Agile | Amazon EC2 | Amazon EKS | Amazon RDSEducational assistance | Health care coverage | Paid time off | Parental leave | Retirement planSenior-level Full Time245 Summer St, Boston MA, United … R1d ago
-
Senior Data Engineer USD 91K-163KAI | Automation | Backup and Recovery | Capacity Planning | Change ManagementCareer development | Comprehensive benefits | Telecommute within United StatesSenior-level Full TimePrimary location: Eden Prairie, MN R1d ago
-
Head of AI Transformation & Enablement USD 155K-195KArtificial Intelligence | Automation | Benefit Analysis | Business case | Business case development401k match | Dental insurance | Employee assistance program | Holiday pay | Life insuranceExecutive-level Full TimeUSA-Remote, United States R1d ago
-
Data Governance and Master Data Management Expert USD 160K-220KBusiness glossary | Cause analysis | Data Compliance | Data Controls | Data GovernanceRemote workSenior-level ContractUnited States - Remote R1d ago
-
Databricks Platform Engineer (AWS) USD 115K-213KAWS | AWS Secrets | AWS Secrets Manager | CI/CD | DatabricksDental insurance | Disability insurance | Employee wellness | Health insurance | Life insuranceSenior-level Full TimeUSA VA Home based (CSC Location), … R1d ago
-
Staff Software Engineer, AI/ML USD 136K-265KAutogen | Cloud Platforms | CrewAI | Embeddings | GoContinuous learning culture | Remote work | Vacation packageSenior-level Full TimeRemote - USA, United States R1d ago
-
Machine Learning Engineer USD 196K-196KApache Airflow | CI/CD | Containerization | Docker | Experiment trackingFully remoteSenior-level Full TimeSan Francisco, CA R1d ago
-
AWS Bedrock | Agent systems | Anthropic API | Autogen | Azure401k matching program | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Contract Full TimeRemote, OR, United States R1d ago
-
Software Engineer, Applied AI USD 190K-280KContext engineering | Data Processing | Data Storage | Debugging | Docker401k match | Dental insurance | Health insurance | Hybrid work model | Professional developmentSenior-level Full TimeNew York City R1d ago
-
Data Engineer USD 74K-133KAgile | Apache Airflow | BigQuery | Cloud Composer | Cloud Data401k retirement plan | Dental insurance | Disability insurance | Flexible time off | Health insuranceMid-level Full TimeLisle, IL, United States R1d ago
-
Senior Staff Data Engineer - Platform Data and Analytics USD 268K-368KAirflow | Alerting | Apache Spark | Capacity Planning | Cost OptimizationSenior-level Full TimeSan Francisco, CA R1d ago
-
Staff Data Engineer USD 185K-220KAWS | Apache Airflow | Apache Kafka | Benthos | Big DataDental insurance | Disability insurance | Flexible work hours | Health insurance | Health savings accountSenior-level Full TimeRosslyn, VA or Remote R1d ago
-
API Testing | Cypher | Data Quality | DataOps | DevOpsBenefits | Competitive pay | Growth opportunity | Remote work | Travel requiredSenior-level Full TimeReston, VA, United States R1d ago