AI Agent Data Pipeline Intern
Tasks
- Build data pipelines for experiment context
- Build internal tools, dashboards, and reports
- Build semantic search and RAG pipelines
- Clean noisy unstructured text data
- Design data schemas and metadata
- Evaluate agent summaries and recommendations
- Extract experiment-relevant information
- Implement data quality checks
- Index and retrieve experiment context
- Ingest and organize experiment-related data
- Prepare curated datasets for evaluation
- Support LLM fine-tuning dataset preparation
Perks/Benefits
- N/A
Skills/Tech-stack
Data cleaning | Data pipeline | ETL | LLM | Language Processing | MLOps | Machine Learning | Natural Language | Natural Language Processing | Prompt engineering | Python | RAG | SQL | Semantic Search | Vector Search
Education
N/A