Data Engineer - Dallas, TX
Tasks
- Build ETL ELT data pipelines for LLM consumption
- Build low latency streaming pipelines
- Create gold datasets and versioned data snapshots for evaluation
- Enrich data with metadata tagging for agent reasoning
- Implement data cleaning for PII and toxicity removal
- Manage vector databases for similarity search
- Optimize data chunking and embeddings for retrieval
Perks/Benefits
- 401k retirement plan
- Dental insurance
- Medical insurance
- Paid Holidays
- Paid time off
- Vision insurance
Skills/Tech-stack
AWS | Apache Airflow | Apache Flink | Azure | BM25 | Cosine similarity | DBT | Dagster | Databricks | ELT | ETL | Elasticsearch | FastAPI | Google Cloud | HNSW | Hybrid search | Kafka | Pandas | Pydantic | Python | Similarity Search | Snowflake | Vector Database
Education
N/A
Roles
Related jobs
-
Senior Platform AI Engineer USD 119K-180KAPI Design | Asynchronous programming | Authentication | Concurrency | Distributed SystemsSenior-level Full TimeCenter, Center District, IL3h ago
-
Senior-level Full TimeCenter, Center District, IL3h ago
-
Senior-level ContractJersey City, United States4h ago
-
AWS Lambda | Amazon DynamoDB | Amazon Kinesis | Amazon SNS | Amazon SQSHybrid workSenior-level ContractSeattle, United States4h ago
-
Lead Software Engineer - Java/Python - Learn AI / LLM USD 175K-215KAgile | Amazon Web Services | Application Resiliency | Artificial Intelligence | CI/CDBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeNew York, NY, United States4h ago
-
Quant Analytics [Multiple Positions Available] USD 150K-185KAWS Redshift | CTE | Data Aggregation | Data Enrichment | Data TransformationBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site wellness centersSenior-level Full TimePlano, TX, United States5h ago
-
Benchmarking | CUDA | Communication optimization | Data parallelism | Deep learningMid-level Full TimeSeattle, Washington, United States5h ago
-
Machine Learning Engineer USD 130K-194KAI machine learning | AWS AI | AWS AI Machine Learning | Amazon DynamoDB | Amazon EC2Professional development | Work from homeMid-level Full TimeRemote, NY, US R5h ago
-
Software Engineer III - Data, AWS, ETL, Java/Python, USD 173K-185KAPIs | AWS | Agile methodologies | Apache Airflow | Apache FlinkBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimePlano, TX, United States5h ago
-
Algorithms Engineer USD 72K-120KARIMA | Anomaly Detection | Causal Inference | Causal forests | Change point detectionEntry-level Full TimeCenter, Center District, IL5h ago
-
Data parallelism | Deep learning | Distributed Training | GPU Acceleration | Model BenchmarkingMid-level Full TimeSan Jose, California, United States5h ago
-
Partner Engineer, Generative AI USD 173K-247KAWS | Agent Orchestration | Azure | Bias Mitigation | C plus plusSenior-level Full TimeMenlo Park, CA6h ago
-
AI Research Scientist, SysML - FAIR USD 143K-208KArtificial Intelligence | C# | C++ | Co-design | Compiler designMid-level Full TimeMenlo Park, CA | Boston, MA …6h ago
-
Data Engineer, Analytics (Technical Leadership) USD 175K-242KDashboards | Data Architecture | Data Governance | Data Marts | Data ModelingSenior-level Full TimeMenlo Park, CA | New York, …6h ago
-
AI Research Engineer, FAIR Chemistry USD 141K-208KApplied Mathematics | Artificial Intelligence | Computational statistics | Data Science | Density Functional TheorySenior-level Full TimeSan Francisco, CA6h ago
-
IP Validation Engineer - Machine Learning Accelerators USD 142K-203KAHB | APB | AXI | Android | C#Cross-functional collaboration | On device AI work | Prototype and silicon developmentMid-level Full TimeSunnyvale, CA | Burlingame, CA6h ago
-
Mid-level Full TimeMenlo Park, CA6h ago
-
Research Engineer - Perception and Machine Learning USD 177K-251KC++ | Computer Vision | Data Pipelines | Knowledge Distillation | Language ModelsSenior-level Full TimeRedmond, WA | Menlo Park, CA …6h ago
-
Research Engineer - Computer Vision and Robotics USD 141K-208K3D Reconstruction | C plus plus | Computational imaging | Computer Vision | Data AnalysisMid-level Full TimeRedmond, WA6h ago
-
Data Engineer, PAR USD 173K-242KAgent Orchestration | C# | C++ | Data Architecture | Data GovernanceCareer growth | Mentorship | Skill developmentSenior-level Full TimeMenlo Park, CA6h ago
-
Data Scientist, Analytics (Technical Leadership) USD 160K-190KAI Workflow Optimization | AI workflow | Agent Orchestration | Bias Mitigation | Causal InferenceCareer development | World class analytics communitySenior-level Full TimeRemote, US | Bellevue, WA | … R6h ago
-
Senior Software Engineer, AI/ML GenAI, GCP USD 174K-252KAlgorithms | C++ | Cloud Storage | Cloud platform | Computer VisionSenior-level Full TimeSeattle, WA, USA6h ago
-
Senior Staff Research Engineer, DeepMind USD 262K-365KAlgorithms | Artificial Intelligence | Benchmarking | Data Analysis | Data StructuresSenior-level Full TimeMountain View, CA, USA6h ago
-
Agentic Engineer, AI/ML USD 174K-252KC++ | Code review | Data Processing | Debugging | Generative AIHealth insurance | Paid time off | Parental leave | Retirement plan | Workplace flexibilityMid-level Full TimeMountain View, CA, USA6h ago
-
Senior Software Engineer, BigQuery AI/ML USD 174K-252KArtificial Intelligence | BigQuery | C++ | Data Warehousing | Data analyticsSenior-level Full TimeKirkland, WA, USA6h ago