Data Engineer - Dallas, TX
Tasks
- Build ETL ELT data pipelines for LLM consumption
- Build low latency streaming pipelines
- Create gold datasets and versioned data snapshots for evaluation
- Enrich data with metadata tagging for agent reasoning
- Implement data cleaning for PII and toxicity removal
- Manage vector databases for similarity search
- Optimize data chunking and embeddings for retrieval
Perks/Benefits
- 401k retirement plan
- Dental insurance
- Medical insurance
- Paid Holidays
- Paid time off
- Vision insurance
Skills/Tech-stack
AWS | Apache Airflow | Apache Flink | Azure | BM25 | Cosine similarity | DBT | Dagster | Databricks | ELT | ETL | Elasticsearch | FastAPI | Google Cloud | HNSW | Hybrid search | Kafka | Pandas | Pydantic | Python | Similarity Search | Snowflake | Vector Database
Education
N/A
Roles
Related jobs
-
Senior-level ContractJersey City, United States3h ago
-
AWS Lambda | Amazon DynamoDB | Amazon Kinesis | Amazon SNS | Amazon SQSHybrid workSenior-level ContractSeattle, United States3h ago
-
Lead Software Engineer - Java/Python - Learn AI / LLM USD 175K-215KAgile | Amazon Web Services | Application Resiliency | Artificial Intelligence | CI/CDBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeNew York, NY, United States3h ago
-
Quant Analytics [Multiple Positions Available] USD 150K-185KAWS Redshift | CTE | Data Aggregation | Data Enrichment | Data TransformationBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site wellness centersSenior-level Full TimePlano, TX, United States3h ago
-
Benchmarking | CUDA | Communication optimization | Data parallelism | Deep learningMid-level Full TimeSeattle, Washington, United States4h ago
-
Machine Learning Engineer USD 130K-194KAI machine learning | AWS AI | AWS AI Machine Learning | Amazon DynamoDB | Amazon EC2Professional development | Work from homeMid-level Full TimeRemote, NY, US R4h ago
-
Software Engineer III - Data, AWS, ETL, Java/Python, USD 173K-185KAPIs | AWS | Agile methodologies | Apache Airflow | Apache FlinkBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimePlano, TX, United States4h ago
-
Data parallelism | Deep learning | Distributed Training | GPU Acceleration | Model BenchmarkingMid-level Full TimeSan Jose, California, United States4h ago
-
Partner Engineer, Generative AI USD 173K-247KAWS | Agent Orchestration | Azure | Bias Mitigation | C plus plusSenior-level Full TimeMenlo Park, CA5h ago
-
AI Research Scientist, SysML - FAIR USD 143K-208KArtificial Intelligence | C# | C++ | Co-design | Compiler designMid-level Full TimeMenlo Park, CA | Boston, MA …5h ago
-
Data Engineer, Analytics (Technical Leadership) USD 175K-242KDashboards | Data Architecture | Data Governance | Data Marts | Data ModelingSenior-level Full TimeMenlo Park, CA | New York, …5h ago
-
AI Research Engineer, FAIR Chemistry USD 141K-208KApplied Mathematics | Artificial Intelligence | Computational statistics | Data Science | Density Functional TheorySenior-level Full TimeSan Francisco, CA5h ago
-
IP Validation Engineer - Machine Learning Accelerators USD 142K-203KAHB | APB | AXI | Android | C#Cross-functional collaboration | On device AI work | Prototype and silicon developmentMid-level Full TimeSunnyvale, CA | Burlingame, CA5h ago
-
Mid-level Full TimeMenlo Park, CA5h ago
-
Research Engineer - Perception and Machine Learning USD 177K-251KC++ | Computer Vision | Data Pipelines | Knowledge Distillation | Language ModelsSenior-level Full TimeRedmond, WA | Menlo Park, CA …5h ago
-
Research Engineer - Computer Vision and Robotics USD 141K-208K3D Reconstruction | C plus plus | Computational imaging | Computer Vision | Data AnalysisMid-level Full TimeRedmond, WA5h ago
-
Data Engineer, PAR USD 173K-242KAgent Orchestration | C# | C++ | Data Architecture | Data GovernanceCareer growth | Mentorship | Skill developmentSenior-level Full TimeMenlo Park, CA5h ago
-
Data Scientist, Analytics (Technical Leadership) USD 160K-190KAI Workflow Optimization | AI workflow | Agent Orchestration | Bias Mitigation | Causal InferenceCareer development | World class analytics communitySenior-level Full TimeRemote, US | Bellevue, WA | … R5h ago
-
Senior Software Engineer, AI/ML GenAI, GCP USD 174K-252KAlgorithms | C++ | Cloud Storage | Cloud platform | Computer VisionSenior-level Full TimeSeattle, WA, USA5h ago
-
Senior Staff Research Engineer, DeepMind USD 262K-365KAlgorithms | Artificial Intelligence | Benchmarking | Data Analysis | Data StructuresSenior-level Full TimeMountain View, CA, USA5h ago
-
Agentic Engineer, AI/ML USD 174K-252KC++ | Code review | Data Processing | Debugging | Generative AIHealth insurance | Paid time off | Parental leave | Retirement plan | Workplace flexibilityMid-level Full TimeMountain View, CA, USA5h ago
-
Senior Software Engineer, BigQuery AI/ML USD 174K-252KArtificial Intelligence | BigQuery | C++ | Data Warehousing | Data analyticsSenior-level Full TimeKirkland, WA, USA5h ago
-
Senior Strategist, Kids and Learning Trust and Safety USD 132K-189KAutomation | Classification | Data Analysis | Data sets | DebuggingSenior-level Full TimeSeattle, WA, USA; Austin, TX, USA5h ago
-
Machine Learning Engineer USD 110K-150KAirflow | Bayesian Methods | CI/CD | Causal Inference | DBT401k matching | Disability coverage | Health, dental, vision insurance | Paid Holidays | Paid parental leaveMid-level Full TimeNew York, NY, US9h ago
-
Analytics Engineer USD 100K-132KAWS | Amazon Redshift | Apache Spark | Azure | Cloud Computing401k match | Collaborative work environment | Dental insurance | Health insurance | Home Office PerksMid-level Full TimeBethesda, United States13h ago