Data Lead- Dallas, TX
Tasks
- Automate data cleaning for AI datasets
- Build ingestion to insight data pipeline
- Build real-time streaming pipelines
- Design ETL/ELT pipelines
- Design RAG architectures
- Enrich data with metadata tagging
- Implement chunking and embedding strategies
- Manage vector databases
- Optimize similarity search
- Remove PII and toxicity
- Set up evaluation datasets and versioned snapshots
Perks/Benefits
Skills/Tech-stack
AWS | Airflow | Azure | BM25 | Cosine similarity | DBT | Dagster | Databricks | Elasticsearch | Embedding Models | FastAPI | Flink | GCP | HNSW | Hybrid search | Kafka | Metadata Engineering | Pandas | Pydantic | Python | RAG | Similarity Search | Snowflake | Vector Database | Vector similarity | Vector similarity search
Education
N/A
Roles
Data Engineer | Engineer | Lead | Lead Data | Lead Data Engineer
Related jobs
-
Senior Platform AI Engineer USD 119K-180KAPI Design | Asynchronous programming | Authentication | Concurrency | Distributed SystemsSenior-level Full TimeCenter, Center District, IL3h ago
-
Senior-level Full TimeCenter, Center District, IL3h ago
-
Senior-level ContractJersey City, United States4h ago
-
AWS Lambda | Amazon DynamoDB | Amazon Kinesis | Amazon SNS | Amazon SQSHybrid workSenior-level ContractSeattle, United States4h ago
-
Lead Software Engineer - Java/Python - Learn AI / LLM USD 175K-215KAgile | Amazon Web Services | Application Resiliency | Artificial Intelligence | CI/CDBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeNew York, NY, United States4h ago
-
Quant Analytics [Multiple Positions Available] USD 150K-185KAWS Redshift | CTE | Data Aggregation | Data Enrichment | Data TransformationBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site wellness centersSenior-level Full TimePlano, TX, United States5h ago
-
Benchmarking | CUDA | Communication optimization | Data parallelism | Deep learningMid-level Full TimeSeattle, Washington, United States5h ago
-
Machine Learning Engineer USD 130K-194KAI machine learning | AWS AI | AWS AI Machine Learning | Amazon DynamoDB | Amazon EC2Professional development | Work from homeMid-level Full TimeRemote, NY, US R5h ago
-
Software Engineer III - Data, AWS, ETL, Java/Python, USD 173K-185KAPIs | AWS | Agile methodologies | Apache Airflow | Apache FlinkBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimePlano, TX, United States5h ago
-
Algorithms Engineer USD 72K-120KARIMA | Anomaly Detection | Causal Inference | Causal forests | Change point detectionEntry-level Full TimeCenter, Center District, IL5h ago
-
Data parallelism | Deep learning | Distributed Training | GPU Acceleration | Model BenchmarkingMid-level Full TimeSan Jose, California, United States6h ago
-
Partner Engineer, Generative AI USD 173K-247KAWS | Agent Orchestration | Azure | Bias Mitigation | C plus plusSenior-level Full TimeMenlo Park, CA6h ago
-
AI Research Scientist, SysML - FAIR USD 143K-208KArtificial Intelligence | C# | C++ | Co-design | Compiler designMid-level Full TimeMenlo Park, CA | Boston, MA …6h ago
-
Data Engineer, Analytics (Technical Leadership) USD 175K-242KDashboards | Data Architecture | Data Governance | Data Marts | Data ModelingSenior-level Full TimeMenlo Park, CA | New York, …6h ago
-
AI Research Engineer, FAIR Chemistry USD 141K-208KApplied Mathematics | Artificial Intelligence | Computational statistics | Data Science | Density Functional TheorySenior-level Full TimeSan Francisco, CA6h ago
-
IP Validation Engineer - Machine Learning Accelerators USD 142K-203KAHB | APB | AXI | Android | C#Cross-functional collaboration | On device AI work | Prototype and silicon developmentMid-level Full TimeSunnyvale, CA | Burlingame, CA6h ago
-
QA Engineering Lead, AI Native USD 125K-150KAdversarial Testing | Automation frameworks | Black box testing | Black-box | Box testingSenior-level Full TimeMenlo Park, CA6h ago
-
Mid-level Full TimeMenlo Park, CA6h ago
-
AI ethics | Agent Orchestration | Bias Mitigation | Capacity Planning | Data StorytellingSenior-level Full TimeBellevue, WA | Menlo Park, CA6h ago
-
Research Engineer - Perception and Machine Learning USD 177K-251KC++ | Computer Vision | Data Pipelines | Knowledge Distillation | Language ModelsSenior-level Full TimeRedmond, WA | Menlo Park, CA …6h ago
-
Research Engineer - Computer Vision and Robotics USD 141K-208K3D Reconstruction | C plus plus | Computational imaging | Computer Vision | Data AnalysisMid-level Full TimeRedmond, WA6h ago
-
Data Engineer, PAR USD 173K-242KAgent Orchestration | C# | C++ | Data Architecture | Data GovernanceCareer growth | Mentorship | Skill developmentSenior-level Full TimeMenlo Park, CA6h ago
-
Data Scientist, Analytics (Technical Leadership) USD 160K-190KAI Workflow Optimization | AI workflow | Agent Orchestration | Bias Mitigation | Causal InferenceCareer development | World class analytics communitySenior-level Full TimeRemote, US | Bellevue, WA | … R6h ago
-
Senior Software Engineer, AI/ML GenAI, GCP USD 174K-252KAlgorithms | C++ | Cloud Storage | Cloud platform | Computer VisionSenior-level Full TimeSeattle, WA, USA6h ago
-
Senior Staff Research Engineer, DeepMind USD 262K-365KAlgorithms | Artificial Intelligence | Benchmarking | Data Analysis | Data StructuresSenior-level Full TimeMountain View, CA, USA6h ago