Principal Engineer - Data Ingestion & AI Pipeline
USD 122K-173K (estimate) Senior-level Full Time
Tasks
- Build scalable ingestion patterns for enterprise sources
- Create reusable patterns reference architectures standards and guardrails
- Define data quality checks for completeness accuracy duplication stale content unsupported formats sensitive information
- Define engineering approach for program portfolio solutions
- Define technology tool stack within approved technologies
- Define transformation patterns for parsing text extraction normalization deduplication enrichment chunking classification metadata generation
- Design enterprise data ingestion pipelines
- Design incremental ingestion change detection delta processing and reprocessing
- Ensure ingestion compliance with security privacy retention and regulatory requirements
- Establish end to end test strategy and integration between teams
- Establish ingestion standards for lineage freshness versioning access control auditability
- Evaluate and implement OCR document intelligence parsing entity extraction classification and data quality
- Implement pipeline orchestration monitoring retry logic error handling dashboards
- Lead planning definition and design for complex features across teams
- Mentor senior engineers and influence technical direction
- Optimize ingested data for embedding indexing and retrieval for RAG
- Own architecture decisions impacting multiple teams systems or domains
- Provide technical leadership across data engineering AI platform cloud and application teams
- Provide technical oversight through design reviews and code within domain
- Structure and enrich data for effective LLM reasoning
Perks/Benefits
Skills/Tech-stack
AI Document Intelligence | Access Control | Apache Airflow | Apache Kafka | Apache Spark | Azure AI | Azure AI Document Intelligence | Azure Data | Azure Data Factory | Azure Event | Azure Event Hubs | Azure Fabric | Azure Synapse | Azure Synapse Analytics | Batch Processing | Change detection | Content Classification | DBT | DLP | Data Factory | Data Governance | Data Lineage | Data Quality | Data Security | Data freshness | Databricks | Delta Processing | Document Intelligence | Document parsing | ELT | ETL | Embeddings | Entity Extraction | Error Handling | Event Hubs | Incremental processing | LLM | OCR | PII Detection | Privacy Compliance | Python | RAG | Retry logic | SQL | Schema inference | Semantic Search | Streaming | Synapse Analytics | Text extraction | Vector Databases | Vector Search
Education
Bachelor of Engineering | Bachelor of Engineering in Computer Science | Bachelor of Science | Bachelor of Science in Computer Science
Roles
AI | AI Engineer | Data Engineering | Data Engineering Manager | Engineer | Engineering Manager | Manager | Principal | Principal Engineer
Regions
Countries
States
Related jobs
-
AWS Bedrock | Agent systems | Anthropic API | Autogen | Azure401k matching program | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Contract Full TimeRemote, OR, United States R11h ago
-
Senior Data Engineer USD 90K-110KAWS | Agile | Apache NiFi | Data Architecture | Data ModelingAutonomy | Flexible working hours | Global employee assistance programme | Online training videos | Teambuilding eventsSenior-level Full TimeNew York, United States11h ago
-
Data Engineer USD 74K-133KAgile | Apache Airflow | BigQuery | Cloud Composer | Cloud Data401k retirement plan | Dental insurance | Disability insurance | Flexible time off | Health insuranceMid-level Full TimeLisle, IL, United States R12h ago
-
API Testing | Cypher | Data Quality | DataOps | DevOpsBenefits | Competitive pay | Growth opportunity | Remote work | Travel requiredSenior-level Full TimeReston, VA, United States R14h ago
-
Sr. Manager, Digital & AI Success USD 215K-338KArtificial Intelligence | Business Requirements | Cloud Computing | Customer Analytics | Data WorkflowEmployee stock purchase plan | Health insurance | Leave options | Life insurance | Paid time offSenior-level Full TimeAtlanta, GA Office (ATLANTA)15h ago
-
Principal AI/ML Scientist USD 150K-207KAWS | AWS GovCloud | Artificial Intelligence | Azure | Azure AIPublic trust suitabilitySenior-level Full TimeARLINGTON, VA, United States15h ago
-
Principal AI Program Manager USD 145K-193KAI Governance | Artificial Intelligence | Change Management | Data Preparation | Enterprise Architecture401k matching | Paid sick leave | Paid vacation time | Tuition reimbursementSenior-level Full TimeSan Jose, CA, United States15h ago
-
Principal Engineer - Data Platform USD 221K-387KAWS | Airflow | Apache Hive | Apache Iceberg | Apache ImpalaRemote workSenior-level Full TimeSanta Clara, California, United States R15h ago
-
Agile | Automated testing | CI/CD | Cloud Computing | CrewAIDental insurance | Health insurance | Vision insuranceMid-level Full TimeAshburn, VA, United States17h ago
-
AI Machine Learning Skill 2-FFPP-8904 USD 78K-250KC# | Data Governance | Data Modeling | Data pipeline | Java401k plan with company match | Dental insurance | Diverse inclusive workplace | Employee referral programs | Flexible spending accountsMid-level Full TimeHanover, MD18h ago
-
AWS | AWS SageMaker | Azure | Cloud Pak for Data | Cloud infrastructureAccess to national security mission work | Hybrid work | Travel opportunitiesSenior-level Full TimeUSA-VA-Herndon18h ago
-
AI-assisted software development | AWS | Agentic AI | Azure | Cloud ComputingSenior-level Full TimeUSA-VA-Herndon18h ago
-
Analytics Engineer USD 115K-150KAgile | Azure DevOps | CI/CD | DBT | Data GovernanceAdoption Assistance | Dental insurance | Disability insurance | Educational assistance | Flexible spending accountMid-level Full TimeHouston, Texas | Tulsa, Oklahoma | …19h ago
-
AI Engineer USD 180KAgent Orchestration | Cost Management | Data Pipelines | Distributed Systems | LLM401k | Commuter benefits | Dental insurance | Flexible spending | Health insuranceMid-level Full TimeNew York, New York, United States …19h ago
-
Data & Analytics Specialist USD 87K-135KAPI Integration | Alteryx | DAX | JavaScript | Power AppsAdoption Assistance | Educational assistance | Flexible spending account | Health savings account | Life insuranceMid-level Full TimeWichita, Kansas19h ago
-
Data Platform & Engineering Specialist USD 100K-130KAWS | Amazon Kinesis | Azure | Azure Event | Azure Event HubsDental insurance | Educational assistance | Flexible spending accounts | Health insurance | Health savings accountsMid-level Full TimeLincoln, Nebraska19h ago
-
Machine Learning Leader - Optical Solutions USD 180K-300KAnomaly Detection | Data analytics | Image Processing | Java | Machine LearningAdoption Assistance | Disability insurance | Educational assistance | Flexible spending account | Health savings accountSenior-level Full TimeFremont, California19h ago
-
Process and Analytics Engineer USD 105K-140KAgile | Anomaly Detection | Asset Framework | HYSYS | HYSYS OnlineDental insurance | Disability insurance | Educational assistance | Flexible spending account | Health insuranceMid-level Full TimeWichita, Kansas19h ago
-
AI Architect USD 134K-237KAI Search | AI Security | API Gateway | API Integration | AWS BedrockAdoption Assistance | Dental insurance | Disability insurance | Educational assistance | Flexible spending accountsSenior-level Full TimeHouston, Texas | Tulsa, Oklahoma | …19h ago
-
Director, AI Product Manager USD 170K-220KAI FinOps | AI Governance | AIOps | Amazon SageMaker | Apache SparkAdoption Assistance | Disability insurance | Educational assistance | Flexible spending account | Health benefits (medical, dental, vision)Executive-level Full TimeLisle, Illinois19h ago
-
CV/NLP/Multimodal LLM Machine Learning Engineer Graduate (TikTok-Trust and Safety) - 2026 Start (PhD) USD 136K-246KActive Learning | Computer Vision | Content Classification | Data-Driven Strategy | Data-drivenEntry-level Full TimeSeattle, Washington, United States20h ago
-
Senior Finance Data Engineer / Data Analyst USD 100K-120KDAX | Dashboard Development | Data Modeling | Data Standardization | Data TransformationSenior-level Full TimeAuburn Hills, MI, United States21h ago
-
Software Engineer III, Generative AI USD 147K-211KComputer Vision | Data Processing | Debugging | Language Models | Language ProcessingSenior-level Full TimeKirkland, WA, USA21h ago
-
Staff Software Engineer, AI/ML, YouTube Ads USD 207K-301KA/B | A/B Testing | B testing | Data Structures | Data structures algorithmsSenior-level Full TimeMountain View, CA, USA21h ago
-
Machine Learning Engineer USD 120K-140KAI Pipelines | AI Workbench | AI endpoints | Apache Kafka | Automated testingEntry-level Full TimeDenver, Colorado, United States23h ago