Data Engineer - Generative AI & Vector Systems
Chennai, Tamil Nadu, India
INR 2500K-5000K (estimate) Senior-level Full Time
Tasks
- Automate workflow orchestration using Apache Airflow
- Build RAG pipelines
- Build batch and near real time ingestion pipelines
- Build pipelines to clean transform chunk enrich and load data into vector databases
- Design and manage vector database architectures
- Design efficient data models for AI workloads
- Design scalable data ingestion pipelines for AI and ML
- Develop cloud-native solutions on AWS
- Develop production grade ETL and ELT pipelines
- Ensure data integrity and governance
- Generate embeddings from structured and unstructured data
- Implement monitoring, logging, and alerting
- Improve similarity search performance and reduce retrieval latency
- Manage embedding generation and vector indexing
- Monitor pipeline performance and ensure high availability
- Optimize data quality for semantic search and retrieval
- Optimize indexing storage metadata management and retrieval performance
- Troubleshoot production data issues
- Use Starburst or Trino to federate and query multiple data platforms
- Write complex SQL queries across distributed data sources
Perks/Benefits
Skills/Tech-stack
AWS Glue | AWS IAM | AWS Lambda | Amazon Athena | Amazon CloudWatch | Amazon EMR | Amazon S3 | Apache Airflow | Chroma | Data Cleansing | Data Modeling | Data Transformation | ELT | ETL | Embeddings | Faiss | JSON | Metadata Management | Milvus | Pinecone | Prompt engineering | Python | Qdrant | REST API | Retrieval-Augmented Generation | SQL | Semantic Search | Starburst | Trino | Vector Databases | Vector Search | Weaviate
Education
N/A
Roles
Related jobs
-
Database Engineer INR 500K-1500KCI/CD | DB2 | Data Warehousing | Database performance | Database performance tuningMid-level Full TimePune, MH, India4h ago
-
Principal Engineer, Data Analytics Engineering [ 8+ years ] INR 2500K-3584KAWS | Algorithms | Azure | Big Data | CI/CDSenior-level Full TimeBengaluru, KA, India5h ago
-
Lead Software Engineer - Java and AI/ML INR 3500K-5000KAI Agents | Agile | Automated testing | CI/CD | Code modernizationSenior-level Full TimeMumbai, Maharashtra, India7h ago
-
Databricks Developer INR 1500K-2400KACID | Batch Processing | CI/CD | Cluster management | Data IngestionWork from office 5 days per weekMid-level Full TimeGurugram, Haryana, India8h ago
-
Principal Data Scientist INR 3000K-4000KContainers | Continuous integration | Decoder Architecture | Deep learning | Design PatternsCar lease | Certification programs | Continuous learning programs | Corporate pension plan | Dental coverageSenior-level Full TimeBangalore, Karnataka, India9h ago
-
Lead Assistant Manager - Azure Data Engineer INR 1500K-2500KAgile | Amazon Redshift | Apache Airflow | Apache Hadoop | Apache HiveSenior-level Full TimeNoida, Uttar Pradesh, India9h ago
-
Executive - Data Governance INR 600K-840KCI/CD | Data Cleansing | Data Denormalization | Data Modeling | Data NormalizationExecutive-level Full TimeBangalore, Karnataka, India9h ago
-
Sr Data Engineer, Python + Spark (Data Federation skillset - Data Lakehouse - Eg: Starburst) - Chennai INR 1500K-2500KAWS EMR | AWS Glue | AWS S3 | Apache Hudi | Apache Iceberg401k retirement plan | Dental insurance | Medical insurance | Paid Holidays | Paid time offSenior-level Full TimeIndia9h ago
-
Lead Software Engineer - Machine Learning INR 3000K-5000KAPI Microservices | Agent systems | Artificial Intelligence | Asynchronous Workflows | Distributed SystemsSenior-level Full TimeBengaluru, India10h ago
-
Assistant Manager- Python Gen AI Engineer INR 1200K-2500KAI Agents | API Development | Agile | Containerization | Data PipelinesWork from client office 5 days per weekMid-level Full TimeBangalore, Karnataka, India10h ago
-
Cloud DevOps - Consultant INR 1500K-3000KAWS | Apache Airflow | Apache Kafka | Apache Spark | AzureMid-level Full TimeGurgaon, Haryana, India10h ago
-
Manager - Azure Data Engineer INR 1500K-2500KAgile | Amazon Redshift | Apache Airflow | Apache Hadoop | Apache HiveMid-level Full TimeNoida, Uttar Pradesh, India11h ago
-
Power BI + Python INR 500K-1500KAzure Data | Azure Data Factory | Azure SQL | Azure Synapse | CI/CDMid-level Full TimeTelangana, India11h ago
-
AI Expert – Generative AI Solutions Developer INR 2000K-5000KAI Foundry | AI Search | AI Studio | AWS Bedrock | Amazon SageMakerSenior-level Full Time Part Timecoimbatore, India12h ago
-
Senior AI Engineer INR 3700K-5000KAPIs | Agentic AI | Azure | CI/CD | Cloud ComputingCollaborative workspaces | Employee resource groups | Flexible working arrangements | Global orientation program | Learning and developmentSenior-level Full TimeMumbai, MH, India12h ago
-
Senior-level Full TimeMumbai, India12h ago
-
Principal Data Engineer INR 2520K-3880KApp Service | Azure | Azure App | Azure App Service | Azure CognitiveSenior-level Full TimeBengaluru Luxor North Tower, India16h ago
-
Data Governance - Technical Specialist INR 3000K-4000KAWS | Azure | Cloud infrastructure | Data Governance | Data LineageHealthcare | Paid volunteering days | Retirement planning | Wellbeing initiativesSenior-level Full TimeIND-Bangalore-A, RMZ Infinity, India16h ago
-
Applications Development (MERN Stack + GenAI) INR 1400K-2525KAgile | JavaScript | Kubernetes | LLMs | MERN StackComprehensive benefits | Holidays | Hybrid work | Professional development | Shared TransportSenior-level Full TimeNoida - Sector 135, India16h ago
-
Senior Python Backend Developer / ML Engineer (IR-534) INR 2500K-5000KA/B | A/B Testing | Abstract Base Class | Async/Await | Asynchronous programmingCareer path | Employment Cooperation | Expert knowledge sharing | Flexible hours | InsuranceSenior-level Full TimeBengaluru, Karnataka, India16h ago
-
Senior Software Engineer (ML, NLP) INR 2500K-4000KAPI Development | AWS | AWS Lambda | Amazon EKS | Amazon SageMakerFlexible work arrangement | Hybrid work environmentSenior-level Full TimeGurgaon - Cyber Park, India17h ago
-
Lead Software Engineer- LLM, Gen AI, ML INR 3000K-4800KAWS | Anthropic | BERT | Cohere | Deep learningFlexible-hybrid work | Performance-based recognition | Training and developmentSenior-level Full TimeChennai - DLF IT Park, India17h ago
-
Principal Engineer, Data Analytics Engineering INR 2500K-4000KAgile | Batch Processing | CI/CD | GitOps | HadoopSenior-level Full TimeBengaluru, KA, India18h ago
-
Senior-level Full TimeIndia-Contractors1d ago
-
Business Intelligence Engineer - Power BI INR 700K-2000KBridge tables | Context Transition | DAX | Data Governance | Data ModelingFlexible work arrangements | Health insurance | Hybrid work model | Life insurance | Paid time offMid-level Full TimeHyderabad, India R1d ago