Data Engineer - WkStrm3 - Syslog-NG to Iceberg
Tasks
- Align log schemas to Iceberg table definitions
- Build Confluent Kafka to Iceberg ingestion using supported APIs
- Collaborate on Iceberg catalog configuration and partition strategies
- Configure syslog-ng to write JSON logs to rolling files
- Design streaming ingestion pipelines
- Handle parsing errors with a dead-letter queue
- Implement syslog-ng to Iceberg batch ingestion pipelines
- Ingest data into Apache Iceberg tables
- Parse JSON logs with the Spark Structured Streaming file source
- Present pipeline results in a stakeholder demo
- Recover from checkpoint corruption
- Review AI-generated PySpark code for security and performance
- Set up alerting on parse failures
- Write unit tests for field mapping and ingestion idempotency
Perks/Benefits
- N/A
Skills/Tech-stack
Apache Iceberg | Apache Kafka | Apache Spark | Azure Blob | Azure Blob Storage | Blob Storage | Dead Letter Queue | GitHub Copilot | HDFS | JSON | Linux | PII Tokenization | Protegrity | PySpark | RFC5424 | S3 | Structured Streaming | Syslog-ng | Unit Testing
Education
N/A