Data Engineer - WkStrm3 - Syslog-NG to Iceberg
Tasks
- Align schemas between syslog-ng output and Iceberg tables
- Build PySpark Structured Streaming jobs
- Collaborate on Iceberg catalog configuration and partitioning
- Configure syslog-ng JSON logging to rolling files
- Design streaming ingestion pipelines
- Implement error handling with dead letter queues
- Ingest data into Apache Iceberg tables
- Present streaming pipeline results in stakeholder demos
- Read and parse log files incrementally into Iceberg
- Recover from checkpoint corruption
- Review and refine PySpark pipelines for standards, security, and performance
- Scaffold, iterate, test, and document streaming code
- Set up alerting for parse failures
Perks/Benefits
- N/A
Skills/Tech-stack
Amazon S3 | Apache Iceberg | Apache Kafka | Azure Blob | Azure Blob Storage | Blob Storage | Confluent Kafka | Dead Letter Queue | GitHub Copilot | HDFS | JSON | PII Tokenization | Protegrity | PySpark | RFC5424 | Spark File Source | Structured Streaming | Syslog-ng | Unit Testing
Education
N/A