Senior Data Automation Engineer
EG-Cairo, Egypt (Al Emdad & Al Tamween)
Arrow Electronics
As a Data Automation Engineer, you will play a crucial role in developing and maintaining automated data processing systems. You will work closely with data analysts and other stakeholders to design, implement, and optimize data pipelines, ensuring the efficient and accurate flow of data throughout the organization.
What You'll Do:
- Develop and implement automated data pipelines for ingesting, processing, and storing structured and unstructured data.
- Apply AI solutions to analyze, extract, compare, and summarize structured and unstructured data.
- Use open-source AI tools and understand how they fit into each stage of data automation.
- Work with AI models through platforms such as Hugging Face and Fireworks.
- Apply established data collection methods.
- Build and maintain rules engines.
- Integrate data from various sources through real-time and batch mechanisms, and perform transformations to ensure consistency, accuracy, and reliability.
- Implement data quality checks and monitoring mechanisms to identify and address issues proactively.
- Write code and scripts for automation of data processes using languages such as Python, Java, or Scala.
- Leverage ETL (Extract, Transform, Load) tools and frameworks (Apache Spark, AWS Glue) for efficient data processing.
- Collaborate with other intelligent data teams.
- Ensure seamless data flow for machine learning and analytics applications.
- Implement monitoring systems to track data pipeline performance and identify areas for optimization.
- Continuously optimize data processes for efficiency and scalability.
- Document data processes, system architecture, and codebase for knowledge sharing and future reference.
- Keep documentation up to date with any changes or enhancements to the data infrastructure.
- Implement and adhere to data security and compliance standards.
- Collaborate with security teams to ensure data protection and privacy.
You're a Perfect Fit If You Have:
- 3+ years of experience in data engineering and automation.
- Strong understanding of data pipelines and data lifecycle management.
- Proficiency in scripting languages (Python).
- Good knowledge of AI solutions for document understanding.
- Good knowledge of AI models and how to use them.
- Experience with data automation tools (Airflow).
- Excellent analytical and problem-solving skills.
- Strong communication and collaboration skills to work effectively across teams.
- Knowledge of cloud platforms (AWS, Azure) for data processing.
- Knowledge of data warehousing and data quality methodologies.
- Experience with data visualization tools (Tableau, Power BI).