Data Engineering - Intern

Karachi, Pakistan

Apply now Apply later

We are a data-driven organization committed to leveraging technology to solve complex problems and generate actionable insights. As a Data Engineer Intern, you will work on building robust data pipelines, managing data architecture, and ensuring efficient data processing systems. This is a great opportunity for final-year students to gain hands-on experience in the exciting field of data engineering. 

 

Key Responsibilities: 

  • Data Pipeline Development: Assist in designing, building, and maintaining scalable ETL (Extract, Transform, Load) pipelines. 
  • Data Integration: Work on integrating data from various sources (APIs, databases, flat files) into centralized systems or data warehouses. 
  • Database Management: Help in the design, implementation, and optimization of relational and NoSQL databases. 
  • Collaboration: Work closely with data scientists, analysts, and software engineers to ensure seamless data flow and availability. 
  • Data Quality Assurance: Perform data validation, cleansing, and transformation to ensure accuracy and reliability. 
  • Performance Optimization: Identify bottlenecks and optimize data processing pipelines for efficiency. 
  • Documentation: Document data workflows, pipelines, and processes for future reference and scalability. 
  • Research and Development: Explore new tools and technologies to improve data engineering processes. 


Requirements

  • Education: Final-year students pursuing a degree in Computer Science, Data Science, or a related field. 
  • Programming Skills: Proficiency in Python, Java, or Scala, with a focus on data manipulation libraries/frameworks (e.g., Pandas, PySpark). 
  • Database Knowledge: Familiarity with relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra). 
  • Data Processing: Basic understanding of big data tools like Hadoop, Spark, or Kafka. 
  • Version Control: Experience using Git for code management. 
  • Problem-Solving: Strong analytical skills and a methodical approach to troubleshooting issues. 
  • Passion for Data: A keen interest in working with data systems and solving real-world data challenges. 

Preferred Skills (Not mandatory): 

  • Experience with cloud platforms like AWS, GCP, or Azure (e.g., S3, BigQuery, or Redshift). 
  • Familiarity with data warehouse tools like Snowflake or Apache Hive. 
  • Basic knowledge of containerization tools like Docker. 
  • Exposure to workflow orchestration tools (e.g., Apache Airflow, Prefect). 
  • Understanding of data visualization tools for collaboration with analysts (e.g., Tableau, Power BI). 
  • Previous project or internship experience in data engineering (academic projects are welcome). 

Benefits

What We Offer: 


  • Mentorship from experienced data engineers and industry experts. 
  • Opportunities to work on real-world data projects. 
  • Access to modern data tools and technologies. 
  • Flexible work hours (if applicable). 
  • A collaborative and supportive work environment. 
  • Potential for a full-time offer upon successful completion of the internship. 

Other Information's: 
  • Shift: Night
  • Timings: 9PM to 6AM
  • Days: Mon to Fri
  • Location: PECHS Block 6, Shahrah e Faisal, Karachi


Apply now Apply later
Job stats:  3  0  0
Category: Engineering Jobs

Tags: Airflow APIs Architecture AWS Azure Big Data BigQuery Cassandra Computer Science Data pipelines Data quality Data visualization Data warehouse Docker Engineering ETL GCP Git Hadoop Java Kafka MongoDB MySQL NoSQL Pandas Pipelines PostgreSQL Power BI PySpark Python RDBMS Redshift Research Scala Snowflake Spark Tableau

Perks/benefits: Flex hours

Region: Asia/Pacific
Country: Pakistan

More jobs like this