402201 - Data Engineering
Credit Hours

3

Pre-requisite

402101

Co-requisite

-

Distribution

3 + 0

This course introduces students with the vital role of data engineering in data science applications. Starting with the fundamental practice of data engineering, i.e., data lifecycle management, the course familiarizes students with the lifecycle of data and how to perform its key tasks, namely, the process of extracting, loading, and transforming data (ETL/ELT). Different resources of data are investigated, leading to different data formats, and accordingly, several data engineering tools are presented. Students will be able to collect, pre-process, and prepare datasets that are ready to be explored and used in data science projects and applications.