402451 - Big Data
Credit Hours: 3
Pre-requisite: 402201
Co-requisite: -
Distribution: 3 + 0

This course gives an overview of Big Data applications and market trends. It introduces fundamental platforms and tools such as MapReduce, the Hadoop ecosystem, Spark, the H2O framework, and Apache Storm. It then covers several data storage methods and how to upload, distribute, and process the data they hold, including HDFS, HBase, Pig, Hive, document databases, and graph databases. The course goes on to explore different ways of running data analytics algorithms on different platforms, and then introduces visualization issues for Big Data. It also provides a first hands-on experience in handling and analyzing large, complex structured, semi-structured, and unstructured data. On completion, students will have the fundamental knowledge of Big Data needed to handle various real-world challenges.