May 19, 2024  
2022-2023 Graduate Catalog 
    
2022-2023 Graduate Catalog [ARCHIVED CATALOG]

CSCI 5388 - Big Data Analytics

Credit Hours: 3 Lecture: 3

Fee Type: Special
Fee ($): 40
This course teaches students about the core technologies to manipulate, store and especially to analyze big data. Students will acquire essential skills required for a typical data science project. In this class, we couple hands-on labs/projects with lectures/readings. The hands-on activities familiarize students with Hadoop for storage (HDFS) and Spark as computing engine. Students will learn to apply typical machine learning techniques (using Spark MLlib) and some other analytics techniques such as graph processing (using Spark GraphX) to big data. Python is the main programming language for this course.

Prerequisites: CSCI 4333  or equivalent and knowledge of Python programming.