May 17, 2024  
2023-2024 Graduate Catalog [ARCHIVED CATALOG]

DASC 5433 - Big Data Analytics

Credit Hours: 3
Lecture: 3

Fee Type: Special
Fee ($): 40
This course teaches the core technologies for storing, manipulating, and especially analyzing big data. Students acquire the essential skills required for a typical data science project. The class couples hands-on labs and projects with lectures and readings. The hands-on activities familiarize students with Hadoop for storage (HDFS) and Spark as the computing engine. Students learn to apply typical machine learning techniques (using Spark MLlib) and other analytics techniques, such as graph processing (using Spark GraphX), to big data. Python is the main programming language for this course. Laboratory instruction.

Prerequisites: CSCI 4333, DASC 5333, or equivalent, and knowledge of Python programming.