University of Trento
Themis Palpanas
Massive Data Analytics Graduate Course
Class Hours
Please always check the Announcements page for news regarding the course.
Course Description

We are now witnessing an explosion in the amount of data being produced. It has become much easier than in the past to produce and gather these data, but increasingly difficult to process and analyze them because of their sheer size. How can we analyze data when they are coming in at rates of multiple GB or TB per day? This problem is relevant to more and more organizations and businesses around the world, and is at the core of our research.

This is an introductory course on topics relevant to data mining and massive data analytics. Data mining is the process of analyzing large amounts of data and discovering new, non-obvious knowledge among these data. Data mining is very popular in the business world, because it helps businesses take advantage of the huge data collections they have.

We will talk about some basic data mining techniques, such as association rules, sequential patterns, clustering, and classification. We will discuss different flavors of these techniques, and comment on their strong and weak points. Finally, we will also talk about techniques that allow us to analyze massive data collections, such as algorithms for processing and analyzing streaming data (that is, data that are continuously generated).

The course will be taught in English, based on slides and published research papers.

Prerequisites: databases or data management, basic knowledge of probability and statistics

Grading Scheme

10% Class Participation
15% Assignments
60% Project Report
15% Final Oral Examination

Course Policies