Web Mining

"Laurea Magistrale" in Computer Science
Academic year 2008-2009, second semester

[ Previous academic year (2007/2008) ]

Teachers


Schedule

  • Tuesday 1430-1630, room 106
  • Thursday 1330-1530, room 105

Program

  1. Web crawling
  2. Web page indexing
  3. Information retrieval
  4. Unsupervised learning: clustering
  5. PageRank and HITS
  6. Search engine attack and defense strategies
See the detailed program below

Exam

Exams will consist of a written test (exercises) and an oral test (discussion of theory and lab assignments).

Lab assignments

Assignments are compulsory; collaboration is accepted, but every student must deliver his own material. Timely delivery is taken into account at the exam.

To hand in an assignment, your email (to both teachers, see addresses above) should contain:

  • URL of requested document or targzipped source code (NOT attached to the email, but available in the student's web space);
  • Instructions for compilation and execution.
  • Name of the student.
AssignedDueSubject
2009-02-192009-03-05Simple web crawler
2008-03-052008-03-19Word index

Bibliography

Official Textbook

    SOUMEN CHAKRABARTI
    Mining the Web - Discovering knowledge from hypertext data
    Morgan Kaufmann - Elsevier, 2003.

Course materials

Slide handouts (4 slides per sheet): Slide contents collected in a unique document ("article" format): Exercises Text of the written exams: Sample code:

Program


Page maintained by Mauro Brunato