Classification of web-based discussions using Naive Bayes

General Information

Title: Classification of web-based discussions using Naive Bayes.
Author(s): Ekaterina Soukhikh.
Published date: December 2006.
Published at: Web-Mining and Data Analysis 2006.

Abstract


Given a set of web-based discussions on various topics, written in various languages, the classification problem consists of determining, for each discussion (and its sub-posts), which topic it covers and which language it is written in. In this project the students are to investigate whether the Naive Bayes algorithm is applicable to classifying web-based discussions. The students will be given a training set of articles and a large corpus of articles to investigate.
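
To illustrate the kind of classifier the abstract refers to, the following is a minimal sketch of a multinomial Naive Bayes text classifier, trained once for topic and once for language. It is not taken from the paper. The whitespace tokenizer, the add-one (Laplace) smoothing, the function names, and the toy training posts are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Lowercase whitespace tokenization; an assumption made for brevity.
    return text.lower().split()


def train(labelled_posts):
    """Collect per-label post counts and word counts from (text, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, label in labelled_posts:
        label_counts[label] += 1
        for word in tokenize(text):
            word_counts[label][word] += 1
            vocabulary.add(word)
    return label_counts, word_counts, vocabulary


def classify(text, model):
    """Return the label with the highest log posterior under Naive Bayes."""
    label_counts, word_counts, vocabulary = model
    total_posts = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label in label_counts:
        # Log prior: fraction of training posts carrying this label.
        score = math.log(label_counts[label] / total_posts)
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocabulary:
                continue  # words never seen in training are ignored
            # Add-one (Laplace) smoothing avoids zero probabilities.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy training data, not from the project's corpus.
TOPIC_TRAINING = [
    ("the team won the football match", "sport"),
    ("the striker scored a late goal", "sport"),
    ("parliament passed the new tax law", "politics"),
    ("the minister proposed a budget vote", "politics"),
]
LANGUAGE_TRAINING = [
    ("it rains a lot today", "english"),
    ("this is a good discussion", "english"),
    ("det regner mye i dag", "norwegian"),
    ("dette er en god diskusjon", "norwegian"),
]

topic_model = train(TOPIC_TRAINING)
language_model = train(LANGUAGE_TRAINING)
```

With these toy models, a call such as classify("a goal in the football match", topic_model) scores each topic and returns the most probable one. The same function applied with language_model handles the language question. A real experiment would need a far larger training set and a more careful tokenizer.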

The author of this document is:
Morten Goodwin
E-mail address is:
morten.goodwin ASCII 64 tingtun.no
Phone is:
+47 95 24 86 79
