Classification of web-based discussions using Naive Bayes


General Information

Download Classification of web-based discussions using Naive Bayes as PDF (164 KB).

Title: Classification of web-based discussions using Naive Bayes.
Author(s): Ekaterina Soukhikh.
Published date: December 2006.
Published at: Web-Mining and Data Analysis 2006

Abstract


Given a set of web-based discussions on various topics, written in various languages, the classification problem consists of determining, for each discussion (and its sub-posts), what topic it covers and what language it is written in. In this project, the students are to investigate whether the Naive Bayes algorithm is applicable to classifying web-based discussions. The students will be given a training set of articles and a large corpus of articles to investigate.
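
As an illustration of the kind of classifier the abstract refers to, the following is a minimal sketch of a multinomial Naive Bayes text classifier with add-one (Laplace) smoothing, written in Python. It is not taken from the paper: the function names, the whitespace tokenizer, the smoothing choice, and the toy training sentences are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict

# Toy training set, invented for this sketch (not data from the paper).
TOY_DISCUSSIONS = [
    ("the weather is nice today", "english"),
    ("we discuss the new phone", "english"),
    ("været er fint i dag", "norwegian"),
    ("vi diskuterer den nye telefonen", "norwegian"),
]


def tokenize(text):
    # Assumption: lowercase whitespace tokens are a good enough feature set.
    return text.lower().split()


def train(labelled_docs):
    """Count documents per label and token occurrences per label."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in labelled_docs:
        doc_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocab.add(token)
    return doc_counts, word_counts, vocab


def classify(text, model):
    """Return the label with the highest log posterior for the given text."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior: fraction of training documents carrying this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # tokens never seen in training are ignored
            # Log likelihood with add-one smoothing over the vocabulary.
            count = word_counts[label][token]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TOY_DISCUSSIONS)
print(classify("the phone is new", model))
print(classify("vi er i dag", model))
```

The same two functions could in principle be used for topic classification by training on (text, topic) pairs instead of (text, language) pairs. Whether this works well on real discussion threads is the question the project sets out to investigate.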

Author of this document: Morten Goodwin
E-mail: morten.goodwin __at__ tingtun.no
Phone: +47 95 24 86 79
