Given a set of web-based discussions on various topics, written in various languages, the classification problem consists of determining, for each discussion (and its sub-posts), what topic it covers and what language it is written in. In this project, the students are to investigate whether the Naive Bayes algorithm is applicable to classifying web-based discussions. The students will be given a training set of articles and a large corpus of articles to investigate.
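
To make the task concrete, the following is a minimal sketch of a multinomial Naive Bayes classifier with add-one (Laplace) smoothing, trained twice on the same texts: once with topic labels and once with language labels. Everything in it is an assumption made for illustration and not part of the project material: the `NaiveBayes` class, the whitespace tokenizer, the smoothing parameter `alpha`, the choice to skip words never seen in training, and the four-sentence toy training set.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Simplistic assumption: lowercase and split on whitespace.
    return text.lower().split()


class NaiveBayes:
    """Multinomial Naive Bayes with add-alpha smoothing (illustrative sketch)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.class_docs = Counter()              # documents seen per class
        self.word_counts = defaultdict(Counter)  # word counts per class
        self.vocab = set()                       # all words seen in training

    def fit(self, docs, labels):
        for text, label in zip(docs, labels):
            self.class_docs[label] += 1
            for tok in tokenize(text):
                self.word_counts[label][tok] += 1
                self.vocab.add(tok)
        return self

    def predict(self, text):
        total_docs = sum(self.class_docs.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_docs.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            for tok in tokenize(text):
                if tok not in self.vocab:
                    continue  # assumption: ignore words unseen in training
                count = self.word_counts[label][tok]
                score += math.log(
                    (count + self.alpha) / (total_words + self.alpha * vocab_size)
                )
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy training set: (text, topic, language).
train = [
    ("the team won the football match", "sports", "en"),
    ("the election results were announced today", "politics", "en"),
    ("das team gewann das fussball spiel", "sports", "de"),
    ("die wahl ergebnisse wurden heute bekannt", "politics", "de"),
]
texts = [t for t, _, _ in train]

# One classifier per question: topic and language.
topic_clf = NaiveBayes().fit(texts, [topic for _, topic, _ in train])
lang_clf = NaiveBayes().fit(texts, [lang for _, _, lang in train])

print(topic_clf.predict("football match today"), lang_clf.predict("football match today"))
# prints "sports en"
print(topic_clf.predict("die wahl heute"), lang_clf.predict("die wahl heute"))
# prints "politics de"
```

Training two independent classifiers keeps the sketch simple; whether word-level features are adequate for language identification, or whether sub-posts should be classified separately from their parent discussion, is part of what the project is meant to investigate.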