Past Issues

Studies in Informatics and Control
Vol. 19, No. 2, 2010

Statistical Methods for Performance Evaluation of WEB Document Classification

Daniel Volovici, Macarie Breazu, Gabriel Dacian Curea, Daniel Ionel Morariu
Abstract

The principal aim of this paper is to make a review of main statistical methods for classifying documents that could be easily adapted in the context of Web document retrieval. After presenting the most popular methods of classification we will also define the most accurate indicators for assessment of classifiers performance. Thus we will refer to the recall, precision, fscore, sensitivity and specificity. We will also describe how these indicators can be calculated in the context of Web documents.

Keywords

Information retrieval, Classification, Naïve Bayes, Evaluation metrics.

View full article