The principal aim of this paper is to make a review of main statistical methods for classifying documents that could be easily adapted in the context of Web document retrieval. After presenting the most popular methods of classification we will also define the most accurate indicators for assessment of classifiers performance. Thus we will refer to the recall, precision, fscore, sensitivity and specificity. We will also describe how these indicators can be calculated in the context of Web documents.
Information retrieval, Classification, Naïve Bayes, Evaluation metrics.
Daniel Volovici, Macarie Breazu, Gabriel Dacian Curea, Daniel Ionel Morariu, "Statistical Methods for Performance Evaluation of WEB Document Classification", Studies in Informatics and Control, ISSN 1220-1766, vol. 19(2), pp. 169-176, 2010. https://doi.org/10.24846/v19i2y201007