Statistical Methods for Performance Evaluation of WEB Document Classification

Daniel VOLOVICI 
‘Lucian Blaga’ University of Sibiu,
10, Victoriei Blv., 550024, Sibiu, Romania

Macarie BREAZU
‘Lucian Blaga’ University of Sibiu,
10, Victoriei Blv., 550024, Sibiu, Romania

Gabriel Dacian CUREA
‘Lucian Blaga’ University of Sibiu,
10, Victoriei Blv., 550024, Sibiu, Romania

Daniel Ionel MORARIU
‘Lucian Blaga’ University of Sibiu,
10, Victoriei Blv., 550024, Sibiu, Romania

Abstract: The principal aim of this paper is to review the main statistical methods for document classification that can be easily adapted to Web document retrieval. After presenting the most popular classification methods, we define the most accurate indicators for assessing classifier performance: recall, precision, F-score, sensitivity and specificity. We also describe how these indicators can be calculated in the context of Web documents.

Keywords: Information retrieval, Classification, Naïve Bayes, Evaluation metrics.
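
As an illustration of the indicators named in the abstract, the following is a minimal sketch that computes them from the four counts of a binary confusion matrix. It uses the standard textbook definitions, which may differ in detail from the formulation given in the full paper; the function name binary_metrics and the example counts are hypothetical and chosen only for demonstration.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute common evaluation indicators from confusion-matrix counts.

    tp, fp, fn, tn are the numbers of true positives, false positives,
    false negatives and true negatives for one class (hypothetical inputs).
    Standard definitions are assumed; a ratio with a zero denominator
    is reported as 0.0.
    """
    def ratio(num, den):
        return num / den if den else 0.0

    precision = ratio(tp, tp + fp)      # share of retrieved documents that are relevant
    recall = ratio(tp, tp + fn)         # share of relevant documents that are retrieved
    specificity = ratio(tn, tn + fp)    # share of irrelevant documents correctly rejected
    f_score = ratio(2 * precision * recall, precision + recall)  # balanced F-score (F1)

    return {
        "precision": precision,
        "recall": recall,
        "sensitivity": recall,          # sensitivity is another name for recall
        "specificity": specificity,
        "f_score": f_score,
    }


if __name__ == "__main__":
    # Illustrative counts only, not data from the paper.
    for name, value in binary_metrics(tp=40, fp=10, fn=20, tn=30).items():
        print(f"{name}: {value:.4f}")
```

For a multi-class Web document collection, one common approach is to compute these counts per class in a one-versus-rest fashion and then average the resulting indicators; whether the paper follows that approach is not stated in the abstract.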

CITE THIS PAPER AS:
Daniel VOLOVICI, Macarie BREAZU, Dacian CUREA, Daniel Ionel MORARIU, Statistical Methods for Performance Evaluation of WEB Document Classification, Studies in Informatics and Control, ISSN 1220-1766, vol. 19 (1), pp. 169-176, 2010.