Hierarchically classifying documents with multiple labels

Andrew Mayne,Russell Perry

doi:10.1109/cidm.2009.4938640

Hierarchically classifying documents with multiple labels

Andrew Mayne, Russell Perry

https://doi.org/10.1109/cidm.2009.4938640

Copy DOI

Publication Date: Mar 1, 2009

Citations: 20

Affiliation: University of Oxford

#Hierarchical Classifier #Weka Toolkit + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper describes the evaluation of a hierarchical classifier for classifying multi-labeled documents organized in a two-level taxonomy. The hierarchical classifier consists of a tree of independent naive Bayes classifiers, with output probabilities from parent classifiers propagated to child classifiers as additional features. Each classifier uses Bi-Normal Feature Separation for word feature selection. Experiments were performed using the Weka Toolkit [7] adapted to deal with multi-labeled documents. The hierarchical classifier accuracy marginally out-performed a set of independent binary classifiers trained to classify documents for each class in the taxonomy.

Full Text