Kernel-Based Learning of Hierarchical Multilabel Classification Models


Rousu, J., Saunders, C., Szedmak, S. and Shawe-Taylor, J. (2006) Kernel-Based Learning of Hierarchical Multilabel Classification Models. Journal of Machine Learning Research, 7, 1601-1626.

Download

[img] PDF
Download (188Kb)

Description/Abstract

We present a kernel-based algorithm for hierarchical text classification where the documents are allowed to belong to more than one category at a time. The classification model is a variant of the Maximum Margin Markov Network framework, where the classification hierarchy is represented as a Markov tree equipped with an exponential family defined on the edges. We present an efficient optimization algorithm based on incremental conditional gradient ascent in single-example subspaces spanned by the marginal dual variables. The optimization is facilitated with a dynamic programming based algorithm that computes best update directions in the feasible set. Experiments show that the algorithm can feasibly optimize training sets of thousands of examples and classification hierarchies consisting of hundreds of nodes. Training of the full hierarchical model is as efficient as training independent SVM-light classifiers for each node. The algorithm's predictive accuracy was found to be competitive with other recently introduced hierarchical multi-category or multilabel classification learning algorithms.

Item Type: Article
Keywords: kernel methods, text classification
Divisions: Faculty of Physical Sciences and Engineering > Electronics and Computer Science
ePrint ID: 262947
Date Deposited: 07 Sep 2006
Last Modified: 27 Mar 2014 20:06
Publisher: Microtome Publishing
Further Information:Google Scholar
ISI Citation Count:39
URI: http://eprints.soton.ac.uk/id/eprint/262947

Actions (login required)

View Item View Item