The University of Southampton
University of Southampton Institutional Repository

Learning Hierarchical Multi-Category Text Classification Models

Learning Hierarchical Multi-Category Text Classification Models
Learning Hierarchical Multi-Category Text Classification Models
We present a kernel-based algorithm for hierarchical text classification where the documents are allowed to belong to more than one category at a time. The classification model is a variant of the Maximum Margin Markov Network framework, where the classification hierarchy is represented as a Markov tree equipped with an exponential family defined on the edges. We present an efficient optimization algorithm based on incremental conditional gradient ascent in single-example subspaces spanned by the marginal dual variables. Experiments show that the algorithm can feasibly optimize training sets of thousands of examples and classification hierarchies consisting of hundreds of nodes. The algorithm's predictive accuracy is competitive with other recently introduced hierarchical multi-category or multilabel classification learning algorithms.
Rousu, J.
4ef21914-6600-483b-9995-ff0283dad7b3
Saunders, C.
38a38da8-1eb3-47a8-80bc-b9cbb43f26e3
Szedmak, S.
993ca93f-c7c7-4d0b-a5f7-374eb0401add
Shawe-Taylor, J.
c32d0ee4-b422-491f-8c28-78663851d6db
Rousu, J.
4ef21914-6600-483b-9995-ff0283dad7b3
Saunders, C.
38a38da8-1eb3-47a8-80bc-b9cbb43f26e3
Szedmak, S.
993ca93f-c7c7-4d0b-a5f7-374eb0401add
Shawe-Taylor, J.
c32d0ee4-b422-491f-8c28-78663851d6db

Rousu, J., Saunders, C., Szedmak, S. and Shawe-Taylor, J. (2005) Learning Hierarchical Multi-Category Text Classification Models. 22nd International Conference on Machine Learning (ICML 2005), Bonn, Germany. 07 - 11 Aug 2005.

Record type: Conference or Workshop Item (Paper)

Abstract

We present a kernel-based algorithm for hierarchical text classification where the documents are allowed to belong to more than one category at a time. The classification model is a variant of the Maximum Margin Markov Network framework, where the classification hierarchy is represented as a Markov tree equipped with an exponential family defined on the edges. We present an efficient optimization algorithm based on incremental conditional gradient ascent in single-example subspaces spanned by the marginal dual variables. Experiments show that the algorithm can feasibly optimize training sets of thousands of examples and classification hierarchies consisting of hundreds of nodes. The algorithm's predictive accuracy is competitive with other recently introduced hierarchical multi-category or multilabel classification learning algorithms.

Text
HM3-ICML.pdf - Other
Download (211kB)
Other
HM3-ICML.ps - Other
Download (386kB)

More information

Published date: 2005
Additional Information: Event Dates: 7-11 August, 2005
Venue - Dates: 22nd International Conference on Machine Learning (ICML 2005), Bonn, Germany, 2005-08-07 - 2005-08-11
Organisations: Electronics & Computer Science

Identifiers

Local EPrints ID: 261071
URI: http://eprints.soton.ac.uk/id/eprint/261071
PURE UUID: 5356570f-ee76-44af-8d1a-eeb74b487d4e

Catalogue record

Date deposited: 18 Jul 2005
Last modified: 14 Mar 2024 06:47

Export record

Contributors

Author: J. Rousu
Author: C. Saunders
Author: S. Szedmak
Author: J. Shawe-Taylor

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×