Using KCCA for Japanese-English cross-language information retrieval and classification


Li, Yaoyong and Shawe-Taylor, John (2005) Using KCCA for Japanese-English cross-language information retrieval and classification. Journal of Intelligent Information Systems, tba, (tba)

Download

[img] PDF
Download (174Kb)

Description/Abstract

Kernel Canonical Correlation Analysis (KCCA) is a method of correlating linear relationship between two variables in a kernel dened feature space. A machine learning algorithm based on KCCA is studied for cross-language information retrieval. We apply the algorithm in Japanese-English cross-language information retrieval. The results are quite encouraging and are signicantly better than those obtained by other state of the art methods. Computational complexity is an important issue when applying KCCA to large dataset as in information retrieval. We experimentally evaluate several methods to alleviate the problem of applying KCCA to large datasets. We also investigate cross-language document classication using KCCA as well as other methods. Our results show that it is feasible to use a classier learned in one language to classify the documents in other languages.

Item Type: Article
ISSNs: 0925990215737675
Keywords: KCCA cross-language information retrieval algorithm Japanese English kernel
Divisions: Faculty of Physical and Applied Science > Electronics and Computer Science
Item ID: 260786
Date Deposited: 21 Apr 2005
Last Modified: 02 Mar 2012 13:20
Contributors: Li, Yaoyong (Author)
Shawe-Taylor, John (Author)
Date: 2005
Status: Published
Publisher: Springer
Further Information:Google Scholar
URI: http://eprints.soton.ac.uk/id/eprint/260786

Actions (login required)

View Item View Item