Automatic Arabic Text Classification


Al-Harbi, S, Almuhareb, A, Al-Thubaity , A, Khorsheed, M. S. and Al-Rajeh, A (2008) Automatic Arabic Text Classification. In, Proceedings of The 9th International Conference on the Statistical Analysis of Textual Data, Lyon-, France,

Download

[img] PDF
Download (283Kb)

Description/Abstract

Automated document classification is an important text mining task especially with the rapid growth of the number of online documents present in Arabic language. Text classification aims to automatically assign the text to a predefined category based on linguistic features. Such a process has different useful applications including, but not restricted to, e-mail spam detection, web page content filtering, and automatic message routing. This paper presents the results of experiments on document classification achieved on seven different Arabic corpora using statistical methodology. The performance of two popular classification algorithms in classifying the aforementioned corpora has been evaluated.

Item Type: Conference or Workshop Item (Paper)
Divisions: Faculty of Physical Sciences and Engineering > Electronics and Computer Science > Comms, Signal Processing & Control
Faculty of Physical Sciences and Engineering > Electronics and Computer Science
ePrint ID: 272254
Date Deposited: 05 May 2011 18:21
Last Modified: 27 Mar 2014 20:17
Further Information:Google Scholar
URI: http://eprints.soton.ac.uk/id/eprint/272254

Actions (login required)

View Item View Item

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics