Automatic Arabic Text Classification
Automatic Arabic Text Classification
Automated document classification is an important text mining task especially with the rapid growth of the number of online documents present in Arabic language. Text classification aims to automatically assign the text to a predefined category based on linguistic features. Such a process has different useful applications including, but not restricted to, e-mail spam detection, web page content filtering, and automatic message routing. This paper presents the results of experiments on document classification achieved on seven different Arabic corpora using statistical methodology. The performance of two popular classification algorithms in classifying the aforementioned corpora has been evaluated.
Al-Harbi, S
8cea09f6-3898-49bc-8185-5ff893d2a05c
Almuhareb, A
7e902bc1-bc0c-4413-9dae-74140745d1b8
Al-Thubaity, A
c3969617-310a-40f0-9ad5-824fdbfc3a3c
Khorsheed, M. S.
380be73d-eb54-4298-98b2-066d70f7f487
Al-Rajeh, A
64acb5ae-6e8e-44a0-9afa-edd1658cd0cd
March 2008
Al-Harbi, S
8cea09f6-3898-49bc-8185-5ff893d2a05c
Almuhareb, A
7e902bc1-bc0c-4413-9dae-74140745d1b8
Al-Thubaity, A
c3969617-310a-40f0-9ad5-824fdbfc3a3c
Khorsheed, M. S.
380be73d-eb54-4298-98b2-066d70f7f487
Al-Rajeh, A
64acb5ae-6e8e-44a0-9afa-edd1658cd0cd
Al-Harbi, S, Almuhareb, A, Al-Thubaity, A, Khorsheed, M. S. and Al-Rajeh, A
(2008)
Automatic Arabic Text Classification.
Proceedings of The 9th International Conference on the Statistical Analysis of Textual Data, Lyon-, France.
Record type:
Conference or Workshop Item
(Paper)
Abstract
Automated document classification is an important text mining task especially with the rapid growth of the number of online documents present in Arabic language. Text classification aims to automatically assign the text to a predefined category based on linguistic features. Such a process has different useful applications including, but not restricted to, e-mail spam detection, web page content filtering, and automatic message routing. This paper presents the results of experiments on document classification achieved on seven different Arabic corpora using statistical methodology. The performance of two popular classification algorithms in classifying the aforementioned corpora has been evaluated.
Text
Arabic-Classification.pdf
- Other
More information
Published date: March 2008
Venue - Dates:
Proceedings of The 9th International Conference on the Statistical Analysis of Textual Data, Lyon-, France, 2008-03-01
Organisations:
Electronics & Computer Science, Southampton Wireless Group
Identifiers
Local EPrints ID: 272254
URI: http://eprints.soton.ac.uk/id/eprint/272254
PURE UUID: d9af8b3d-f7a5-402c-9db0-10a438968fb2
Catalogue record
Date deposited: 05 May 2011 18:21
Last modified: 14 Mar 2024 09:50
Export record
Contributors
Author:
S Al-Harbi
Author:
A Almuhareb
Author:
A Al-Thubaity
Author:
M. S. Khorsheed
Author:
A Al-Rajeh
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics