KACST Arabic Text Classification Project: Overview and Preliminary Results


Althubaity, A., Almuhareb, A., Alharbi, S., Al-Rajeh, A. and Khorsheed , M. (2008) KACST Arabic Text Classification Project: Overview and Preliminary Results. In, Proceedings of The 9th IBIMA conference on Information Management in Modern Organizations, Marrakech, Morocco,

Download

[img] PDF
Download (107Kb)

Description/Abstract

Electronically formatted Arabic free-texts can be found in abundance these days on the World Wide Web, often linked to commercial enterprises and/or government organizations. Vast tracts of knowledge and relations lie hidden within these texts, knowledge that can be exploited once the correct intelligent tools have been identified and applied. For example, text mining may help with text classification and categorization. Text classification aims to automatically assign text to a predefined category based on identifiable linguistic features. Such a process has different useful applications including, but not restricted to, E-Mail spam detection, web pages content filtering, and automatic message routing. In this paper an overview of King Abdulaziz City for Science and Technology (KACST) Arabic Text Classification Project will be illustrated along with some preliminary results. This project will contribute to the better understanding and elaboration of Arabic text classification techniques.

Item Type: Conference or Workshop Item (Paper)
Divisions: Faculty of Physical Sciences and Engineering > Electronics and Computer Science > Comms, Signal Processing & Control
Faculty of Physical Sciences and Engineering > Electronics and Computer Science
ePrint ID: 272255
Date Deposited: 05 May 2011 18:25
Last Modified: 27 Mar 2014 20:17
Further Information:Google Scholar
URI: http://eprints.soton.ac.uk/id/eprint/272255

Actions (login required)

View Item View Item

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics