Implementation Challenges for Nastaliq Character Recognition


Sattar, Sohail A., Haque, Shamsul, Pathan, Mahmood K. and Gee, Quintin (2008) Implementation Challenges for Nastaliq Character Recognition. In, International Multi Topic Conference (IMTIC'08), Jamshoro, Sindh, Pakistan, 11 - 12 Apr 2008. (Submitted).

Description/Abstract

Character recognition in cursive scripts or handwritten Latin script has recently attracted researchers’ attention, and some research has been done in this area. Optical character recognition (OCR) is the translation of optically scanned bitmaps of printed or written text into digitally editable data files. OCR systems developed for many of the world's languages are already in use, but none exists for Urdu Nastaliq, a calligraphic adaptation of the Arabic script, just as Jawi is for Malay. Urdu Nastaliq has 39 characters against 28 in Arabic. Each character has two to four different shapes according to its position in the word: initial, medial, final or isolated. In Nastaliq, inter-word and intra-word overlapping makes optical recognition more complex, whereas character recognition of the Latin script is relatively easy. This paper reports research on Urdu Nastaliq OCR, discusses the challenges and suggests a new solution for its implementation.

Item Type: Conference or Workshop Item (Paper)
Additional Information: Event Dates: 11-12 April 2008
Divisions: Faculty of Physical Sciences and Engineering > Electronics and Computer Science
ePrint ID: 266510
Date Deposited: 05 Aug 2008 08:56
Last Modified: 27 Mar 2014 20:12
ISI Citation Count: 1
URI: http://eprints.soton.ac.uk/id/eprint/266510
