Implementation Challenges for Nastaliq Character Recognition


Sattar, Sohail A., Haque, Shamsul, Pathan, Mahmood K. and Gee, Quintin (2008) Implementation Challenges for Nastaliq Character Recognition. At International Multi Topic Conference (IMTIC'08), Pakistan. 11 - 12 Apr 2008. (Submitted).

Description/Abstract

Character recognition in cursive scripts and in handwritten Latin script has recently attracted researchers’ attention, and some research has been done in this area. Optical character recognition (OCR) is the translation of optically scanned bitmaps of printed or written text into digitally editable data files. OCR systems developed for many world languages are already in use, but none exists for Urdu Nastaliq, a calligraphic adaptation of the Arabic script, just as Jawi is for Malay. Urdu Nastaliq has 39 characters against Arabic's 28. Each character has 2-4 different shapes according to its position in the word: initial, medial, final and isolated. In Nastaliq, inter-word and intra-word overlapping makes optical recognition more complex; character recognition of the Latin script is easier by comparison. This paper reports research on Urdu Nastaliq OCR, discusses the challenges and suggests a new solution for its implementation.

Item Type: Conference or Workshop Item (Paper)
Additional Information: Event Dates: 11-12 April 2008
Venue - Dates: International Multi Topic Conference (IMTIC'08), Pakistan, 2008-04-11 - 2008-04-12
Organisations: Electronics & Computer Science
ePrint ID: 266510
Date: July 2008 (Submitted)
Date Deposited: 05 Aug 2008 08:56
Last Modified: 17 Apr 2017 19:02
URI: http://eprints.soton.ac.uk/id/eprint/266510
