Text Extraction from Web Images Based on Human Perception and Fuzzy Inference

Antonacopoulos, Apostolos and Karatzas, Dimosthenis (2001) Text Extraction from Web Images Based on Human Perception and Fuzzy Inference. In: First International Workshop on Web Document Analysis (WDA2001), Seattle, USA. PRImA Press, pp. 35-38.




There is a significant need to extract and recognise the semantically important text contained in images on Web pages. This paper proposes a new approach to text extraction from this special class of images. The method attempts to emulate, more closely than previous approaches, the way humans perceive colour differences in order to differentiate between text and background regions. Pixels of similar colour (as humans see it) are merged into components, and a fuzzy inference mechanism (using connectivity and colour distance features) is devised to group components into larger character-like regions.
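
The two stages named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the colour metric, the feature definitions, or any thresholds, so everything below is an assumption made for illustration. In particular, plain Euclidean distance in RGB stands in for a perceptually motivated colour difference, and the function names (`label_components`, `merge_propensity`) and the membership breakpoints are hypothetical.

```python
from collections import deque


def colour_distance(a, b):
    """Euclidean distance between two RGB triples.

    Assumption: a simple stand-in for a perceptual colour difference.
    """
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def label_components(image, threshold):
    """Merge 4-connected pixels of similar colour into labelled components.

    `image` is a list of rows of RGB triples. Two neighbouring pixels join
    the same component when their colour distance is at most `threshold`.
    Returns the label grid and the number of components found.
    """
    height, width = len(image), len(image[0])
    labels = [[-1] * width for _ in range(height)]
    count = 0
    for y in range(height):
        for x in range(width):
            if labels[y][x] != -1:
                continue
            # Breadth-first flood fill from an unlabelled seed pixel.
            labels[y][x] = count
            queue = deque([(y, x)])
            while queue:
                cy, cx = queue.popleft()
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if not (0 <= ny < height and 0 <= nx < width):
                        continue
                    if labels[ny][nx] != -1:
                        continue
                    if colour_distance(image[cy][cx], image[ny][nx]) <= threshold:
                        labels[ny][nx] = count
                        queue.append((ny, nx))
            count += 1
    return labels, count


def falling_membership(value, low, high):
    """Fuzzy membership: 1 at or below `low`, 0 at or above `high`, linear between."""
    if value <= low:
        return 1.0
    if value >= high:
        return 0.0
    return (high - value) / (high - low)


def merge_propensity(colour_dist, connectivity):
    """Degree to which two components should be grouped.

    Single hypothetical rule: IF colours are similar AND components are well
    connected THEN merge. The fuzzy AND is taken as the minimum. The
    breakpoints (20, 80, 0.1, 0.5) are illustrative assumptions.
    """
    similar = falling_membership(colour_dist, 20.0, 80.0)
    connected = 1.0 - falling_membership(connectivity, 0.1, 0.5)
    return min(similar, connected)


if __name__ == "__main__":
    image = [
        [(0, 0, 0), (5, 5, 5), (255, 255, 255)],
        [(0, 0, 0), (250, 250, 250), (255, 255, 255)],
    ]
    labels, count = label_components(image, threshold=20.0)
    print(labels, count)
    print(merge_propensity(50.0, 0.5))
```

In this sketch, `connectivity` is taken to be a value between 0 and 1, such as the fraction of one component's boundary that touches the other; that definition is also an assumption. A grouping step would then merge pairs of components whose propensity exceeds a chosen cut-off.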

Item Type: Conference or Workshop Item (Paper)
Additional Information: Event Dates: September 2001
Divisions: Faculty of Physical Sciences and Engineering > Electronics and Computer Science
ePrint ID: 263510
Date Deposited: 19 Feb 2007
Last Modified: 27 Mar 2014 20:07
URI: http://eprints.soton.ac.uk/id/eprint/263510
