Text Extraction from Web Images Based on Human Perception and Fuzzy Inference
Antonacopoulos, Apostolos and Karatzas, Dimosthenis (2001) Text Extraction from Web Images Based on Human Perception and Fuzzy Inference. In First International Workshop on Web Document Analysis (WDA2001), Seattle, USA. PRImA Press, pp. 35-38.
Description/Abstract
There is a significant need to extract and recognise the semantically important text contained in images on Web pages. This paper proposes a new approach to text extraction from this special class of images. The method attempts to emulate, more closely than before, the way humans perceive colour differences in order to differentiate between text and background regions. Pixels of similar colour (as humans see it) are merged into components, and a fuzzy inference mechanism (using connectivity and colour distance features) is devised to group components into larger character-like regions.
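The two stages named in the abstract (merging perceptually similar pixels, then scoring component pairs on connectivity and colour distance) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the colour model, thresholds, or fuzzy rules, so everything below is an assumption made for illustration. CIELAB with the CIE76 distance stands in for "colour as humans see it", 4-connected flood fill stands in for the merging step, and the hypothetical `merge_propensity` function is a toy fuzzy AND (minimum) over two linear memberships.

```python
from collections import deque


def srgb_to_lab(rgb):
    """Convert an 8-bit sRGB triple to CIELAB (D65 white point)."""
    def linearise(c):
        c = c / 255.0
        return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4

    r, g, b = (linearise(c) for c in rgb)
    x = (0.4124 * r + 0.3576 * g + 0.1805 * b) / 0.95047
    y = 0.2126 * r + 0.7152 * g + 0.0722 * b
    z = (0.0193 * r + 0.1192 * g + 0.9505 * b) / 1.08883

    def f(t):
        return t ** (1 / 3) if t > 0.008856 else 7.787 * t + 16 / 116

    fx, fy, fz = f(x), f(y), f(z)
    return (116 * fy - 16, 500 * (fx - fy), 200 * (fy - fz))


def delta_e(rgb1, rgb2):
    """CIE76 colour difference: Euclidean distance in CIELAB."""
    lab1, lab2 = srgb_to_lab(rgb1), srgb_to_lab(rgb2)
    return sum((p - q) ** 2 for p, q in zip(lab1, lab2)) ** 0.5


def label_components(image, threshold=10.0):
    """Group 4-connected pixels whose neighbour-to-neighbour delta E is
    below `threshold` (an assumed value). `image` is a list of rows of
    RGB tuples. Returns (label grid, number of components)."""
    height, width = len(image), len(image[0])
    labels = [[-1] * width for _ in range(height)]
    count = 0
    for sy in range(height):
        for sx in range(width):
            if labels[sy][sx] != -1:
                continue
            labels[sy][sx] = count
            queue = deque([(sy, sx)])
            while queue:
                cy, cx = queue.popleft()
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and labels[ny][nx] == -1
                            and delta_e(image[cy][cx], image[ny][nx]) < threshold):
                        labels[ny][nx] = count
                        queue.append((ny, nx))
            count += 1
    return labels, count


def merge_propensity(colour_distance, shared_border_ratio, max_distance=30.0):
    """Toy fuzzy score in [0, 1] for merging two components.
    Hypothetical rule: 'similar colour AND well connected', with AND
    taken as the minimum of two linear membership values."""
    similar = max(0.0, 1.0 - colour_distance / max_distance)
    connected = min(1.0, max(0.0, shared_border_ratio))
    return min(similar, connected)
```

Comparing each pixel with its immediate neighbour, rather than with a component average, keeps the sketch short but lets colour drift along gradients; a practical system would likely need a stricter criterion.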
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Additional Information: | Event Dates: September 2001 |
| Divisions: | Faculty of Physical and Applied Science > Electronics and Computer Science |
| Item ID: | 263510 |
| Date Deposited: | 19 Feb 2007 |
| Last Modified: | 02 Mar 2012 12:40 |
| Contributors: | Antonacopoulos, Apostolos (Author) Karatzas, Dimosthenis (Author) |
| Date: | 2001 |
| Status: | Published |
| Publisher: | PRImA Press |
| URI: | http://eprints.soton.ac.uk/id/eprint/263510 |