A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception
A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception
This chapter describes a new approach for the segmentation of text in images on Web pages. In the same spirit as the authors’ previous work on this subject, this approach attempts to model the ability of humans to differentiate between colours. In this case, pixels of similar colour are first grouped using a colour distance defined in a perceptually uniform colour space (as opposed to the commonly used RGB). The resulting colour connected components are then grouped to form larger (character-like) regions with the aid of a propinquity measure, which is the output of a fuzzy inference system. This measure expresses the likelihood for merging two components based on two features. The first feature is the colour distance between the components, in the L*a*b* colour space. The second feature expresses the topological relationship of two components. The results of the method indicate a better performance than previous methods devised by the authors and possibly better (a direct comparison is not really possible due to the differences in application domain characteristics between this and previous methods) performance to other existing methods.
Text segmentation, web document analysis, image analysis, fuzzy
203-221
Antonacopoulos, Apostolos
9369bee5-b30f-4d4c-a63d-fe54984578cc
Karatzas, Dimosthenis
4d7e3927-2252-4039-88a4-0daca766e943
Antonacopoulos, Apostolos
2003
Antonacopoulos, Apostolos
9369bee5-b30f-4d4c-a63d-fe54984578cc
Karatzas, Dimosthenis
4d7e3927-2252-4039-88a4-0daca766e943
Antonacopoulos, Apostolos
Antonacopoulos, Apostolos and Karatzas, Dimosthenis
(2003)
A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception.
In,
Antonacopoulos, Apostolos and Hu, J
(eds.)
Web Document Analysis: Challenges and Opportunities.
World Scientific, .
Record type:
Book Section
Abstract
This chapter describes a new approach for the segmentation of text in images on Web pages. In the same spirit as the authors’ previous work on this subject, this approach attempts to model the ability of humans to differentiate between colours. In this case, pixels of similar colour are first grouped using a colour distance defined in a perceptually uniform colour space (as opposed to the commonly used RGB). The resulting colour connected components are then grouped to form larger (character-like) regions with the aid of a propinquity measure, which is the output of a fuzzy inference system. This measure expresses the likelihood for merging two components based on two features. The first feature is the colour distance between the components, in the L*a*b* colour space. The second feature expresses the topological relationship of two components. The results of the method indicate a better performance than previous methods devised by the authors and possibly better (a direct comparison is not really possible due to the differences in application domain characteristics between this and previous methods) performance to other existing methods.
Text
WDAbook_Antonacopoulos.pdf
- Other
More information
Published date: 2003
Keywords:
Text segmentation, web document analysis, image analysis, fuzzy
Organisations:
Electronics & Computer Science
Identifiers
Local EPrints ID: 263518
URI: http://eprints.soton.ac.uk/id/eprint/263518
PURE UUID: 42c9ef4b-0830-429d-96e5-ecd4b1014a32
Catalogue record
Date deposited: 19 Feb 2007
Last modified: 14 Mar 2024 07:33
Export record
Contributors
Author:
Apostolos Antonacopoulos
Author:
Dimosthenis Karatzas
Editor:
Apostolos Antonacopoulos
Editor:
J Hu
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics