The University of Southampton
University of Southampton Institutional Repository

A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception

A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception
A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception
This chapter describes a new approach for the segmentation of text in images on Web pages. In the same spirit as the authors’ previous work on this subject, this approach attempts to model the ability of humans to differentiate between colours. In this case, pixels of similar colour are first grouped using a colour distance defined in a perceptually uniform colour space (as opposed to the commonly used RGB). The resulting colour connected components are then grouped to form larger (character-like) regions with the aid of a propinquity measure, which is the output of a fuzzy inference system. This measure expresses the likelihood for merging two components based on two features. The first feature is the colour distance between the components, in the L*a*b* colour space. The second feature expresses the topological relationship of two components. The results of the method indicate a better performance than previous methods devised by the authors and possibly better (a direct comparison is not really possible due to the differences in application domain characteristics between this and previous methods) performance to other existing methods.
Text segmentation, web document analysis, image analysis, fuzzy
203-221
World Scientific Publishing Company
Antonacopoulos, Apostolos
9369bee5-b30f-4d4c-a63d-fe54984578cc
Karatzas, Dimosthenis
4d7e3927-2252-4039-88a4-0daca766e943
Antonacopoulos, Apostolos
Hu, J
Antonacopoulos, Apostolos
9369bee5-b30f-4d4c-a63d-fe54984578cc
Karatzas, Dimosthenis
4d7e3927-2252-4039-88a4-0daca766e943
Antonacopoulos, Apostolos
Hu, J

Antonacopoulos, Apostolos and Karatzas, Dimosthenis (2003) A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception. In, Antonacopoulos, Apostolos and Hu, J (eds.) Web Document Analysis: Challenges and Opportunities. World Scientific Publishing Company, pp. 203-221.

Record type: Book Section

Abstract

This chapter describes a new approach for the segmentation of text in images on Web pages. In the same spirit as the authors’ previous work on this subject, this approach attempts to model the ability of humans to differentiate between colours. In this case, pixels of similar colour are first grouped using a colour distance defined in a perceptually uniform colour space (as opposed to the commonly used RGB). The resulting colour connected components are then grouped to form larger (character-like) regions with the aid of a propinquity measure, which is the output of a fuzzy inference system. This measure expresses the likelihood for merging two components based on two features. The first feature is the colour distance between the components, in the L*a*b* colour space. The second feature expresses the topological relationship of two components. The results of the method indicate a better performance than previous methods devised by the authors and possibly better (a direct comparison is not really possible due to the differences in application domain characteristics between this and previous methods) performance to other existing methods.

PDF
WDAbook_Antonacopoulos.pdf - Other
Download (5MB)

More information

Published date: 2003
Keywords: Text segmentation, web document analysis, image analysis, fuzzy
Organisations: Electronics & Computer Science

Identifiers

Local EPrints ID: 263518
URI: https://eprints.soton.ac.uk/id/eprint/263518
PURE UUID: 42c9ef4b-0830-429d-96e5-ecd4b1014a32

Catalogue record

Date deposited: 19 Feb 2007
Last modified: 18 Jul 2017 07:44

Export record

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of https://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×