A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception
Antonacopoulos, Apostolos and Karatzas, Dimosthenis (2003) A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception. In, Antonacopoulos, Apostolos and Hu, J (eds.) Web Document Analysis: Challenges and Opportunities. , World Scientific Publishing Company, 203-221.
This chapter describes a new approach for the segmentation of text in images on Web pages. In the same spirit as the authors’ previous work on this subject, this approach attempts to model the ability of humans to differentiate between colours. In this case, pixels of similar colour are first grouped using a colour distance defined in a perceptually uniform colour space (as opposed to the commonly used RGB). The resulting colour connected components are then grouped to form larger (character-like) regions with the aid of a propinquity measure, which is the output of a fuzzy inference system. This measure expresses the likelihood for merging two components based on two features. The first feature is the colour distance between the components, in the L*a*b* colour space. The second feature expresses the topological relationship of two components. The results of the method indicate a better performance than previous methods devised by the authors and possibly better (a direct comparison is not really possible due to the differences in application domain characteristics between this and previous methods) performance to other existing methods.
|Item Type:||Book Section|
|Keywords:||Text segmentation, web document analysis, image analysis, fuzzy|
|Divisions:||Faculty of Physical and Applied Science > Electronics and Computer Science
|Date Deposited:||19 Feb 2007|
|Last Modified:||02 Mar 2012 13:20|
|Contributors:||Antonacopoulos, Apostolos (Author)
Karatzas, Dimosthenis (Author)
Antonacopoulos, Apostolos (Editor)
Hu, J (Editor)
|Publisher:||World Scientific Publishing Company|
|Further Information:||Google Scholar|
|RDF:||RDF+N-Triples, RDF+N3, RDF+XML, Browse.|
Actions (login required)