Paper
21 December 2000 Detection of text strings from mixed text/graphics images
Chien-Hua Tsai, Christos A. Papachristou
Author Affiliations +
Proceedings Volume 4307, Document Recognition and Retrieval VIII; (2000) https://doi.org/10.1117/12.410838
Event: Photonics West 2001 - Electronic Imaging, 2001, San Jose, CA, United States
Abstract
A robust system for text strings separation from mixed text/graphics images is presented. Based on a union-find (region growing) strategy the algorithm is thus able to classify the text from graphics and adapts to changes in document type, language category (e.g., English, Chinese and Japanese), text font style and size, and text string orientation within digital images. In addition, it allows for a document skew that usually occurs in documents, without skew correction prior to discrimination while these proposed methods such a projection profile or run length coding are not always suitable for the condition. The method has been tested with a variety of printed documents from different origins with one common set of parameters, and the experimental results of the performance of the algorithm in terms of computational efficiency are demonstrated by using several tested images from the evaluation.
© (2000) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Chien-Hua Tsai and Christos A. Papachristou "Detection of text strings from mixed text/graphics images", Proc. SPIE 4307, Document Recognition and Retrieval VIII, (21 December 2000); https://doi.org/10.1117/12.410838
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Image segmentation

Detection and tracking algorithms

Image processing

Optical character recognition

Document imaging

Chemical elements

RELATED CONTENT

Text segmentation for automatic document processing
Proceedings of SPIE (January 07 1999)
Thai handwritten character recognition by Euclidean distance
Proceedings of SPIE (February 26 2010)
Survey: omnifont-printed character recognition
Proceedings of SPIE (November 01 1991)
Graph-based table recognition system
Proceedings of SPIE (March 07 1996)

Back to Top