18 January 2010 Trainable multiscript orientation detection
Proceedings Volume 7534, Document Recognition and Retrieval XVII; 75340W (2010) https://doi.org/10.1117/12.839409
Event: IS&T/SPIE Electronic Imaging, 2010, San Jose, California, United States
Abstract
Detecting the correct orientation of document images is an important step in large-scale digitization processes, as most subsequent document analysis and optical character recognition methods assume that the document page is upright. Many methods have been proposed to solve this problem, most of which are based on computing the ascender-to-descender ratio. Unfortunately, this approach cannot be used for scripts that have neither ascenders nor descenders. We therefore present a trainable method that uses character similarity to compute the correct orientation. A connected-component-based distance measure is computed to compare the characters of the document image with characters whose orientation is known. The orientation for which the distance is lowest is then taken as the correct orientation. Training is easily achieved by replacing the reference characters with characters of the script to be analyzed. Evaluation of the proposed approach showed an accuracy above 99% for Latin and Japanese script from the public UW-III and UW-II datasets. An accuracy of 98.9% was obtained for Fraktur on a non-public dataset. A comparison of the proposed method with two orientation detection methods based on the ascender/descender ratio shows a significant improvement.
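The selection rule described in the abstract (compare page characters to reference characters of known orientation and keep the orientation with the lowest distance) can be illustrated with a minimal sketch. The sketch rests on assumptions the abstract does not state: glyphs are taken to be small square binary bitmaps already extracted as connected components, only the four right-angle orientations are tested, and a plain Hamming distance stands in for the paper's connected-component-based distance measure. The names `rotate_ccw`, `hamming`, and `detect_orientation` are hypothetical.

```python
from typing import Sequence, Tuple

Glyph = Tuple[Tuple[int, ...], ...]  # square binary bitmap, one tuple per row


def rotate_ccw(glyph: Glyph, quarter_turns: int = 1) -> Glyph:
    """Rotate a bitmap counter-clockwise by quarter_turns * 90 degrees."""
    for _ in range(quarter_turns % 4):
        glyph = tuple(zip(*glyph))[::-1]
    return glyph


def hamming(a: Glyph, b: Glyph) -> int:
    """Count differing pixels (a stand-in, not the paper's distance measure)."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def detect_orientation(page_glyphs: Sequence[Glyph],
                       references: Sequence[Glyph]) -> int:
    """Return the counter-clockwise correction in degrees (0, 90, 180 or 270)
    under which the page glyphs lie closest to the upright references."""
    def total_distance(quarter_turns: int) -> int:
        # Each page glyph is matched to its nearest reference glyph.
        return sum(
            min(hamming(rotate_ccw(g, quarter_turns), r) for r in references)
            for g in page_glyphs
        )
    return 90 * min(range(4), key=total_distance)


# Toy reference set: "training" for another script means swapping these glyphs.
L_SHAPE = ((1, 0, 0), (1, 0, 0), (1, 1, 1))
T_SHAPE = ((1, 1, 1), (0, 1, 0), (0, 1, 0))
references = [L_SHAPE, T_SHAPE]

# Simulate a page scanned 90 degrees clockwise (3 counter-clockwise turns).
page = [rotate_ccw(g, 3) for g in (L_SHAPE, T_SHAPE, L_SHAPE)]
correction = detect_orientation(page, references)
```

In this toy setup, adapting the detector to a different script only requires passing a different `references` list, which mirrors the training-by-exchange idea in the abstract.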
© (2010) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Joost Van Beusekom, Yves Rangoni, and Thomas M. Breuel "Trainable multiscript orientation detection", Proc. SPIE 7534, Document Recognition and Retrieval XVII, 75340W (18 January 2010); https://doi.org/10.1117/12.839409
KEYWORDS
Distance measurement
Associative arrays
Image processing
Analytical research
Optical character recognition
Scanners
Shape analysis
