Paper
18 February 2014 Multi-frame knowledge based text enhancement for mobile phone captured videos
Suleyman Ozarslan, P. Erhan Eren
Author Affiliations +
Proceedings Volume 9030, Mobile Devices and Multimedia: Enabling Technologies, Algorithms, and Applications 2014; 90300F (2014) https://doi.org/10.1117/12.2040606
Event: IS&T/SPIE Electronic Imaging, 2014, San Francisco, California, United States
Abstract
In this study, we explore automated text recognition and enhancement using mobile phone captured videos of store receipts. We propose a method which includes Optical Character Resolution (OCR) enhanced by our proposed Row Based Multiple Frame Integration (RB-MFI), and Knowledge Based Correction (KBC) algorithms. In this method, first, the trained OCR engine is used for recognition; then, the RB-MFI algorithm is applied to the output of the OCR. The RB-MFI algorithm determines and combines the most accurate rows of the text outputs extracted by using OCR from multiple frames of the video. After RB-MFI, KBC algorithm is applied to these rows to correct erroneous characters. Results of the experiments show that the proposed video-based approach which includes the RB-MFI and the KBC algorithm increases the word character recognition rate to 95%, and the character recognition rate to 98%.
© (2014) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Suleyman Ozarslan and P. Erhan Eren "Multi-frame knowledge based text enhancement for mobile phone captured videos", Proc. SPIE 9030, Mobile Devices and Multimedia: Enabling Technologies, Algorithms, and Applications 2014, 90300F (18 February 2014); https://doi.org/10.1117/12.2040606
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical character recognition

Video

Detection and tracking algorithms

Image processing

Cell phones

Cameras

Light sources and illumination

Back to Top