Paper
21 March 2013 A research on Video text tracking and recognition
Author Affiliations +
Proceedings Volume 8664, Imaging and Printing in a Web 2.0 World IV; 86640G (2013) https://doi.org/10.1117/12.2009441
Event: IS&T/SPIE Electronic Imaging, 2013, Burlingame, California, United States
Abstract
Nowadays, video has gradually become the mainstream of dissemination media for its rich information capacity and intelligibility, and texts in videos often carry significant semantic information, thus making great contribution to video content understanding and construction of content-based video retrieval system. Text-based video analyses usually consist of text detection, localization, tracking, segmentation and recognition. There has been a large amount of research done on video text detection and tracking, but most solutions focus on text content processing in static frames, few making full use of redundancy between video frames. In this paper, a unified framework for text detection, localization and tracking in video frames is proposed. We select edge and corner distribution of text blocks as text features, localizing and tracking are performed. By making good use of redundancy between frames, location relations and motion characteristics are determined, thus effectively reduce false-alarm and raise correct rate in localizing. Tracking schemes are proposed for static and rolling texts respectively. Through multi-frame integration, text quality is promoted, so is correct rate of OCR. Experiments demonstrate the reduction of false-alarm and the increase of correct rate of localization and recognition.
© (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Baokang Wang, Changsong Liu, and Xiaoqing Ding "A research on Video text tracking and recognition", Proc. SPIE 8664, Imaging and Printing in a Web 2.0 World IV, 86640G (21 March 2013); https://doi.org/10.1117/12.2009441
Lens.org Logo
CITATIONS
Cited by 7 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Semantic video

Video processing

Optical character recognition

Detection and tracking algorithms

Binary data

3D modeling

RELATED CONTENT


Back to Top