Paper
14 November 2023 Tibetan-Chinese bilingual text detection in scene images integrating spatial attention features
Kaizheng Li, Yusheng Hao, Qiaoqiao Li, Weilan Wang
Author Affiliations +
Proceedings Volume 12934, Third International Conference on Computer Graphics, Image, and Virtualization (ICCGIV 2023); 1293414 (2023) https://doi.org/10.1117/12.3008395
Event: 2023 3rd International Conference on Computer Graphics, Image and Virtualization (ICCGIV 2023), 2023, Nanjing, China
Abstract
Aiming at the problems of false detection and missing detection of texts in the process of text detection caused by random distribution of Tibetan texts, various scales and shapes in natural scenes, this paper proposes a natural scene Tibetan text detection algorithm based on feature enhancement of spatial attention mechanism. The spatial attention mechanism is introduced into the pyramid network module of feature extraction to extract richer local and overall information and enhance the ability of feature extraction; feature kernel clustering can better distinguish adjacent text instances, and the predicted similarity vector is accurate Aggregate text pixels to the corresponding text kernel, further improve the accuracy of scene Tibetan detection, and effectively reduce false detection and missed detection. The model is evaluated on the TCSD scene Tibetan dataset, and the results show that the F-measure comprehensive index of this method reaches 81.09%, which is better than the previous scene Tibetan detection algorithm.
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Kaizheng Li, Yusheng Hao, Qiaoqiao Li, and Weilan Wang "Tibetan-Chinese bilingual text detection in scene images integrating spatial attention features", Proc. SPIE 12934, Third International Conference on Computer Graphics, Image, and Virtualization (ICCGIV 2023), 1293414 (14 November 2023); https://doi.org/10.1117/12.3008395
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Image segmentation

Education and training

Detection and tracking algorithms

Data modeling

Image processing algorithms and systems

Convolution

Back to Top