Paper
22 March 2019 Character recognition of modern Japanese official documents using CNN for imbalanced learning data
Author Affiliations +
Proceedings Volume 11049, International Workshop on Advanced Image Technology (IWAIT) 2019; 1104906 (2019) https://doi.org/10.1117/12.2521307
Event: 2019 Joint International Workshop on Advanced Image Technology (IWAIT) and International Forum on Medical Imaging in Asia (IFMIA), 2019, Singapore, Singapore
Abstract
The documents of the government-general of Taiwan recorded from 1895 to 1945 contain the whole of Japanese official documents before the end of the WW2, and have great historic value. The characters in the documents, however, are illegible because they were written by hand with a brush. It is labor-intensive work for historians or scholars to understand the documents. We propose a method for character recognition of these documents by using a convolutional neural network and also conduct to solve the problem of imbalanced learning data. Experimental results show that the top-1 and the top10 accuracies were 89.48% and 98.10%, respectively.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zongjhe Yang, Keisuke Doman, Masashi Yamada, and Yoshito Mekada "Character recognition of modern Japanese official documents using CNN for imbalanced learning data", Proc. SPIE 11049, International Workshop on Advanced Image Technology (IWAIT) 2019, 1104906 (22 March 2019); https://doi.org/10.1117/12.2521307
Lens.org Logo
CITATIONS
Cited by 4 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical character recognition

Convolution

Convolutional neural networks

Data modeling

Image processing

Process modeling

Associative arrays

Back to Top