Paper
15 March 2019 Pattern augmentation for classifying handwritten japanese hiragana characters of 46 classes by using CNN pre-trained with object images
Yoshihiro Shima, Yuki Omori, Yumi Nakashima, Michio Yasuda
Author Affiliations +
Proceedings Volume 11041, Eleventh International Conference on Machine Vision (ICMV 2018); 110412G (2019) https://doi.org/10.1117/12.2522661
Event: Eleventh International Conference on Machine Vision (ICMV 2018), 2018, Munich, Germany
Abstract
Neural networks are powerful technology for classifying character patterns and object images. A huge number of training samples is very important for classification accuracy. A novel method for recognizing handwritten hiragana characters is proposed that combines pre-trained convolutional neural networks (CNN) and support vector machines (SVM). The training samples are augmented by pattern distortion such as by cosine translation and elastic distortion. A pre-trained CNN, Alex-Net, can be used as the pattern feature extractor. Alex-Net is pre-trained for large-scale object image datasets. An SVM is used as a trainable classifier. Original hiragana samples of 46 classes on the ETL9B are divided in two-fold by odd and even dataset numbers. Samples with the odd dataset number and augmented patterns on the ETL9B database are trained by the SVM. The feature vectors of character patterns are passed to the SVM from AlexNet. The average error rate was 1.130% for 100 test patterns of each of the 46 classes for a 5-times test, and the lowest error rate was 0.978% with 506138 training patterns of distorted hiragana characters. Experimental results showed that the proposed method is effective in recognizing handwritten hiragana characters.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yoshihiro Shima, Yuki Omori, Yumi Nakashima, and Michio Yasuda "Pattern augmentation for classifying handwritten japanese hiragana characters of 46 classes by using CNN pre-trained with object images", Proc. SPIE 11041, Eleventh International Conference on Machine Vision (ICMV 2018), 110412G (15 March 2019); https://doi.org/10.1117/12.2522661
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Distortion

Databases

Convolutional neural networks

Image classification

Error analysis

MATLAB

Neural networks

Back to Top