Paper
13 April 2018 Generation method of synthetic training data for mobile OCR system
Author Affiliations +
Proceedings Volume 10696, Tenth International Conference on Machine Vision (ICMV 2017); 106962G (2018) https://doi.org/10.1117/12.2310119
Event: Tenth International Conference on Machine Vision, 2017, Vienna, Austria
Abstract
This paper addresses one of the fundamental problems of machine learning - training data acquiring. Obtaining enough natural training data is rather difficult and expensive. In last years usage of synthetic images has become more beneficial as it allows to save human time and also to provide a huge number of images which otherwise would be difficult to obtain. However, for successful learning on artificial dataset one should try to reduce the gap between natural and synthetic data distributions. In this paper we describe an algorithm which allows to create artificial training datasets for OCR systems using russian passport as a case study.
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yulia S. Chernyshova, Alexander V. Gayer, and Alexander V. Sheshkus "Generation method of synthetic training data for mobile OCR system", Proc. SPIE 10696, Tenth International Conference on Machine Vision (ICMV 2017), 106962G (13 April 2018); https://doi.org/10.1117/12.2310119
Lens.org Logo
CITATIONS
Cited by 8 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical character recognition

Convolutional neural networks

Machine learning

Back to Top