Paper
19 January 2009 Learning approach for multicontent analysis of compound images
Quentin Besnehard, Cedric Marchessoux, Tom Kimpe, Guillaume Spalla, Arnaud Joubel, François Boudet
Author Affiliations +
Proceedings Volume 7255, Multimedia Content Access: Algorithms and Systems III; 72550E (2009) https://doi.org/10.1117/12.805789
Event: IS&T/SPIE Electronic Imaging, 2009, San Jose, California, United States
Abstract
In the context of the European Cantata project (ITEA project, 2006-2009), within Barco, a complete Multi-Content Analysis framework was developed for detection and analysis of compound images. The framework consists of: a dataset, a Multi-Content Analysis (MCA) algorithm based on learning approaches, a Ground Truth, an evaluation module based on metrics and a presentation module. The aim of the MCA methodology presented here is to classify image content of computer screenshots into different categories such as: text; Graphical User Interface; Medical images and other complex images. The AdaBoost meta-algorithm was chosen, implemented and optimized for the classification method as it fitted the constraints (real-time and precision). A large dataset separated in training and testing subsets and their ground truth (with ViPER metadata format) was both collected and generated for the four different categories. The outcome of the MCA is a cascade of strong classifiers trained and tested on the different subsets. The obtained framework and its optimization (binary search, pre-computing of the features, pre-sorting) allow to re-train the classifiers as much as needed. The preliminary results are quite encouraging with a low false positive rate and close true positive rate in comparison with expectations. The re-injection of false negative examples from new testing subsets in the training phase resulted in better performances of the MCA.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Quentin Besnehard, Cedric Marchessoux, Tom Kimpe, Guillaume Spalla, Arnaud Joubel, and François Boudet "Learning approach for multicontent analysis of compound images", Proc. SPIE 7255, Multimedia Content Access: Algorithms and Systems III, 72550E (19 January 2009); https://doi.org/10.1117/12.805789
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image classification

Medical imaging

Binary data

Image analysis

Image processing

Human-machine interfaces

Image enhancement

Back to Top