Paper
3 April 2024 HoughToRadon transform: new neural network layer for features improvement in projection space
Author Affiliations +
Proceedings Volume 13072, Sixteenth International Conference on Machine Vision (ICMV 2023); 1307210 (2024) https://doi.org/10.1117/12.3023163
Event: Sixteenth International Conference on Machine Vision (ICMV 2023), 2023, Yerevan, Armenia
Abstract
In this paper, we introduce HoughToRadon Transform layer, a novel layer designed to improve the speed of neural networks incorporated with Hough Transform to solve semantic image segmentation problems. By placing it after a Hough Transform layer, ’inner’ convolutions receive modified feature maps with new beneficial properties, such as a smaller area of processed images and parameter space linearity by angle and shift. These properties were not presented in Hough Transform alone. Furthermore, HoughToRadon Transform layer allows us to adjust the size of intermediate feature maps using two new parameters, thus allowing us to balance the speed and quality of the resulting neural network. Our experiments on the open MIDV-500 dataset show that this new approach leads to time savings in document segmentation tasks and achieves state-of-the-art 97.7% accuracy, outperforming HoughEncoder with larger computational complexity.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Alexandra Zhabitskaya, Alexander Sheshkus, and Vladimir L. Arlazarov "HoughToRadon transform: new neural network layer for features improvement in projection space", Proc. SPIE 13072, Sixteenth International Conference on Machine Vision (ICMV 2023), 1307210 (3 April 2024); https://doi.org/10.1117/12.3023163
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Neural networks

Hough transforms

Convolution

Image segmentation

Education and training

Semantics

Image processing

Back to Top