Paper
9 October 2024 Research on music symbol recognition model based on YOLOv8s
Yingqi Xia, Ying Zhao
Author Affiliations +
Proceedings Volume 13288, Fourth International Conference on Computer Graphics, Image, and Virtualization (ICCGIV 2024); 1328802 (2024) https://doi.org/10.1117/12.3045298
Event: Fourth International Conference on Computer Graphics, Image, and Virtualization (ICCGIV 2024), 2024, Chengdu, China
Abstract
Optical Music Recognition aims to automatically extract music information such as notes and beats from printed or handwritten music score images using computer vision technology, which holds significant value in fields such as music information retrieval and sheet music digitization, etc. This paper introduces YOLOv8 object detection algorithms into music symbol recognition field, and an improved model named YOLO-Score is proposed based on YOLOv8s.This model brings SPD-Conv into the backbone feature network to enhance the recognition ability for small targets; incorporates LSK selective attention mechanism to focus on more meaningful feature information using extensive contextual information; redesigns the detection layer by adding a small target detection branch and removing the large target detection branch to strengthen the network's feature fusion capability; and employs Shape-IoU as the bounding box regression loss function to improve network convergence accuracy. The experimental results show an 11.2% increase in precision, a 33.0% increase in recall, a 26.6% increase in mAP, and a reduction of 1.8Mb in weight file size compared to the YOLOv8s model.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Yingqi Xia and Ying Zhao "Research on music symbol recognition model based on YOLOv8s", Proc. SPIE 13288, Fourth International Conference on Computer Graphics, Image, and Virtualization (ICCGIV 2024), 1328802 (9 October 2024); https://doi.org/10.1117/12.3045298
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Performance modeling

Data modeling

Small targets

Target detection

Visual process modeling

Education and training

Back to Top