Paper
11 July 2024 CBSENet: A conformer-based speech enhancement network
Wenke Xu, Xuesong Su, Jin Yang
Author Affiliations +
Abstract
Deep learning for speech enhancement has become a popular trend. With the introduction of attention mechanisms, the field of speech enhancement has seen further advancements. In previous research, Transformer is commonly employed as the core network for speech enhancement due to its attention capabilities, but it has certain limitations in extracting local features. To address these issues, we propose a novel speech enhancement model named CBSENet. This model utilizes the CRN architecture and adopts Conformer as its core network. The Conformer's capabilities in global and local feature extraction have been validated. The CBSENet model demonstrates excellent performance in speech enhancement tasks and outperforms the baseline network in both objective and subjective evaluation metrics.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Wenke Xu, Xuesong Su, and Jin Yang "CBSENet: A conformer-based speech enhancement network", Proc. SPIE 13210, Third International Symposium on Computer Applications and Information Systems (ISCAIS 2024), 132100E (11 July 2024); https://doi.org/10.1117/12.3034949
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Convolution

Performance modeling

Deep learning

Time-frequency analysis

Signal processing

Interference (communication)

Modeling

Back to Top