Paper
6 December 2021 Char-word fusion method for Chinese NER
Letian Lu, Hao Liu
Author Affiliations +
Proceedings Volume 12085, International Conference on Green Communication, Network, and Internet of Things (GCNIoT 2021); 120850P (2021) https://doi.org/10.1117/12.2624996
Event: 2021 International Conference on Green Communication, Network, and Internet of Things, 2021, Kunming, China
Abstract
The task of Chinese named entity recognition (CNER) is closely related to Chinese word segmentation, because most Chinese entities are composed of words. The CNER model with words as the minimum input unit is called word based CNER model. However, wrong word segmentation will lead to wrong entity recognition results, so the CNER model with single word as input (char based) has gradually become the focus of research. Recently, many efforts have begun to integrate external word information to improve the performance of word based CNER model. And softlexicon are excellent word information fusion methods. When using softlexicon method for word fusion, too much matching word set information will be introduced, resulting in additional "noise" interference, resulting in entity recognition errors. Aiming at the problems of softlexicon method, this paper proposes a word information Selected-Softword-Concat method (SSC), which effectively solves the problem of "noise" in softlexicon method by introducing word segmentation information. The experimental results on four general CNER data sets show that compared with softlexicon method, the F1 value of the model constructed by SSC method is increased by 0.94, 0.92, 0.78 and 0.25 respectively, which effectively solves the "noise" problem of softlexicon.
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Letian Lu and Hao Liu "Char-word fusion method for Chinese NER", Proc. SPIE 12085, International Conference on Green Communication, Network, and Internet of Things (GCNIoT 2021), 120850P (6 December 2021); https://doi.org/10.1117/12.2624996
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Binary data

Bismuth

Performance modeling

Defense technologies

Head

Information fusion

Back to Top