TY - JOUR
T1 - DCM-CNER
T2 - A Dual-Channel Model for Clinical Named Entity Recognition Based on Embedded ConvNet and Gated Dilated CNN
AU - Shi, Lin
AU - Zhou, Wenyan
AU - Wu, Yafeng
AU - Yuan, Na
AU - Zang, Xiangzhen
AU - Ji, Zhanlin
AU - Ganchev, Ivan
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024
Y1 - 2024
N2 - As the volume of Chinese electronic medical records (EMRs) experiences an explosive growth, the application of clinical named entity recognition (CNER) technology becomes crucial for the effective utilization of EMR data and practical implementation of evidence-based medicine. While mainstream models excel in capturing global contextual information, their feature extraction mechanisms tend to be unidimensional, limiting their information retrieval capabilities. To address this issue, this paper proposes a Dual-Channel Model for CNER (DCM-CNER), which enhances feature extraction through the introduction of a customized convolutional stack, denoted as emConvNet (embedded ConvNet), for local feature extraction. The model further addresses the vanishing gradient issue by employing a newly designed Gated Dilated Convolutional Neural Network (GDCNN) module with a residual structure. Sequential and contextual information is extracted from the text using a Bidirectional Long Short-Term Memory (BiLSTM) layer. Additionally, a multi-head bilinear attention mechanism is introduced for parallel dynamic feature fusion. Experimental results demonstrate the superiority of the proposed DCM-CNER model, in comparison to the existing mainstream models and state-of-the-art models, achieving F1 scores of 94.15%, 85.26%, and 84.21% on the CCKS2017, CCKS2019, and CLUENER2020 datasets, respectively, thereby validating its effectiveness in performing the task of Chinese CNER in EMRs.
AB - As the volume of Chinese electronic medical records (EMRs) experiences an explosive growth, the application of clinical named entity recognition (CNER) technology becomes crucial for the effective utilization of EMR data and practical implementation of evidence-based medicine. While mainstream models excel in capturing global contextual information, their feature extraction mechanisms tend to be unidimensional, limiting their information retrieval capabilities. To address this issue, this paper proposes a Dual-Channel Model for CNER (DCM-CNER), which enhances feature extraction through the introduction of a customized convolutional stack, denoted as emConvNet (embedded ConvNet), for local feature extraction. The model further addresses the vanishing gradient issue by employing a newly designed Gated Dilated Convolutional Neural Network (GDCNN) module with a residual structure. Sequential and contextual information is extracted from the text using a Bidirectional Long Short-Term Memory (BiLSTM) layer. Additionally, a multi-head bilinear attention mechanism is introduced for parallel dynamic feature fusion. Experimental results demonstrate the superiority of the proposed DCM-CNER model, in comparison to the existing mainstream models and state-of-the-art models, achieving F1 scores of 94.15%, 85.26%, and 84.21% on the CCKS2017, CCKS2019, and CLUENER2020 datasets, respectively, thereby validating its effectiveness in performing the task of Chinese CNER in EMRs.
KW - clinical named entity recognition (CNER)
KW - convolutional stack
KW - dual-channel model
KW - Electronic medical record (EMR)
KW - gated dilated convolution
KW - multi-head bilinear attention
UR - https://www.scopus.com/pages/publications/85199151485
U2 - 10.1109/ACCESS.2024.3422677
DO - 10.1109/ACCESS.2024.3422677
M3 - Article
AN - SCOPUS:85199151485
SN - 2169-3536
VL - 12
SP - 97726
EP - 97738
JO - IEEE Access
JF - IEEE Access
ER -