TY - JOUR
T1 - ConvNeXt-ST-AFF
T2 - A Novel Skin Disease Classification Model Based on Fusion of ConvNeXt and Swin Transformer
AU - Hao, Shengnan
AU - Zhang, Liguo
AU - Jiang, Yanyan
AU - Wang, Jingkun
AU - Ji, Zhanlin
AU - Zhao, Li
AU - Ganchev, Ivan
N1 - Publisher Copyright:
© 2023 The Authors.
PY - 2023
Y1 - 2023
N2 - Automatic classification of dermatological images is an important technology that assists doctors in performing faster and more accurate classification of skin diseases. Recently, convolutional neural networks (CNNs) and Transformer networks have been employed in learning respectively the local and global features of lesion images. However, existing works mainly focus on utilizing a single neural network for feature extraction, which limits the model classification performance. In order to tackle this problem, a novel fusion model, named ConvNeXt-ST-AFF, is proposed in this paper, by combining the strengths of ConvNeXt and Swin Transformer (ConvNeXt-ST in the model's name). In the proposed model, the pretrained ConvNeXt and Swin Transformer networks extract local and global features from images, which are then fused using Attentional Feature Fusion (AFF) submodules (AFF in the model's name). Additionally, in order to enhance the model's attention on the regions of skin lesions during training, an Efficient Channel Attention (ECA) module is incorporated into the ConvNeXt network. Moreover, the proposed model employs a denoising module to reduce the influence of artifacts and improve the image contrast. The results, obtained by experiments conducted on two datasets, demonstrate that the proposed ConvNeXt-ST-AFF model has higher classification ability, according to multiple evaluation metrics, compared to the original ConvNeXt and Swin Transformer, and other state-of-the-art classification models.
AB - Automatic classification of dermatological images is an important technology that assists doctors in performing faster and more accurate classification of skin diseases. Recently, convolutional neural networks (CNNs) and Transformer networks have been employed in learning respectively the local and global features of lesion images. However, existing works mainly focus on utilizing a single neural network for feature extraction, which limits the model classification performance. In order to tackle this problem, a novel fusion model, named ConvNeXt-ST-AFF, is proposed in this paper, by combining the strengths of ConvNeXt and Swin Transformer (ConvNeXt-ST in the model's name). In the proposed model, the pretrained ConvNeXt and Swin Transformer networks extract local and global features from images, which are then fused using Attentional Feature Fusion (AFF) submodules (AFF in the model's name). Additionally, in order to enhance the model's attention on the regions of skin lesions during training, an Efficient Channel Attention (ECA) module is incorporated into the ConvNeXt network. Moreover, the proposed model employs a denoising module to reduce the influence of artifacts and improve the image contrast. The results, obtained by experiments conducted on two datasets, demonstrate that the proposed ConvNeXt-ST-AFF model has higher classification ability, according to multiple evaluation metrics, compared to the original ConvNeXt and Swin Transformer, and other state-of-the-art classification models.
KW - attention
KW - ConvNeXt
KW - image denoising
KW - model fusion
KW - Skin disease classification
KW - swin transformer
UR - https://www.scopus.com/pages/publications/85174857440
U2 - 10.1109/ACCESS.2023.3324042
DO - 10.1109/ACCESS.2023.3324042
M3 - Article
AN - SCOPUS:85174857440
SN - 2169-3536
VL - 11
SP - 117460
EP - 117473
JO - IEEE Access
JF - IEEE Access
ER -