TY - GEN
T1 - A Comparative Analysis of Implicit Augmentation Techniques for Breast Cancer Diagnosis Using Multiple Views
AU - Hasan, Yumnah
AU - Khan, Talhat
AU - De Bulnes, Darian Reyes Fernández
AU - Albarracín, Juan F.H.
AU - Ryan, Conor
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The Design of effective deep-learning methods for medical image analysis represents a great challenge given the scarcity of balanced datasets, leading to biased results and overfitting. Data augmentation mitigates these limitations due to its effectiveness in increasing the diversity and quantity of training data, but the selection of an appropriate augmentation method strongly depends on the problem domain. In this study, we investigate the effects of various feature-level augmentation methods on the performance of Deep-Learning-based Breast Cancer (BC) diagnosis using mammographic images of Craniocaudal (CC) and Mediolateral Oblique (MLO) views. Through quantitative performance evaluations, we systematically assess the impact of augmentation techniques on classification using two feature extraction techniques, namely, Haralick features and deep GoogleNET features. Our experiments, conducted on the Digital Database for Screening Mammography (DDSM) and the Wisconsin Breast Cancer (WBC) datasets, reveal that Mixup, when combined with STEM, outstands as the most promising in a wide range of scenarios.
AB - The Design of effective deep-learning methods for medical image analysis represents a great challenge given the scarcity of balanced datasets, leading to biased results and overfitting. Data augmentation mitigates these limitations due to its effectiveness in increasing the diversity and quantity of training data, but the selection of an appropriate augmentation method strongly depends on the problem domain. In this study, we investigate the effects of various feature-level augmentation methods on the performance of Deep-Learning-based Breast Cancer (BC) diagnosis using mammographic images of Craniocaudal (CC) and Mediolateral Oblique (MLO) views. Through quantitative performance evaluations, we systematically assess the impact of augmentation techniques on classification using two feature extraction techniques, namely, Haralick features and deep GoogleNET features. Our experiments, conducted on the Digital Database for Screening Mammography (DDSM) and the Wisconsin Breast Cancer (WBC) datasets, reveal that Mixup, when combined with STEM, outstands as the most promising in a wide range of scenarios.
KW - 1D CNN
KW - Breast Cancer
KW - Class Imbalance
KW - Data Augmentation
KW - Neural Network
UR - http://www.scopus.com/inward/record.url?scp=85206493392&partnerID=8YFLogxK
U2 - 10.1109/CVPRW63382.2024.00240
DO - 10.1109/CVPRW63382.2024.00240
M3 - Conference contribution
AN - SCOPUS:85206493392
T3 - IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
SP - 2345
EP - 2354
BT - Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
PB - IEEE Computer Society
T2 - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
Y2 - 16 June 2024 through 22 June 2024
ER -