TY - JOUR
T1 - Comparative evaluation of support vector machines for computer aided diagnosis of lung cancer in CT based on a multi-dimensional data set
AU - Sun, Tao
AU - Wang, Jingjing
AU - Li, Xia
AU - Lv, Pingxin
AU - Liu, Fen
AU - Luo, Yanxia
AU - Gao, Qi
AU - Zhu, Huiping
AU - Guo, Xiuhua
N1 - Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
PY - 2013/8
Y1 - 2013/8
N2 - Lung cancer is one of the most common forms of cancer resulting in over a million deaths per year worldwide. In this paper, the usage of support vector machine (SVM) classification for lung cancer is investigated, presenting a systematic quantitative evaluation against Boosting, Decision trees, k-nearest neighbor, LASSO regressions, neural networks and random forests. A large database of 5984 regions of interest (ROIs) and 488 input features (including textural features, patient characteristics, and morphological features) were used to train the classifiers and evaluate for their performance. The evaluation for classifiers' performance was based on a tenfold cross validation framework, receiver operating characteristic curve (ROC), and Matthews correlation coefficient. Area under curve (AUC) of SVM, Boosting, Decision trees, k-nearest neighbor, LASSO, neural networks, random forests were 0.94, 0.86, 0.73, 0.72, 0.91, 0.92, and 0.85, respectively. It was proved that SVM classification offered significantly increased classification performance compared to the reference methods. This scheme may be used as an auxiliary tool to differentiate between benign and malignant SPNs of CT images in future.
AB - Lung cancer is one of the most common forms of cancer resulting in over a million deaths per year worldwide. In this paper, the usage of support vector machine (SVM) classification for lung cancer is investigated, presenting a systematic quantitative evaluation against Boosting, Decision trees, k-nearest neighbor, LASSO regressions, neural networks and random forests. A large database of 5984 regions of interest (ROIs) and 488 input features (including textural features, patient characteristics, and morphological features) were used to train the classifiers and evaluate for their performance. The evaluation for classifiers' performance was based on a tenfold cross validation framework, receiver operating characteristic curve (ROC), and Matthews correlation coefficient. Area under curve (AUC) of SVM, Boosting, Decision trees, k-nearest neighbor, LASSO, neural networks, random forests were 0.94, 0.86, 0.73, 0.72, 0.91, 0.92, and 0.85, respectively. It was proved that SVM classification offered significantly increased classification performance compared to the reference methods. This scheme may be used as an auxiliary tool to differentiate between benign and malignant SPNs of CT images in future.
KW - CT image
KW - Curvelet
KW - Solitary pulmonary nodule
KW - Support vector machine
KW - Texture extraction
UR - http://www.scopus.com/inward/record.url?scp=84880035900&partnerID=8YFLogxK
U2 - 10.1016/j.cmpb.2013.04.016
DO - 10.1016/j.cmpb.2013.04.016
M3 - Article
C2 - 23727300
AN - SCOPUS:84880035900
SN - 0169-2607
VL - 111
SP - 519
EP - 524
JO - Computer Methods and Programs in Biomedicine
JF - Computer Methods and Programs in Biomedicine
IS - 2
ER -