TY - JOUR
T1 - Voice Biomarkers for Parkinson’s Disease Prediction Using Machine Learning Models with Improved Feature Reduction Techniques
AU - Chintalapudi, Nalini
AU - Dhulipalla, Venkata Rao
AU - Battineni, Gopi
AU - Rucco, Ciro
AU - Amenta, Francesco
N1 - Publisher Copyright:
© The Author(s) 2023.
PY - 2023/10/20
Y1 - 2023/10/20
N2 - As a chronic and life-threatening disease, Parkinson’s disease (PD) causes people to become rigid and inactive and have shaky voices. There is an argument that current PD detection techniques are ineffective due to their high latency and low accuracy. To enhance the accuracy of PD identification, voice recordings were used as biomarkers in conjunction with the synthetic minority oversampling technique (SMOTE). Three machine learning (ML) models namely support vector machine (SVM), K-nearest neighbors (KNN), and random forest (RF) were adopted to calculate the prediction accuracy. By applying an unsupervised dimensional reduction method, the generated model eliminates redundant data and speeds up training and testing. Model performance is estimated with three parameters, including accuracy, F1 score, and area under the curve (AUC) values. Experimental outcomes suggested that the RF model outperforms other models with 97.4% of classification accuracy. This type of research aims to analyze patient voice recordings to determine the disease severity.
AB - As a chronic and life-threatening disease, Parkinson’s disease (PD) causes people to become rigid and inactive and have shaky voices. There is an argument that current PD detection techniques are ineffective due to their high latency and low accuracy. To enhance the accuracy of PD identification, voice recordings were used as biomarkers in conjunction with the synthetic minority oversampling technique (SMOTE). Three machine learning (ML) models namely support vector machine (SVM), K-nearest neighbors (KNN), and random forest (RF) were adopted to calculate the prediction accuracy. By applying an unsupervised dimensional reduction method, the generated model eliminates redundant data and speeds up training and testing. Model performance is estimated with three parameters, including accuracy, F1 score, and area under the curve (AUC) values. Experimental outcomes suggested that the RF model outperforms other models with 97.4% of classification accuracy. This type of research aims to analyze patient voice recordings to determine the disease severity.
KW - accuracy
KW - AUC
KW - machine learning
KW - Parkinson’s disease
KW - SMOTE
UR - https://www.scopus.com/pages/publications/105020585273
U2 - 10.47852/bonviewJDSIS3202831
DO - 10.47852/bonviewJDSIS3202831
M3 - Article
AN - SCOPUS:105020585273
SN - 2972-3841
VL - 1
SP - 92
EP - 98
JO - Journal of Data Science and Intelligent Systems
JF - Journal of Data Science and Intelligent Systems
IS - 2
ER -