TY - JOUR
T1 - A comparative UHPLC-Q/TOF-MS-based metabolomics approach coupled with machine learning algorithms to differentiate Keemun black teas from narrow-geographic origins
AU - Peng, Chuan yi
AU - Ren, Yin feng
AU - Ye, Zhi hao
AU - Zhu, Hai yan
AU - Liu, Xiao qian
AU - Chen, Xiao tong
AU - Hou, Ru yan
AU - Granato, Daniel
AU - Cai, Hui mei
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2022/8
Y1 - 2022/8
N2 - Geographic-label is a remarkable feature for Chinese tea products. In this study, the UHPLC-Q/TOF-MS-based metabolomics approach coupled with chemometrics was used to determine the five narrow-geographic origins of Keemun black tea. Thirty-nine differentiated compounds (VIP > 1) were identified, of which eight were quantified. Chemometric analysis revealed that the linear discriminant analysis (LDA) classification accuracy model is 91.7%, with 84.7% cross-validation accuracy. Three machine learning algorithms, namely feedforward neural network (FNN), random forest (RF) and support vector machine (SVM), were introduced to improve the recognition of narrow-geographic origins, the performances of the model were evaluated by confusion matrix, receiver operating characteristic curve (ROC) and area under the curve (AUC). The recognition of RF, SVM and FNN for Keemun black tea from five narrow-geographic origins were 87.5%, 94.44%, and 100%, respectively. Importantly, FNN exhibited an excellent classification effect with 100% accuracy. The results indicate that metabolomics fingerprints coupled with chemometrics can be used to authenticate the narrow-geographic origins of Keemun black teas.
AB - Geographic-label is a remarkable feature for Chinese tea products. In this study, the UHPLC-Q/TOF-MS-based metabolomics approach coupled with chemometrics was used to determine the five narrow-geographic origins of Keemun black tea. Thirty-nine differentiated compounds (VIP > 1) were identified, of which eight were quantified. Chemometric analysis revealed that the linear discriminant analysis (LDA) classification accuracy model is 91.7%, with 84.7% cross-validation accuracy. Three machine learning algorithms, namely feedforward neural network (FNN), random forest (RF) and support vector machine (SVM), were introduced to improve the recognition of narrow-geographic origins, the performances of the model were evaluated by confusion matrix, receiver operating characteristic curve (ROC) and area under the curve (AUC). The recognition of RF, SVM and FNN for Keemun black tea from five narrow-geographic origins were 87.5%, 94.44%, and 100%, respectively. Importantly, FNN exhibited an excellent classification effect with 100% accuracy. The results indicate that metabolomics fingerprints coupled with chemometrics can be used to authenticate the narrow-geographic origins of Keemun black teas.
KW - Camellia sinensis tea
KW - Machine learning algorithms
KW - Metabolomics fingerprints
KW - Narrow-geographic origin
KW - Phenolic compounds
UR - http://www.scopus.com/inward/record.url?scp=85132911511&partnerID=8YFLogxK
U2 - 10.1016/j.foodres.2022.111512
DO - 10.1016/j.foodres.2022.111512
M3 - Article
C2 - 35840220
AN - SCOPUS:85132911511
SN - 0963-9969
VL - 158
SP - 111512
JO - Food Research International
JF - Food Research International
M1 - 111512
ER -