TY - JOUR
T1 - Mining and searching app reviews for requirements engineering
T2 - Evaluation and replication studies
AU - Dąbrowski, Jacek
AU - Letier, Emmanuel
AU - Perini, Anna
AU - Susi, Angelo
N1 - Publisher Copyright:
© 2023 The Author(s)
PY - 2023/3
Y1 - 2023/3
N2 - App reviews provide a rich source of feature-related information that can support requirement engineering activities. Analyzing them manually to find this information, however, is challenging due to their large quantity and noisy nature. To overcome the problem, automated approaches have been proposed for ‘feature-specific analysis’. Unfortunately, the effectiveness of these approaches has been evaluated using different methods and datasets. Replicating these studies to confirm their results and to provide benchmarks of different approaches is a challenging problem. We address the problem by extending previous evaluations and performing a comparison of these approaches. In this paper, we present two empirical studies. In the first study, we evaluate opinion mining approaches; the approaches extract features discussed in app reviews and identify their associated sentiments. In the second study, we evaluate approaches searching for feature-related reviews. The approaches search for users’ feedback pertinent to a particular feature. The results of both studies show these approaches achieve lower effectiveness than reported originally, and raise an important question about their practical use.
AB - App reviews provide a rich source of feature-related information that can support requirement engineering activities. Analyzing them manually to find this information, however, is challenging due to their large quantity and noisy nature. To overcome the problem, automated approaches have been proposed for ‘feature-specific analysis’. Unfortunately, the effectiveness of these approaches has been evaluated using different methods and datasets. Replicating these studies to confirm their results and to provide benchmarks of different approaches is a challenging problem. We address the problem by extending previous evaluations and performing a comparison of these approaches. In this paper, we present two empirical studies. In the first study, we evaluate opinion mining approaches; the approaches extract features discussed in app reviews and identify their associated sentiments. In the second study, we evaluate approaches searching for feature-related reviews. The approaches search for users’ feedback pertinent to a particular feature. The results of both studies show these approaches achieve lower effectiveness than reported originally, and raise an important question about their practical use.
KW - Empirical study
KW - Feature extraction
KW - Mining user reviews
KW - Searching for feature-related reviews
KW - Sentiment analysis
KW - Software engineering
UR - http://www.scopus.com/inward/record.url?scp=85147543940&partnerID=8YFLogxK
U2 - 10.1016/j.is.2023.102181
DO - 10.1016/j.is.2023.102181
M3 - Article
AN - SCOPUS:85147543940
SN - 0306-4379
VL - 114
JO - Information Systems
JF - Information Systems
M1 - 102181
ER -