BoostNSift: A Query Boosting and Code Sifting Technique for Method Level Bug Localization

Abdul Razzaq, Jim Buckley, James Vincent Patten, Muslim Chochlov, Ashish Rajendra Sai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Locating bugs is an important, but effort-intensive and time-consuming task, when dealing with large-scale systems. To address this, Information Retrieval (IR) techniques are increasingly being used to suggest potential buggy source code locations, for given bug reports. While IR techniques are very scalable, in practice their effectiveness in accurately localizing bugs in a software system remains low. Results of empirical studies suggest that the effectiveness of bug localization techniques can be augmented by the configuration of queries used to locate buggy code. However, in most IR-based bug localization techniques, presented by researchers, the impact of the queries' configurations is not fully considered. In a similar vein, techniques consider all code elements as equally suspicious of being buggy while localizing bugs, but this is not always the case either.In this paper, we present a new method-level, information-retrieval-based bug localization technique called "BoostNSift". BoostNSift exploits the important information in queries by 'boost'ing that information, and then 'sift's the identified code elements, based on a novel technique that emphasizes the code elements' specific relatedness to a bug report over its generic relatedness to all bug reports. To evaluate the performance of BoostNSift, we employed a state-of-The-Art empirical design that has been commonly used for evaluating file level IR-based bug localization techniques: 6851 bugs are selected from commonly used Eclipse, AspectJ, SWT, and ZXing benchmarks and made openly available for method-level analyses. The performance of BoostNSift is compared with the openly-Available state-of-The-Art IR-based BugLocator, BLUiR, and BLIA techniques. Experiments show that BoostNSift improves on BLUiR by up to 324%, on BugLocator by up to 297%, and on BLIA up to 120%, in terms of Mean Reciprocal Rank (MRR). Similar improvements are observed in terms of Mean Average Precision (MAP) and Top-N evaluation measures.

Original languageEnglish
Title of host publicationProceedings - IEEE 21st International Working Conference on Source Code Analysis and Manipulation, SCAM 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages81-91
Number of pages11
ISBN (Electronic)9781665448970
DOIs
Publication statusPublished - 2021
Event21st IEEE International Working Conference on Source Code Analysis and Manipulation, SCAM 2021 - Virtual, Luxembourg City, Luxembourg
Duration: 27 Sep 20211 Oct 2021

Publication series

NameProceedings - IEEE 21st International Working Conference on Source Code Analysis and Manipulation, SCAM 2021

Conference

Conference21st IEEE International Working Conference on Source Code Analysis and Manipulation, SCAM 2021
Country/TerritoryLuxembourg
CityVirtual, Luxembourg City
Period27/09/211/10/21

Keywords

  • Bug localization
  • Code Sifting
  • Code analysis
  • Query boosting
  • Query enhancement
  • Software maintenance

Fingerprint

Dive into the research topics of 'BoostNSift: A Query Boosting and Code Sifting Technique for Method Level Bug Localization'. Together they form a unique fingerprint.

Cite this