Enabling improved IR-based feature location

Dave Binkley,Dawn Lawrie,Christopher Uehlinger,Daniel Heinz

doi:10.1016/j.jss.2014.11.013

Abstract

Recent solutions to software engineering problems have incorporated tools and techniques from information retrieval (IR). The use of IR requires choosing an appropriate retrieval model and deciding on a query that best captures a particular information need. Taking feature location as a representative example, three research questions are investigated: (1) the impact of query preprocessing, (2) the impact that different scraping techniques for queries have on retrieval performance, (3) the performance impact that the underlying retrieval model has on identifying the correct source-code functions (the correct documents). These research questions are addressed using the five open source projects released as part of the SEMERU dataset. In the experiments, five methods of scraping queries from modification requests and seven retrieval model instances are considered. Using the standard evaluation metric Mean Reciprocal Rank (MRR), the experimental analysis reveals that better retrieval models are not the ones commonly used by software engineering researchers. Results find that models based on query-likelihood perform about twice as well as models in common use in software engineering such as LSI and thus deserve greater attention. Furthermore, corpus preprocessing has a significant impact as the top performing setting is over 100% better than the average.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enabling improved IR-based feature location

Abstract

Talk to us

Similar Papers

More From: Journal of Systems and Software

Lead the way for us

Journal: Journal of Systems and Software	Publication Date: Nov 20, 2014
Citations: 18

Similar Papers

The need for software specific natural language techniques
Dave Binkley ... Christopher Morrell
Empirical Software Engineering | VOL. 23
Dave Binkley, et. al.Dave Binkley ... Christopher Morrell
25 Nov 2017
Empirical Software Engineering | VOL. 23

A Case for Software Specific Natural Language Techniques
David Binkley ... Dawn Lawrie
-
David Binkley, et. al.David Binkley ... Dawn Lawrie
01 Oct 2016
01 Oct 2016

Research in software engineering: an analysis of the literature
R.L Glass ... V Ramesh
Information and Software Technology | VOL. 44
R.L Glass, et. al.R.L Glass ... V Ramesh
16 Apr 2002
Information and Software Technology | VOL. 44

How to Treat the Use of Grey Literature in Software Engineering
Xin Zhou
-
Xin ZhouXin Zhou
26 Jun 2020
26 Jun 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enabling improved IR-based feature location

Abstract

Talk to us

Similar Papers

More From: Journal of Systems and Software