Recovering Relationships between Documentation and Source Code based on the Characteristics of Software Engineering

Xiaobo Wang,Chao Liu,Guanhui Lai

doi:10.1016/j.entcs.2009.07.009

Abstract

Software documentation is usually expressed in natural languages, which contains much useful information. Therefore, establishing the traceability links between documentation and source code can be very helpful for software engineering management, such as requirement traceability, impact analysis, and software reuse. Currently, the recovery of traceability links is mostly based on information retrieval techniques, for instance, probabilistic model, vector space model, and latent semantic indexing. Previous work treats both documentation and source code as plain text files, but the quality of retrieved links can be improved by imposing additional structure using that they are software engineering documents. In this paper, we present four enhanced strategies to improve traditional LSI method based on the special characteristics of documentation and source code, namely, source code clustering, identifier classifying, similarity thesaurus, and hierarchical structure enhancement. Experimental results show that the first three enhanced strategies can increase the precision of retrieved links by 5 % ∼ 16 % , while the the fourth strategy is about 13%.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronic Notes in Theoretical Computer Science	Publication Date: Jul 1, 2009
Citations: 31	License type: cc-by-nc-nd

R Discovery Prime

Recovering Relationships between Documentation and Source Code based on the Characteristics of Software Engineering

Abstract

Published Version

Talk to us

Similar Papers

More From: Electronic Notes in Theoretical Computer Science

Lead the way for us

Similar Papers

Improving Bug Location Using Binary Class Relationships
Nasir Ali ... Giuliano Antoniol
-
Nasir Ali, et. al.Nasir Ali ... Giuliano Antoniol
01 Sep 2012
01 Sep 2012

Trustrace: Mining Software Repositories to Improve the Accuracy of Requirement Traceability Links
Nasir Ali ... Giuliano Antoniol
IEEE Transactions on Software Engineering | VOL. 39
Nasir Ali, et. al.Nasir Ali ... Giuliano Antoniol
01 May 2013
IEEE Transactions on Software Engineering | VOL. 39

Using code ownership to improve IR-based Traceability Link Recovery
Diana Diaz ... Andrea De Lucia
-
Diana Diaz, et. al.Diana Diaz ... Andrea De Lucia
01 May 2013
01 May 2013

Comparison of Information Retrieval Techniques for Traceability Link Recovery
Danissa V Rodriguez ... Doris L Carver
-
Danissa V Rodriguez, et. al.Danissa V Rodriguez ... Doris L Carver
01 Mar 2019
01 Mar 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Recovering Relationships between Documentation and Source Code based on the Characteristics of Software Engineering

Abstract

Published Version

Talk to us

Similar Papers

More From: Electronic Notes in Theoretical Computer Science