Title: Mining traces between source code and textual documents

Authors: Amir Hossein Rasekh; Seyed Mostafa Fakhrahmad; Mohammad Hadi Sadreddini

Addresses: Computer Science and Engineering Department, Shiraz University, Shiraz, Iran ' Computer Science and Engineering Department, Shiraz University, Shiraz, Iran ' Computer Science and Engineering Department, Shiraz University, Shiraz, Iran

Abstract: Currently, researchers in computer science are dealing with a major challenge to link the source codes with the software documents. Writing informal documents by using natural and unstructured language causes this problem. In this paper, we present a model for recovery of traceable links between the source code and requirement documents. The proposed method in this paper is executed in four interconnected sections. The first section goes through extracting the features from the documents, which is followed by extracting the features from the source code. During the third section, abbreviations will be completed, using similarity measure as a feature. Finally, data mining algorithms will be implemented to find the hidden links between source code and software documentation. The most outstanding advantage of using this method is to be independent from the language. Also the preliminary results show that the proposed method has a good performance.

Keywords: traceability; data mining; artefacts; documentation; source code; similarity.

DOI: 10.1504/IJCAT.2019.097116

International Journal of Computer Applications in Technology, 2019 Vol.59 No.1, pp.43 - 52

Received: 20 Feb 2017
Accepted: 03 Nov 2017

Published online: 21 Dec 2018 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article