Title: Mining traces between source code and textual documents
Authors: Amir Hossein Rasekh; Seyed Mostafa Fakhrahmad; Mohammad Hadi Sadreddini
Addresses: Computer Science and Engineering Department, Shiraz University, Shiraz, Iran ' Computer Science and Engineering Department, Shiraz University, Shiraz, Iran ' Computer Science and Engineering Department, Shiraz University, Shiraz, Iran
Abstract: Currently, researchers in computer science are dealing with a major challenge to link the source codes with the software documents. Writing informal documents by using natural and unstructured language causes this problem. In this paper, we present a model for recovery of traceable links between the source code and requirement documents. The proposed method in this paper is executed in four interconnected sections. The first section goes through extracting the features from the documents, which is followed by extracting the features from the source code. During the third section, abbreviations will be completed, using similarity measure as a feature. Finally, data mining algorithms will be implemented to find the hidden links between source code and software documentation. The most outstanding advantage of using this method is to be independent from the language. Also the preliminary results show that the proposed method has a good performance.
Keywords: traceability; data mining; artefacts; documentation; source code; similarity.
DOI: 10.1504/IJCAT.2019.097116
International Journal of Computer Applications in Technology, 2019 Vol.59 No.1, pp.43 - 52
Received: 20 Feb 2017
Accepted: 03 Nov 2017
Published online: 21 Dec 2018 *