Title: A package-based clustering approach to enhance the accuracy and performance of software defect prediction
Authors: Rayhanul Islam; Kazi Sakib
Addresses: Institute of Information Technology, University of Dhaka, Dhaka, Bangladesh ' Institute of Information Technology, University of Dhaka, Dhaka, Bangladesh
Abstract: To enhance the accuracy and performance, software defect prediction models considering clustering of dataset combine related and similar features to improve the learning process of the model. Here, a clustering approach named package-based clustering has been proposed to group the similar and related parts of software using object oriented classes' relationships and similarities. To segregate software into clusters, it performs textual analysis to identify all object-oriented classes from source codes. Then it uses package information of each class to divide those into clusters. To analyse the performance of the proposed algorithm, linear regression model is used, which learns from clusters of related and similar classes. The experiment has been conducted on eight releases of two open source software, which are Xalan and Ant, and results show that the proposed technique outperforms the existing clustering algorithms those are BorderFlow and the entire system.
Keywords: software engineering; software testing; software defect prediction; SDP; package-based clustering; PBC.
International Journal of Software Engineering, Technology and Applications, 2017 Vol.2 No.1, pp.1 - 21
Received: 07 Jul 2015
Accepted: 12 Feb 2016
Published online: 03 Oct 2017 *