Title: Identifying related commits from software repositories

Authors: Mustafa Hammad

Addresses: Department of Information Technology, Mutah University, Al-Karak, Mutah 61710, Jordan

Abstract: Source code modifications are saved in software repositories as individual and independent commits. A high-level programming task is usually applied by related or similar code changes activities. This paper presents an approach to automatically identify related and similar source code modifications from software repositories. Discovering related commits helps maintainers to understand and trace the implementation of a specific programming task. Furthermore, identifying commits of a programming task leads to simplify code fixing and debugging activities. The identification is based on discovering relations among commits from software repositories. A relation is exposed based on the textual similarity between commits. Therefore, commits relationships lead to categorise commits into disjoint groups. Each generated group would represent related or similar code modifications activities. A group can be a set of maintenance tasks related to a specific feature in the system. A case study on an open source project is presented to investigate the proposed approach.

Keywords: software maintenance; software repositories; source code modifications; textual similarity; similar modifications; related modifications.

DOI: 10.1504/IJCAT.2015.069335

International Journal of Computer Applications in Technology, 2015 Vol.51 No.3, pp.212 - 218

Published online: 13 May 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article