Title: Mining expertise of developers from software repositories

Authors: Maen Hammad; Haneen Hijazi; Mustafa Hammad; Ahmed Fawzi Otoom

Addresses: Department of Software Engineering, The Hashemite University, Zarqa, Jordan; Department of Computer Information Systems, The Hashemite University, Zarqa, Jordan; Department of Computer Science, Mutah University, Mutah, Al Karak, Jordan; Department of Software Engineering, The Hashemite University, Zarqa, Jordan

Abstract: This paper presents a technique for mining developers' contributions to explore their expertise in open source projects. The technique analyses the keywords that appear in the textual content of commits. It is lightweight because the commit text is analysed without any syntactic code differencing. Each developer is linked with a list of the keywords that appeared in their commits, together with the frequency of each keyword. Based on these keywords, three types of expertise are defined: unique, common and frequent. The identified expertise can help in identifying topics or issues that are handled by a specific developer or by a group of developers. A tool was developed to automatically mine and analyse committed code changes to support expertise identification. A case study on three open source projects shows how the proposed technique can be applied. The observations from the study show that frequent terms provide useful information about developers' expertise.
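
The keyword-based idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' tool. The sample commits, the function names (`keyword_frequencies`, `classify`), the tokenisation rule and the `min_count` threshold are all hypothetical. The abstract names three types of expertise but does not define them, so the definitions used here (unique: a keyword used by exactly one developer; common: a keyword used by every developer; frequent: a keyword a developer used at least `min_count` times) are assumptions made only for illustration.

```python
from collections import Counter
import re

# Hypothetical input: (developer, commit text) pairs; not data from the paper.
COMMITS = [
    ("alice", "fix parser crash on empty input"),
    ("alice", "parser: handle unicode tokens"),
    ("bob", "update build script"),
    ("bob", "fix build on windows"),
    ("carol", "fix typo in docs"),
]


def keyword_frequencies(commits):
    """Map each developer to a Counter of lowercase word tokens in their commits."""
    freqs = {}
    for dev, text in commits:
        # Plain text tokenisation only; no syntactic code differencing.
        tokens = re.findall(r"[a-z]+", text.lower())
        freqs.setdefault(dev, Counter()).update(tokens)
    return freqs


def classify(freqs, min_count=2):
    """Group each developer's keywords under assumed definitions:

    unique   - keyword used by exactly one developer
    common   - keyword used by every developer
    frequent - keyword the developer used at least min_count times
    """
    # Count how many developers used each keyword at least once.
    owners = Counter()
    for counter in freqs.values():
        owners.update(counter.keys())
    n_devs = len(freqs)

    result = {}
    for dev, counter in freqs.items():
        result[dev] = {
            "unique": sorted(k for k in counter if owners[k] == 1),
            "common": sorted(k for k in counter if owners[k] == n_devs),
            "frequent": sorted(k for k, c in counter.items() if c >= min_count),
        }
    return result


freqs = keyword_frequencies(COMMITS)
expertise = classify(freqs)
```

With the sample data, `expertise["alice"]["frequent"]` is `["parser"]` and `"fix"` is the only keyword shared by all three developers. The sketch applies no stop-word filtering, so words such as "in" are counted like any other keyword; a practical pipeline would likely filter them.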

Keywords: software maintenance and evolution; mining software repositories; expertise mining.

DOI: 10.1504/IJCAT.2020.106581

International Journal of Computer Applications in Technology, 2020, Vol. 62, No. 3, pp. 227-239

Received: 04 Jul 2019
Accepted: 12 Aug 2019

Published online: 10 Apr 2020
