Title: An effective algorithm for mining closed sequential patterns and their minimal generators based on prefix trees
Authors: Thi-Thiet Pham; Jiawei Luo; Bay Vo
Addresses: School of Information Science and Engineering, Hunan University, Changsha City, Hunan Province, 410082, China; Faculty of Information Technology, Industrial University of Ho Chi Minh City, 12 Nguyen Van Bao, GoVap, Ho Chi Minh City, Vietnam ' School of Information Science and Engineering, Hunan University, Changsha City, Hunan Province, 410082, China ' Information Technology College, 12 Tran Dinh Thao, Tan Phu, Ho Chi Minh City, Vietnam
Abstract: Sequential generator patterns and closed sequential patterns play an important role in data mining tasks. They were proposed to address difficult problems in mining sequential patterns and are often used together to generate non-redundant rules. Motivated by this role, this paper proposes an efficient algorithm called CloGen for mining closed sequential patterns and their minimal sequential generator patterns. The CloGen algorithm exploits the parent-child relationship in a prefix tree structure and adds fields to each node of the prefix tree to determine whether the node's pattern is a minimal sequential generator pattern or a closed sequential pattern. Experimental results show that CloGen runs more than one order of magnitude faster than other algorithms.
Keywords: closed sequential patterns; sequential generator patterns; prefix trees; data mining.
International Journal of Intelligent Information and Database Systems, 2013 Vol.7 No.4, pp.324 - 339
Received: 27 Oct 2012
Accepted: 24 Jan 2013
Published online: 10 Sep 2013
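
Illustrative note: the abstract refers to closed sequential patterns and minimal sequential generator patterns without defining them. The sketch below is not the CloGen algorithm and does not use a prefix tree. It is a brute-force illustration of the standard definitions, assuming sequences of single items rather than itemsets and ignoring the empty sequence. Under those assumptions, a frequent pattern is closed if no proper super-sequence has the same support, and it is a generator if no proper subsequence has the same support. All function names and the toy database are hypothetical.

```python
from itertools import combinations


def is_subsequence(pattern, sequence):
    """Return True if `pattern` occurs in `sequence` in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def support(pattern, database):
    """Count the sequences in `database` that contain `pattern`."""
    return sum(is_subsequence(pattern, seq) for seq in database)


def frequent_patterns(database, min_support):
    """Enumerate every non-empty subsequence and keep the frequent ones.

    Exponential in sequence length; suitable only for toy inputs.
    """
    candidates = set()
    for seq in database:
        for k in range(1, len(seq) + 1):
            for idx in combinations(range(len(seq)), k):
                candidates.add(tuple(seq[i] for i in idx))
    result = {}
    for pattern in candidates:
        sup = support(pattern, database)
        if sup >= min_support:
            result[pattern] = sup
    return result


def closed_and_generators(database, min_support):
    """Classify frequent patterns as closed and/or generator by definition."""
    freq = frequent_patterns(database, min_support)
    closed, generators = set(), set()
    for p, sup in freq.items():
        # Closed: no longer frequent pattern contains p with equal support.
        if not any(len(q) > len(p) and sq == sup and is_subsequence(p, q)
                   for q, sq in freq.items()):
            closed.add(p)
        # Generator: no shorter (non-empty) pattern inside p has equal support.
        if not any(len(q) < len(p) and sq == sup and is_subsequence(q, p)
                   for q, sq in freq.items()):
            generators.add(p)
    return freq, closed, generators


# Toy database (hypothetical): four sequences over the items a, b, c.
database = [("a", "b", "c"), ("a", "c"), ("a", "b", "c"), ("b",)]
freq, closed, generators = closed_and_generators(database, min_support=2)
```

On this toy database with a minimum support of 2, the closed patterns are (b), (a, c) and (a, b, c), and the generators are (a), (b), (c), (a, b) and (b, c). For example, (a, b) and (b, c) are generators of the closed pattern (a, b, c), since all three have support 2. An algorithm such as the one the abstract describes would avoid this exhaustive enumeration.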