Authors: Cao Duy Truong; Duong Tuan Anh
Addresses: Faculty of Computer Science and Engineering, Ho Chi Minh City University of Technology, 268 Ly Thuong Kiet, Dist. 10, Ho Chi Minh City, Vietnam ' Faculty of Computer Science and Engineering, Ho Chi Minh City University of Technology, 268 Ly Thuong Kiet, Dist. 10, Ho Chi Minh City, Vietnam
Abstract: Motifs and anomalies are two important representative patterns in a time series. Existing approaches usually handle motif discovery and anomaly detection in time series separately. In this paper, we propose a new efficient clustering-based method for discovering motif and detecting anomaly at the same time in large time series data. Our method first extracts motif/anomaly candidates from a time series by using significant extreme points and then clusters the candidates by using BIRCH algorithm. The proposed method computes anomaly scores for all sub-clusters and discovers the top motif based on the sub-cluster with the smallest anomaly score and detects the top anomaly based on the sub-cluster with the largest anomaly score. Experimental results on several benchmark datasets show that our proposed method can discover precise motif and anomaly with high time efficiency on large time series data.
Keywords: time series; motif discovery; anomaly detection; BIRCH algorithm; clustering algorithms; important extrema; motifs; anomalies.
International Journal of Business Intelligence and Data Mining, 2015 Vol.10 No.4, pp.356 - 377
Available online: 04 Oct 2015 *Full-text access for editors Access for subscribers Purchase this article Comment on this article