Title: Mining recent frequent cooccurring items from transaction stream

Authors: Sumit Misra; Soumyadeep Thakur; Manosij Ghosh; Sanjoy Kumar Saha

Addresses: Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India ' Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India ' Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India ' Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India

Abstract: The work presents a simple but novel scheme to determine the recent frequent cooccurring items from transaction streams. It enables to follow the current trend, pattern of occurrence of the items and also help in recommendation. Proposed methodology works on a sequence of transaction forming a time-window. Based on the items present in a transaction, item-sets are formed and encoded. An in-memory structure maintains the frequency of the item-sets. Frequency is time-decayed count that prioritises the recent occurrences. To limit the size of in-memory structure, a pruning operation is carried out to remove the less frequent item-sets. Threshold for pruning is chosen dynamically. Impacts of decaying parameter and window size are also analysed. Experiment has been carried out on two datasets and performance has been compared with two recent works. Proposed methodology provides a better accuracy. Low time and space complexity make it suitable for stream analytic.

Keywords: recent frequent item-set; time decaying count; item pruning; stream analytic.

DOI: 10.1504/IJDS.2019.105239

International Journal of Data Science, 2019 Vol.4 No.4, pp.288 - 304

Received: 12 Oct 2018
Accepted: 14 Aug 2019

Published online: 22 Feb 2020 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article