Title: Discovery of characteristic patterns from tabular structured data including missing values

Authors: Shigeaki Sakurai, Kouichirou Mori

Addresses: Corporate Research and Development Center, Toshiba Corporation, 1, Komukai-Toshiba-cho, Saiwai-ku, Kawasaki 212-8582, Japan; Corporate Research and Development Center, Toshiba Corporation, 1, Komukai-Toshiba-cho, Saiwai-ku, Kawasaki 212-8582, Japan

Abstract: This paper proposes a method for handling missing values in the discovery of frequent patterns. It also proposes a method that effectively discovers patterns from examples composed of attributes and their values. The method generates candidate patterns from combinations of attributes and combinations of attribute values, and evaluates them using two supports. These supports are calculated from the number of examples that have no missing values in the attributes composing the target items and the number of examples that have no missing values in any attribute, respectively. The proposed method is verified by comparing it with existing methods for dealing with missing values.
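
The two supports mentioned in the abstract can be illustrated with a minimal sketch. This is only one possible reading of the abstract, not the authors' published definitions: the function name `two_supports`, the use of `None` to mark a missing value, and the choice to count matches only within each pool of examples are all assumptions made for illustration.

```python
from typing import Dict, List, Optional, Tuple

Example = Dict[str, Optional[str]]  # assumption: None marks a missing value
Pattern = Dict[str, str]            # attribute -> required value


def two_supports(examples: List[Example], pattern: Pattern) -> Tuple[float, float]:
    """Return two support values for `pattern` (hypothetical helper).

    First value: share of matching examples among those with no missing
    value in the attributes the pattern uses.
    Second value: share of matching examples among those with no missing
    value in any attribute.
    Assumes every example lists every attribute as a key.
    """
    attrs = list(pattern)

    # Examples whose values are known for every attribute in the pattern.
    known_on_pattern = [
        e for e in examples if all(e.get(a) is not None for a in attrs)
    ]
    # Examples with no missing value at all.
    complete = [e for e in examples if all(v is not None for v in e.values())]

    def matches(e: Example) -> bool:
        return all(e.get(a) == v for a, v in pattern.items())

    def ratio(pool: List[Example]) -> float:
        # An empty pool yields 0.0 rather than a division error.
        return sum(matches(e) for e in pool) / len(pool) if pool else 0.0

    return ratio(known_on_pattern), ratio(complete)


examples = [
    {"color": "red", "size": "L", "shape": "round"},
    {"color": "red", "size": None, "shape": "round"},
    {"color": "blue", "size": "L", "shape": None},
    {"color": "red", "size": "S", "shape": "square"},
]
s_pattern, s_complete = two_supports(examples, {"color": "red", "shape": "round"})
```

Under this reading, an example with a missing value in an unrelated attribute still contributes to the first support but not to the second, which is why the two values can differ for the same pattern.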

Keywords: missing values; attributes; tabular structured data; apriori properties; attribute constraints; frequent patterns.

DOI: 10.1504/IJBIDM.2010.033359

International Journal of Business Intelligence and Data Mining, 2010 Vol.5 No.3, pp.213 - 230

Published online: 01 Jun 2010
