Inderscience Publishers. Article: A data mining approach for efficient selection bitmap join index. Journal: International Journal of Data Mining, Modelling and Management (IJDMMM), 2010 Vol.2 No.3, pp.238 - 251

Title: A data mining approach for efficient selection bitmap join index

Authors: Hamid Necir

Addresses: Research Laboratory in Artificial Intelligence (LRIA), Department of Computer Science, Faculty of Electrical and Computer Science, USTHB, University Science and Technology Houari Boumediene, El Alia BP 32, Bab Ezzouar, Algiers, Algeria

Abstract: The amount of information in a data warehouse tends to be extremely large, and queries may involve several complex join and aggregate operations at the same time. To improve the performance of these queries, database administrators often use indices. However, selecting an optimal set of indices is a very hard task because of the exponential number of candidate attributes that can be used in the selection process. To deal with this problem, we propose a data mining pruning approach based on maximal frequent itemsets that represent the candidate attributes for the index selection process. The main particularity of our pruning approach, compared to the existing ones, is that it uses parameters other than the frequency constraint and respects the monotonicity and anti-monotonicity properties. A greedy algorithm is proposed to select indices using a subset of the candidate attributes. These indices minimise the query processing cost and satisfy the storage constraint. We validate the proposed algorithm with an experimental evaluation.
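
Illustration: The two-stage idea in the abstract (prune the candidate attributes with maximal frequent itemsets, then select greedily under a storage constraint) can be sketched roughly as follows. This is a minimal sketch and not the article's algorithm: the workload, the `min_support` threshold, the `benefit` and `size` figures, and all function names are hypothetical, the mining step is brute force, and only frequency is used for pruning, whereas the article states that its approach also uses other parameters.

```python
from itertools import combinations


def frequent_itemsets(queries, min_support):
    """Map each attribute set used by at least min_support queries to its support."""
    attrs = sorted(set().union(*queries))
    frequent = {}
    for k in range(1, len(attrs) + 1):
        found = False
        for combo in combinations(attrs, k):
            itemset = frozenset(combo)
            support = sum(1 for q in queries if itemset <= q)
            if support >= min_support:
                frequent[itemset] = support
                found = True
        if not found:
            # Anti-monotonicity of support: if no set of size k is
            # frequent, no larger set can be frequent either.
            break
    return frequent


def maximal_itemsets(frequent):
    """Keep only the frequent itemsets that have no frequent proper superset."""
    return [s for s in frequent if not any(s < t for t in frequent)]


def greedy_select(candidates, benefit, size, budget):
    """Pick attributes by benefit per unit of storage until the budget is exhausted."""
    order = sorted(sorted(candidates), key=lambda a: benefit[a] / size[a], reverse=True)
    chosen, used = [], 0
    for attr in order:
        if used + size[attr] <= budget:
            chosen.append(attr)
            used += size[attr]
    return chosen


# Hypothetical workload: each query is the set of dimension attributes it filters on.
queries = [
    {"city", "month"},
    {"city", "month", "category"},
    {"city", "category"},
    {"gender"},
    {"city", "month"},
]

maximal = maximal_itemsets(frequent_itemsets(queries, min_support=2))
candidates = set().union(*maximal)  # attributes that survive pruning

# Made-up cost figures: estimated saving and index size per attribute.
benefit = {"city": 40, "month": 30, "category": 10}
size = {"city": 50, "month": 12, "category": 20}

chosen = greedy_select(candidates, benefit, size, budget=60)
print(sorted(map(sorted, maximal)), chosen)
```

In this toy workload the attribute `gender` appears in a single query, so it is pruned before the greedy step ever considers it; the greedy step then skips `city` because its assumed index size would exceed the assumed storage budget.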

Keywords: bitmap join indices; BJIs; data mining; data warehousing; pruning; index selection.

DOI: 10.1504/IJDMMM.2010.033535

International Journal of Data Mining, Modelling and Management, 2010 Vol.2 No.3, pp.238 - 251

Published online: 04 Jun 2010


