The Apriori property of sequence pattern mining with wildcard gaps
by Fan Min; Youxi Wu; Xindong Wu
International Journal of Functional Informatics and Personalised Medicine (IJFIPM), Vol. 4, No. 1, 2012

Abstract: In biological sequence analysis, long and frequently occurring patterns tend to be interesting. Data miners try to obtain frequent patterns with periodical wildcard gaps. However, with the existing definition set, the Apriori property does not hold; consequently, state-of-the-art algorithms are rather complex. This paper proposes an alternative definition of the number of offset sequences by adding a number of dummy characters. With the new definition, the Apriori property holds, hence our Apriori algorithm can mine all frequent patterns with minimal endeavour. This study also serves as the foundation of further research works on sequence pattern mining.

Online publication date: Tue, 20-Nov-2012

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Functional Informatics and Personalised Medicine (IJFIPM):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com