karam gouda|Publications:Fast Vertical Mining Using Diffsets. KDD 2003

You are in:Home/Publications/Fast Vertical Mining Using Diffsets. KDD 2003
Prof. karam gouda :: Publications:

Title:	Fast Vertical Mining Using Diffsets. KDD 2003
Authors:	Mohammed J. Zaki and Karam Gouda
Year:	2003
Keywords:	Not Available
Journal:	Not Available
Volume:	Not Available
Issue:	Not Available
Pages:	Not Available
Publisher:	Not Available
Local/International:	International
Paper Link:	Not Available
Full paper	Karm abdelghany abdelrahman goda_SIGKDD03-diffsets.pdf
Supplementary materials	Not Available

Abstract:

A number of vertical mining algorithms have been proposed recently for association mining, which have shown to be very eective and usually outperform horizontal approaches. The main advantage of the vertical format is support for fast frequency counting via intersection operations on transaction ids (tids) and automatic pruning of irrelevant data. The main problem with these approaches is when intermediate results of vertical tid lists become too large for memory, thus aecting the algorithm scalability. In this paper we present a novel vertical data representation called Diset, that only keeps track of dierences in the tids of a candidate pattern from its generating frequent patterns. We show that disets drastically cut down the size of memory required to store intermediate results. We show how disets, when incorporated into previous vertical mining methods, increase the performance signicantly.

Prof. karam gouda :: Publications: