A Priori Algorithm

Apriori is an efficient association rule mining algorithm, developed by Agrawal et al (Agrawal 93, Agrawal 94) Apriori (Agrawal 94) employs breadth-first search and uses a hash tree structure to count candidate item sets efficiently. The algorithm generates candidate item sets (patterns) of length k from k-1 length item sets. Then, the patterns which have an infrequent sub pattern are pruned. According to the downward closure lemma, the generated candidate set contains all frequent k length item sets. Following that, the whole transaction database is scanned to determine frequent item sets among the candidates. For determining frequent items in a fast manner, the algorithm uses a hash tree to store candidate itemsets. Note: A hash tree has item sets at the leaves and hash tables at internal nodes (Zaki, 99). Apriori is designed to operate on databases containing transactions (eg: collection of items bought by customers or details of a website frequentation). Other algorithms are designed for finding association rules in data having no transactions (Winepi and Minepi), or having no timestamps (dna sequencing).

Algorithm

Apriori(T,\varepsilon)
     L_1 \gets \{  large 1-itemsets  \}      k \gets 2     while  L_{k-1} \neq \varnothing          C_k \gets Generate(L_k-1)         for transactions t \in T             C_t \gets Subset(C_k,t)             for candidates c \in C_t                 \mathrm{count}c \gets \mathrm{count}c+1         L_k \gets \{ c \in C_k | ~ \mathrm{count}c \geq \varepsilon \}         k \gets k+1     return \bigcup_k L_k 

References

  • Rakesh Agrawal and Tomasz Imielinski and Arun N. Swami, Mining Association Rules between Sets of Items in Large Databases, Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data.
  • Rakesh Agrawal and Ramakrishnan Srikant, Fast Algorithms for Mining Association Rules, Proc. 20th Int. Conf. Very Large Data Bases (VLDB), 1994.
  • Heikki Mannila and Hannu Toivonen and A. Inkeri Verkamo, Efficient algorithms for discovering association rules, AAAI Workshop on Knowledge Discovery in Databases (KDD-94), 1994.
  • Mohammed Javeed Zaki and Srinivasan Parthasarathy and Mitsunori Ogihara and Wei Li, Parallel Algorithms for Discovery of Association Rules, Data Mining and Knowledge Discovery, 1997.

 

<< PreviousWord BrowserNext >>
supraoptic nucleus
wondelgem
green imperial pigeon
omagh town fc
ivar vidfamne
enjoy yourself
paraventricular nucleus
blasphemy law in the united kingdom
robert w. behrman
ren caill
rhythm of love
nabil khatib
star of bethlehem
jesper harding
carl wilson
karl albrecht
polzeath
dutch literature
fngraph
definitions of palestine and palestinian
suprachiasmatic nucleus
titadine
brood parasite
nikolay shvernik
beta carboline
bishop of nottingham
war of aggression
video game studies
xun zi
sarabande
list of xindi
moses annenberg
lane theological seminary
goma 2
uncle fester (author)
linktv
marshal of the soviet union
legans
onyx (band)
quebec conference, 1943
beijing subway
mosaic: world news from the middle east
leo (disambiguation)
uncle fester