Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Example
I = {I1, I2, I5} Confidence Threshold : 70%
Non empty subsets: {I1, I2}, {I1, I5}, {I2, I5}
{I1}, {I2}, {I5}
I1
I1
I2 I
I1
I2
I5
Improving the Efficiency of Apriori
Hash based technique
Transaction reduction
A transaction which does not contain k frequent
itemsets cannot contain k+1 frequent itemsets
Partitioning
Sampling
Dynamic itemset counting
Start points
Algorithm
Procedure FP_growth (Tree, a)
If Tree contains a single path P then
for each combination of nodes- b generate b a with
support = min. support of nodes in b
else for each xi in the header of the Tree
{
generate pattern b = xi
construct bs conditional pattern base and bs
conditional FP_tree Treeb
if Treeb < > NULL then call FP_growth(Treeb, b)
}
Features
Finds long frequent patterns by looking for shorter
ones recursively
Items in frequency descending order: the more
frequently occurring, the more likely to be shared
Main-memory based FP-tree
Efficient and scalable
Faster than Apriori