Given: E-a set of data events k-the number of clusters LEF- the clustering quality criterion Choose initial“seedevents from Determine a star for each seed against the other seed events By appropriately modifying and selecting complexes from stars, construct a disjoint cover of E that optimizes the criterion LEF Y Is the termination END Criterion satisfied? N
Choose initial “seed” events from E Determine a star for each seed against the other seed events By appropriately modifying and selecting complexes from stars, construct a disjoint cover of E that optimizes the criterion LEF Is the termination Criterion satisfied? END a Given: E-a set of data events k-the number of clusters LEFthe clustering quality criterion Y N
Is the clustering quality improving Y Choose k new seeds which Choose k new seeds which Are central events are“ border' events a
Is the clustering quality improving? Choose k new seeds which Are central events Choose k new seeds which are “border” events a Y N
XI X2 e abcabc e3 4 e5 e6 2 ab e7 e 8 ce9 e10 20120 X4 X3
a b c e3 a e4 b c e5 a e6 b e7 e8 c e9 e10 0 1 2 0 1 2 0 1 2 e1 e2 X1 X2 0 1 2 X4 0 1 2 X3
Event Ⅹ2 X3 Ⅹ4 e2 000 00101 022 5 6 e e ee 89 22222 abcacabbcc 0102 01202 e 0
Event X1 X2 X3 X4 e1 0 a 0 1 e2 0 b 0 0 e3 0 c 1 2 e4 1 a 0 2 e5 1 c 1 1 e6 2 a 1 0 e7 2 b 0 1 e8 2 b 1 2 e9 2 c 0 0 e10 2 c 2 2
f K=2; LEF-Sparseness, Complexity, Termination criterion: base=2. probe=2 Iteration 1 Step I Select seed ele2 Step 2 Produce Stars: RG(ele2m)RG(e2el,m)m=5 RG(ell2,m)={[x2=ax3=0V1][X4=1V2]} RG(e2el, m=x2=b Vcl[x4=0 V2)
d f a b c K=2 ; LEF-sparseness, Complexity; Termination criterion:base=2,probe=2 Iteration 1 Step 1: Select seed: e1,e2 Step 2: Produce Stars: RG(e1|e2,m) RG(e2|e1,m) m=5 RG(e1|e2,m)={[x2=a][x3=0∨1],[X4=1 ∨2]} RG(e2|e1,m)={[x2=b ∨c],[x4=0 ∨2]}