Multilevel Graph Clustering with Density-Based Quality Measures

More documents

Recommendations

Info

$Eine Einführung in LaTeX-Beamer - studiy - Brandenburgische ...$

3 The Multi-Level Refinement Algorithmthe previous subsections. Specific observations about greedy and Kernighan-Linrefinement are discussed in the next section.The implemented algorithms are marked in the table. Other algorithms are includedto show how they align within this design space. Sorted greedy refinementwith maxmod vertex ranking (sorted-maxmod) is not implemented because it is asslow as complete greedy but also is restricted in its search. Thus it has no advantagesover other greedy algorithms. The three fitness-based Kernighan-Lin algorithms KLmod,KL-eo, and KL-dens are not considered further because they do not reliablyfind local optima. Their vertex ranking combined with accepting quality decreasingmoves prevents this. Finally randomized algorithms are generally excluded for thesame reason. They are interesting just in combination with greedy refinement. Butthen too many combinations exist to be discussed in the scope of this work.3.4.2 Greedy RefinementData: graph,clustering,selectorResult: clusteringrepeatv ← selector:find best maxmod vertex;j ← selector:find best target cluster for v;if move v → C j is improving modularity thenmove v to cluster j and update selector;until move was not improving ;Figure 3.10: Refinement Method: Complete GreedyGreedy refinement algorithms are characterized by accepting only vertex movesthat increase modularity. The complete greedy algorithm, as displayed in Figure 3.10,uses the maximal modularity increase maxmod as vertex selector. This enforces theglobally best move in each iteration. Selecting and moving vertices is repeated untilthe modularity is not further improved. In each iteration |V | vertex rankings haveto be evaluated with each requiring O(|E|/|V | + max |C|) time in average. Selectingthe best target cluster and updating the cluster weights w(C(v)), w(C j ) costs lineartime in the number of incident edges. Moving the vertex is done in constant timeby updating the mapping C(v) which is stored in an array. Thus k moves requireO(k|E| + k|V | max |C|) time.To improve search speed the sorted greedy algorithm splits vertex and move selectioninto separate steps. Figure 3.11 displays the algorithm in pseudo-code. Theinner loop selects all vertices once sorted by their vertex ranking. Since the rankingchanges with each move all vertices are re-visited in each inner iteration. Withconstant-time rankings like mod-fitness this will cost O(|V | 2 ) time for the completeinner loop. Again the most expensive part is selecting the target cluster for thecurrent vertex. The inner loop does this exactly once for each vertex which costsO(|E| + |V | max |C|). Moving a vertex and updating cluster weights and vertex48
3.4 Cluster RefinementData: graph,clustering,selectorResult: clusteringrepeatmark all vertices as unmoved;while unmoved vertices exist dov ← selector:find best ranked, unmoved vertex;j ← selector:find best target cluster for v;mark v as moved;if move v → C j is improving modularity thenmove v to cluster j and update selector;// outer loop// inner loopuntil no improving move found;Figure 3.11: Refinement Method: Sorted Greedyfitness costs O(|E|/|V |) in average. Therefore the worst-case time for all iterationsof the inner loop is in O(|V | 2 + |E| + |V | max |C|).In case vertices were moved the outer loop restarts the refinement. This processesvertices which were visited early but became movable with modularity increase justafter later moves of other vertices. In practice only a small number of outer iterationsis necessary. These restarts ensures that sorted greedy always finds a local optimum:If at least one improving move exists its vertex will be visited and moved even ifit is not the best ranked vertex. Higher ranked vertices are simply skipped. Whenimprovements were found the refinement is restarted until no single improving moveexists. Nevertheless the found optimum depends on the vertex ranking. The variantsmay end up in different nearby local optima.3.4.3 Kernighan-Lin RefinementThe central idea of Kernighan-Lin refinement is to escape local optima by movingvertices with the least modularity decrease in case no improvements are possible.Selecting the least modularity decreasing moves is like a careful depth-first walk intothe surrounding clustering space along a ridge while avoiding clusterings of very lowmodularity.The basic algorithm is presented in the following subsection. Unfortunately it isnot very effective considering its long run-time. To improve this the next subsectionsanalyze two aspects of the dynamic behavior: The creation of clusters and theeffective search depth. Based on the results the algorithm is improved by restrictingthe search depth.Basic Algorithm The basic algorithm is shown in Figure 3.12. The best clusteringfound during the refinement is called peak clustering and is separately stored. Anew peak clustering would have to be stored when the clustering after a modularityincreasing move is better than the last peak. To save some work just the lastclustering in a series of modularity increasing moves is stored. This situation is49
Page 1:
Brandenburgische Technische Univers
Page 5 and 6:
ContentsList of FiguresList of Tabl
Page 7: List of Figures1.1 Graph of the Mex
Page 11 and 12: 1 IntroductionSince the rise of com
Page 13 and 14: 1.2 Objectives and Outline1.2 Objec
Page 15 and 16: 2 Graph ClusteringThis chapter intr
Page 17 and 18: 2.2 The Modularity Measure of Newma
Page 19 and 20: 2.3 Density-Based Clustering Qualit
Page 27 and 28: 2.4 Fundamental Clustering Strategi
Page 35 and 36: 3 The Multi-Level Refinement Algori
Page 37 and 38: 3.1 The Multi-Level Schemeas starti
Page 39 and 40: 3.2 Graph CoarseningData: graph,sel
Page 41 and 42: 3.2 Graph Coarseningnearly no edges
Page 43 and 44: 3.3 Merge SelectorsExtent Name Desc
Page 45 and 46: 3.3 Merge Selectorsdifferent size.
Page 47 and 48: 3.3 Merge SelectorsThe probability
Page 49 and 50: 3.3 Merge SelectorsAs selection qua
Page 51 and 52: 3.3 Merge Selectorsvectors the eige
Page 53 and 54: 3.4 Cluster Refinementleave the loc
Page 55 and 56: 3.4 Cluster Refinementmoving v from
Page 57: 3.4 Cluster RefinementAlgorithm Sea
Page 61 and 62: 3.4 Cluster RefinementModularity0.2
Page 63 and 64: 3.5 Further Implementation NotesInd
Page 65 and 66: 3.5 Further Implementation NotesBOO
Page 67: 3.5 Further Implementation Notesfor
Page 70 and 71: 4 Evaluationparameter component des
Page 72 and 73: 4 Evaluationsignificance scale also
Page 74 and 75: 4 EvaluationModularity by Match Fra
Page 76 and 77: 4 Evaluation5% 10% 30% 50% 100%G-no
Page 78 and 79: 4 Evaluationmean modularity0.50 0.5
Page 80 and 81: 4 Evaluation1 2 3 4RWreach-none 1 0
Page 82 and 83: 4 EvaluationG-none M-none G-sgrd M-
Page 84 and 85: 4 Evaluationmean modularity time DI
Page 86 and 87: 4 EvaluationRuntime vs. Graph SizeR
Page 88 and 89: 4 EvaluationComparison of Modularit
Page 90 and 91: 4 Evaluation(a) karate(b) dolphinsF
Page 92 and 93: 4 Evaluation(a) jazz(b) celegans me
Page 94 and 95: 4 Evaluationadministrators, and gra
Page 97 and 98: 5 Results and Future WorkThe object
Page 99 and 100: 5.3 Directions for Future Workstrat
Page 101: 5.3 Directions for Future Workties
Page 104 and 105: BIBLIOGRAPHY[14] B.L. Chamberlain.
Page 106 and 107: BIBLIOGRAPHY[42] H. Jeong, B. Tombo
Page 108 and 109:
BIBLIOGRAPHY[71] A. J. Soper and C.
Page 110 and 111:
A The Benchmark Graph Collectionsub
Page 112 and 113:
B Clustering ResultsRWreach-sgrd 1
Page 114:
B Clustering Resultswalktrap leadin
show all

Multilevel Graph Clustering with Density-Based Quality Measures

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?