Multilevel Graph Clustering with Density-Based Quality Measures

4 Evaluation

Method        mean modularity   time DIC28 main   mod. DIC28 main
none          0.56849             24.11900        0.80154
SGR-mod       0.59661             75.93500        0.84754
SGR-eo        0.59664             74.89400        0.84727
CGR           0.59672            798.17800        0.84758
SGR-density   0.59677             76.71600        0.84747
KL            0.59792           4672.12100        0.84781

Table 4.8: Mean Modularity by Refinement Method (reduced set). The first column contains mean modularities and the second column lists the runtime on the graph DIC28 main.

                  none      SGR-density   CGR       KL
mean modularity   0.52305   0.54608       0.54610   0.54810

Table 4.9: Mean Modularity by Refinement Method (large set)

eo) no significant differences in modularity are visible. Their clustering results are also comparable to complete greedy refinement (CGR). On the graph DIC28 main the complete greedy refinement was about 10 times slower than any sorted greedy refinement. This is in agreement with the higher worst-case complexity of the complete greedy refinement. Therefore sorted greedy refinement with the density-fitness vertex selector (SGR-density) is chosen as the default greedy refinement method.

4.4.2 Kernighan-Lin Refinement

This subsection analyzes how much the refinement variants improve the clustering results compared to no refinement. This includes the question of whether Kernighan-Lin refinement performs significantly better than greedy refinement. The algorithms are configured as in the previous subsection. The variants considered are none, SGR-density, CGR, and KL. The evaluation is applied to the large graph set to obtain more reliable mean modularity values. Table 4.9 lists the resulting mean modularity values and Figure 4.8 provides a bar plot of the same values. Appendix B.4 contains a table of the individual modularity values obtained by the algorithms on each graph.

On the large graph set, sorted greedy refinement (SGR-density) improved the mean modularity by 4.4% compared to no refinement¹. This range provides the significance scale, as in the previous evaluations. In comparison, Kernighan-Lin refinement (KL) improved the results by 4.79%. The mean improvement of Kernighan-Lin refinement over sorted greedy refinement was 0.37%. The runtime and modularity values of the graph DIC28 main from the previous subsection (Table 4.8) show that Kernighan-Lin refinement was about 60 times slower than sorted greedy refinement (4672.1 versus 76.7). At the same time it improved the modularity there by just 0.04%. As on the reduced set, sorted greedy refinement (SGR-density) was as good as complete greedy refinement (CGR) on the large graph set.

¹ max / min ∗ 100%
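The percentages quoted above can be reproduced from the mean modularity values in Table 4.9. The following is a minimal sketch, assuming each improvement is the relative gain (larger value / smaller value - 1) ∗ 100%. The helper name `relative_improvement` and the dictionary layout are illustrative and not part of the thesis implementation.

```python
# Mean modularity values copied from Table 4.9 (large graph set).
mean_modularity = {
    "none": 0.52305,
    "SGR-density": 0.54608,
    "CGR": 0.54610,
    "KL": 0.54810,
}


def relative_improvement(new: float, base: float) -> float:
    """Percentage gain of `new` over `base` (assumed formula: (new / base - 1) * 100)."""
    return (new / base - 1.0) * 100.0


# Gain of each refinement variant over the unrefined coarsening result.
sgr_vs_none = relative_improvement(mean_modularity["SGR-density"], mean_modularity["none"])
kl_vs_none = relative_improvement(mean_modularity["KL"], mean_modularity["none"])

# Gain of Kernighan-Lin refinement over sorted greedy refinement.
kl_vs_sgr = relative_improvement(mean_modularity["KL"], mean_modularity["SGR-density"])

print(f"SGR-density vs none: {sgr_vs_none:.2f}%")
print(f"KL vs none:          {kl_vs_none:.2f}%")
print(f"KL vs SGR-density:   {kl_vs_sgr:.2f}%")
```

Rounded to two decimals, the three values agree with the 4.4%, 4.79%, and 0.37% reported in the text.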
[Figure 4.8: Mean Modularity by Refinement Method (large set). Bar plot of mean modularity for none, SGR-density, CGR, and KL.]

Altogether, Kernighan-Lin refinement reliably improves the clusterings by a small amount but requires considerably more runtime. Therefore it should be used in place of sorted greedy refinement only when the best clusterings are sought. Similar or better modularity improvements might be easily achievable by other refinement methods in less time.

4.5 Scalability

The purpose of this section is to study experimentally how well the runtime of the algorithms scales with the graph size. The configurations considered are the multilevel Kernighan-Lin refinement (KL), sorted greedy refinement by density-fitness (SGR-density), complete greedy refinement (CGR), and the raw graph coarsening without refinement (none). For all other parameters the default values are used. The runtimes of all graphs and algorithms have to be measured on the same computer. Thus only a subset of 24 graphs from the large graph collection is used. These graphs are also marked in the graph table in the appendix.

Figure 4.9a shows the total runtime of the algorithms versus the number of vertices. In addition, Figure 4.9b shows the computation time of the coarsening phase, which equals the configuration none. The time spent on sorted greedy refinement is included. In order to show the influence of the vertices, the names of three larger graphs and their vertex counts are inserted into the figure. The complete runtime measurements of each graph can be found in the appendix in Table B.5.

Of course the runtime depends not only on the number of vertices but also on the edges. On the other hand, the number of edges mostly scales with the vertices because nearly all graphs have a similar mean vertex degree. The graphs eatRS and hep-th-new main are the two largest graphs of the collection. Both have more than three times as many edges as other graphs of similar vertex count (for example