Multilevel Graph Clustering with Density-Based Quality Measures

More documents

Recommendations

Info

$Eine Einführung in LaTeX-Beamer - studiy - Brandenburgische ...$

4 EvaluationComparison of ModularityAverage Modularity0.0 0.1 0.2 0.3 0.4 0.50.3963 0.4393 0.4580 0.4644 0.4718 0.4927 0.5050 0.5075 0.5110walktrap leadingev wakita_HE wakita_HN fgj ML−none spinglass ML−sgrd ML−KLAlgorithm(a) Modularity by AlgorithmComparison of Runtimeruntime [s]0 1000 2000 3000 4000 5000●walktrapleadingevwakita_HEwakita_HNfgjML−nonespinglassML−sgrdML−KL●●● ● ● ●0 5000 10000 15000 20000 25000vertex count(b) Runtime of the Algorithms vs. Graph SizeFigure 4.10: Clustering Results and Runtime of the Reference Algorithms78
4.7 Comparison to Published Resultsof Reichardt and Bornholdt produces comparable good clusterings. However it ismuch slower. On nearly all graphs it is outperformed by the multi-level Kernighan-Lin refinement (ML-KL) in terms of modularity and runtime. The three slowestimplementations were spinglass, ML-KL, and leadingev. All other algorithms had arelatively constant, low runtime.4.7 Comparison to Published ResultsThis section compares the presented multi-level refinement method against otherclustering algorithms. For many of these algorithms clustering results are publishedin the papers presenting the algorithm. Because of their size clusterings are directlyprinted only for very small graphs. Commonly just the modularity value of theclusterings are published.The following discussion is based on the algorithms listed in Section 2.4 aboutfundamental clustering methods. First some general problems of this evaluationmethod are discussed. Then for each graph the modularity values found in articlesare presented. The section concludes with a small summary table comparing thebest found values to own results.Only modularity values published together with the original algorithms were consideredand for each value the source article is cited. The modularity values arecompared against two multi-level Kernighan-Lin refinement algorithms. Both usegreedy grouping with 10% reduction factor and Kernighan-Lin refinement. As mergeselector the weight density is employed by the ML-KL-density variant and the randomwalk reachability (2,3) by the ML-KL-rw variant. The first variant is the defaultconfiguration identified by the previous evaluation. In addition the other variant waschosen because for degree volume model used here it is in many cases able to findbetter clusterings.The comparison of printed modularity values embodies some general problems.Often only a few digits are printed to save space. With just three digits it is difficultto check whether the same or just similar clusterings where found. In this regardalso printing the number of clusters would be helpful. The calculation of modularitymay be done slightly different. Several variants of the modularity measure exist andsmall variations in the handling of self-edges are possible. In addition unweightededges might have been used instead of the available weighted version. Finally smalldiversities can arise in preprocessing and graph conversion by different strategies toobtain undirected, symmetric graphs. Here just the last problem is addressed byexcluding graphs when their published number of vertices and edges differs from theown version. This applies to most graphs in [64, 25, 69].4.7.1 The Graphs and ClusteringsIn the following paragraphs each graph is shortly presented. Thereafter the clusteringresults are discussed. For smaller graphs also example pictures are printed.The layout was computed with the LinLog energy model [62] 5 . The best clustering5 available at http://www.informatik.tu-cottbus.de/~an/GD/79
Page 1:
Brandenburgische Technische Univers
Page 5 and 6:
ContentsList of FiguresList of Tabl
Page 7:
List of Figures1.1 Graph of the Mex
Page 11 and 12:
1 IntroductionSince the rise of com
Page 13 and 14:
1.2 Objectives and Outline1.2 Objec
Page 15 and 16:
2 Graph ClusteringThis chapter intr
Page 17 and 18:
2.2 The Modularity Measure of Newma
Page 19 and 20:
2.3 Density-Based Clustering Qualit
Page 21 and 22:
Page 23 and 24:
Page 25 and 26:
Page 27 and 28:
2.4 Fundamental Clustering Strategi
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
3 The Multi-Level Refinement Algori
Page 37 and 38: 3.1 The Multi-Level Schemeas starti
Page 39 and 40: 3.2 Graph CoarseningData: graph,sel
Page 41 and 42: 3.2 Graph Coarseningnearly no edges
Page 43 and 44: 3.3 Merge SelectorsExtent Name Desc
Page 45 and 46: 3.3 Merge Selectorsdifferent size.
Page 47 and 48: 3.3 Merge SelectorsThe probability
Page 49 and 50: 3.3 Merge SelectorsAs selection qua
Page 51 and 52: 3.3 Merge Selectorsvectors the eige
Page 53 and 54: 3.4 Cluster Refinementleave the loc
Page 55 and 56: 3.4 Cluster Refinementmoving v from
Page 57 and 58: 3.4 Cluster RefinementAlgorithm Sea
Page 59 and 60: 3.4 Cluster RefinementData: graph,c
Page 61 and 62: 3.4 Cluster RefinementModularity0.2
Page 63 and 64: 3.5 Further Implementation NotesInd
Page 65 and 66: 3.5 Further Implementation NotesBOO
Page 67: 3.5 Further Implementation Notesfor
Page 70 and 71: 4 Evaluationparameter component des
Page 72 and 73: 4 Evaluationsignificance scale also
Page 74 and 75: 4 EvaluationModularity by Match Fra
Page 76 and 77: 4 Evaluation5% 10% 30% 50% 100%G-no
Page 78 and 79: 4 Evaluationmean modularity0.50 0.5
Page 80 and 81: 4 Evaluation1 2 3 4RWreach-none 1 0
Page 82 and 83: 4 EvaluationG-none M-none G-sgrd M-
Page 84 and 85: 4 Evaluationmean modularity time DI
Page 86 and 87: 4 EvaluationRuntime vs. Graph SizeR
Page 90 and 91: 4 Evaluation(a) karate(b) dolphinsF
Page 92 and 93: 4 Evaluation(a) jazz(b) celegans me
Page 94 and 95: 4 Evaluationadministrators, and gra
Page 97 and 98: 5 Results and Future WorkThe object
Page 99 and 100: 5.3 Directions for Future Workstrat
Page 101: 5.3 Directions for Future Workties
Page 104 and 105: BIBLIOGRAPHY[14] B.L. Chamberlain.
Page 106 and 107: BIBLIOGRAPHY[42] H. Jeong, B. Tombo
Page 108 and 109: BIBLIOGRAPHY[71] A. J. Soper and C.
Page 110 and 111: A The Benchmark Graph Collectionsub
Page 112 and 113: B Clustering ResultsRWreach-sgrd 1
Page 114: B Clustering Resultswalktrap leadin
show all

Multilevel Graph Clustering with Density-Based Quality Measures

Create successful ePaper yourself

Delete template?

Save as template?