GfKl 2008 - Legos
GfKl 2008 - Legos
GfKl 2008 - Legos
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Evaluation Strategies for Learning Algorithms<br />
of Hierarchical Structures<br />
Korinna Bade 1 and Dominik Benz 2<br />
1 Faculty of Computer Science, Otto-von-Guericke-University Magdeburg,<br />
D-39106 Magdeburg, Germany, Email: korinna.bade@ovgu.de<br />
2 Department of Electrical Engineering/Computer Science, University of Kassel,<br />
D-34121 Kassel, Germany, Email: benz@cs.uni-kassel.de<br />
Abstract. The idea to automatically induce a hierarchical structure among a set<br />
of objects or integrate a given hierarchy into the learning process is common to<br />
a number of disciplines like hierarchical clustering (Bade and Nürnberger, <strong>2008</strong>)<br />
and classification or ontology learning. A crucial aspect hereby is how to assess the<br />
quality of the learned hierarchical scheme. Existing evaluation approaches can be<br />
broadly classified in methods defining quality metrics on the resulting scheme alone<br />
and methods which invoke an external “gold-standard” for comparison. We focus<br />
on the latter case, for which various similarity metrics have been proposed, mostly<br />
depending on the characteristics of the applied learning procedure.<br />
This work aims at bringing together the different disciplines by presenting and<br />
comparing existing gold-standard based evaluation methods for learning algorithms<br />
that generate hierarchical structures. We present an interdisciplinary framework in<br />
order to enable comparison across the different contexts, from which the metrics<br />
originate. Our goal is to emphasize the strong similarities of evaluation tasks in different<br />
disciplines and to create a general pool of evaluation methods. Based on prior<br />
work (Dellschaft and Staab, 2006), we analyze properties of (good) evaluation measures.<br />
Different types of structural errors in the learned hierarchies are identified and<br />
their effects on existing measures are shown. Observing strengths and weaknesses of<br />
existing methods, we also suggest some new methods.<br />
Key words: evaluation metrics, hierarchical clustering, ontology learning, goldstandard<br />
References<br />
Bade, K. and Nürnberger, A. (<strong>2008</strong>): Creating a Cluster Hierarchy under Constraints<br />
of a Partially Known Hierarchy. In: Proceedings of the <strong>2008</strong> SIAM International<br />
Conference on Data Mining. (to appear)<br />
Dellschaft, K. and Staab, S. (2006): On How to Perform a Gold Standard Based<br />
Evaluation of Ontology Learning. In: Proc. of 5 th Int. Semantic Web Conference.<br />
228–241.<br />
− 8 −