13.12.2012 Views

GfKl 2008 - Legos

GfKl 2008 - Legos

GfKl 2008 - Legos

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Evaluation Strategies for Learning Algorithms<br />

of Hierarchical Structures<br />

Korinna Bade 1 and Dominik Benz 2<br />

1 Faculty of Computer Science, Otto-von-Guericke-University Magdeburg,<br />

D-39106 Magdeburg, Germany, Email: korinna.bade@ovgu.de<br />

2 Department of Electrical Engineering/Computer Science, University of Kassel,<br />

D-34121 Kassel, Germany, Email: benz@cs.uni-kassel.de<br />

Abstract. The idea to automatically induce a hierarchical structure among a set<br />

of objects or integrate a given hierarchy into the learning process is common to<br />

a number of disciplines like hierarchical clustering (Bade and Nürnberger, <strong>2008</strong>)<br />

and classification or ontology learning. A crucial aspect hereby is how to assess the<br />

quality of the learned hierarchical scheme. Existing evaluation approaches can be<br />

broadly classified in methods defining quality metrics on the resulting scheme alone<br />

and methods which invoke an external “gold-standard” for comparison. We focus<br />

on the latter case, for which various similarity metrics have been proposed, mostly<br />

depending on the characteristics of the applied learning procedure.<br />

This work aims at bringing together the different disciplines by presenting and<br />

comparing existing gold-standard based evaluation methods for learning algorithms<br />

that generate hierarchical structures. We present an interdisciplinary framework in<br />

order to enable comparison across the different contexts, from which the metrics<br />

originate. Our goal is to emphasize the strong similarities of evaluation tasks in different<br />

disciplines and to create a general pool of evaluation methods. Based on prior<br />

work (Dellschaft and Staab, 2006), we analyze properties of (good) evaluation measures.<br />

Different types of structural errors in the learned hierarchies are identified and<br />

their effects on existing measures are shown. Observing strengths and weaknesses of<br />

existing methods, we also suggest some new methods.<br />

Key words: evaluation metrics, hierarchical clustering, ontology learning, goldstandard<br />

References<br />

Bade, K. and Nürnberger, A. (<strong>2008</strong>): Creating a Cluster Hierarchy under Constraints<br />

of a Partially Known Hierarchy. In: Proceedings of the <strong>2008</strong> SIAM International<br />

Conference on Data Mining. (to appear)<br />

Dellschaft, K. and Staab, S. (2006): On How to Perform a Gold Standard Based<br />

Evaluation of Ontology Learning. In: Proc. of 5 th Int. Semantic Web Conference.<br />

228–241.<br />

− 8 −

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!