PDF (1MB) - QUT ePrints
XML Data Clustering: An Overview · 3
(a) A set of XML schemas.
(b) A set of XML documents.
Fig. 1: Examples of XML data.
the “Books” cluster that contains books of several genres. On the other hand, clustering
documents based only on the similarity of their content features will fail to distinguish between
conference articles and books, which follow two different structures. To derive a meaningful
grouping, these fragments should be analyzed in terms of both their structural and content
ACM Computing Surveys, Vol. , No. , 2009.