WWW/Internet - Portal do Software Público Brasileiro


IADIS International Conference WWW/Internet 2010

independently. Furthermore, it makes it possible to profit from the MoreLikeThis-similarity³ function provided by Apache Lucene.

• Tag (Cloud) Generation Module: To provide access to related documents, this module calculates a resource-specific search query term/tag cloud. This tag cloud is of the form $TC_r = (t_1, \ldots, t_n, r_1, \ldots, r_m)$, where $r_1, \ldots, r_m$ are the resources which have any of the query tags $t_1, \ldots, t_n$ in common. The calculated resource-specific query tag clouds are serialized to disk on the server side to improve the performance of the system. For retrieving the query tags and the corresponding resources (cf. Figure 1), this module provides a simple HTTP interface using the following two functions:
o GetTagCloud(,) generates an XML representation of a query tag cloud
o GetResources(,,) generates an XML representation of the resource list for a particular query tag.

• Tag Cloud Presentation Module: This module is a client-side AJAX module implemented in JavaScript. It retrieves the XML representation of a query term/tag cloud, or an XML representation of the resource list of a particular query term/tag, from the tag cloud generation module and renders a tag cloud in a visually appealing fashion (cf. Trattner and Helic 2009).

Figure 3. QueryCloud system - structural diagram.

4. EXPERIMENTAL SETUP AND EVALUATION FRAMEWORK

To investigate the feasibility of the tool before actually deploying it, we integrated the QueryCloud tag collection module into the live Austria-Forum system and, for a period of 6 months, collected the search queries of users arriving at Austria-Forum from search engines such as Google, Yahoo! and Bing.

In order to evaluate the potential and limitations of the tool, we implemented a theoretical framework that measures tag quantity, link quality, network quality and information retrieval quality of the system.
Tag quality in terms of tag semantics is not investigated, since we assume that the approach produces good-quality tags, as shown by (Antonellis et al. 2009, Carman et al. 2009). Since Austria-Forum offers a built-in tagging system, we used this human-generated tag corpus (hereafter referred to as the AF dataset) as our almost "gold standard" and compared it, within our theoretical framework, with the tags collected by the QueryCloud system within Austria-Forum.

³ http://lucene.apache.org/java/3_0_1/api/all/org/apache/lucene/search/Similarity.html
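
The resource-specific query tag cloud described for the Tag (Cloud) Generation Module can be illustrated with a minimal Python sketch. It is not the QueryCloud implementation: the function names, the in-memory dictionary of collected query tags, the XML element names and the toy data are all assumptions made for illustration, and the parameters of GetTagCloud and GetResources are not recoverable from the text, so the signatures below are hypothetical. The sketch also assumes that a resource is left out of its own list of related resources.

```python
from xml.etree import ElementTree as ET


def build_tag_cloud(resource, tags_by_resource):
    """Return (tags, related) for one resource.

    tags    -> sorted query tags t_1..t_n recorded for `resource`
    related -> sorted resources r_1..r_m sharing at least one of those tags
    """
    tags = sorted(tags_by_resource.get(resource, set()))
    tag_set = set(tags)
    related = sorted(
        other
        for other, other_tags in tags_by_resource.items()
        # Assumption: the resource itself is excluded from its related list.
        if other != resource and tag_set & other_tags
    )
    return tags, related


def resources_for_tag(tag, tags_by_resource):
    """Return the sorted resources that carry one particular query tag."""
    return sorted(r for r, t in tags_by_resource.items() if tag in t)


def tag_cloud_to_xml(resource, tags, related):
    """Serialize a tag cloud to XML (element names are hypothetical)."""
    root = ET.Element("tagcloud", {"resource": resource})
    for tag in tags:
        ET.SubElement(root, "tag").text = tag
    for other in related:
        ET.SubElement(root, "resource").text = other
    return ET.tostring(root, encoding="unicode")


# Toy data standing in for the collected search queries (invented values).
QUERY_TAGS = {
    "Vienna": {"austria", "capital", "danube"},
    "Linz": {"austria", "danube"},
    "Mozart": {"composer", "salzburg"},
}

tags, related = build_tag_cloud("Vienna", QUERY_TAGS)
xml_text = tag_cloud_to_xml("Vienna", tags, related)
print(xml_text)
```

In this sketch, build_tag_cloud plays the role of the computation behind GetTagCloud and resources_for_tag the one behind GetResources; a real deployment would additionally cache the serialized result on disk, as the text describes.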
