Automatic indexing in e-government - VBN - Aalborg Universitet
Automatic indexing in e-government - VBN - Aalborg Universitet
Automatic indexing in e-government - VBN - Aalborg Universitet
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Automatic</strong> <strong><strong>in</strong>dex<strong>in</strong>g</strong> <strong>in</strong> e-<strong>government</strong><br />
characteristics of the doma<strong>in</strong>. For this purpose we use a questionnaire for ga<strong>in</strong><strong>in</strong>g an<br />
overview of the organization. Subsequently, focus group <strong>in</strong>terviews are employed <strong>in</strong><br />
order to expla<strong>in</strong> and expand the results of the questionnaire survey. The questionnaire<br />
is used to collect data on the employees’ frequency of <strong>in</strong>formation seek<strong>in</strong>g, the types of<br />
<strong>in</strong>formation needs developed, use of <strong>in</strong>formation sources, and metadata preferences <strong>in</strong><br />
relation to specific work tasks <strong>in</strong> the organization. The assumption is that importance of<br />
<strong>in</strong>formation may depend on the work task <strong>in</strong> question. We refer to this first part of the<br />
empirical foundation for the thesis as the doma<strong>in</strong> study.<br />
The second part of the data collection consists of a search test specifically<br />
<strong>in</strong>vestigat<strong>in</strong>g the performance of the two <strong><strong>in</strong>dex<strong>in</strong>g</strong> methods mentioned above. For the<br />
design of the search test we use knowledge ga<strong>in</strong>ed from the doma<strong>in</strong> study <strong>in</strong> order to<br />
qualify the search test design. The search test <strong>in</strong>vestigates the performance of two test<br />
systems. Both test systems employ automatic <strong><strong>in</strong>dex<strong>in</strong>g</strong>; one extracted (free text<br />
<strong><strong>in</strong>dex<strong>in</strong>g</strong>) and one assigned (automatic categorization). Three simulated and one real<br />
search job forms the basis of the test persons’ evaluation of the performance of the test<br />
systems. The relevance of the search results are evaluated by the test persons. The test<br />
sessions are f<strong>in</strong>ished with a short <strong>in</strong>terview.<br />
1.2 Empirical assumptions<br />
The empirical design of the PhD project has been guided by our<br />
methodological start<strong>in</strong>g po<strong>in</strong>t: the cognitive view of <strong>in</strong>formation seek<strong>in</strong>g and retrieval<br />
(cf., Ingwersen & Järvel<strong>in</strong>, 2005). The cognitive viewpo<strong>in</strong>t is methodologically<br />
considered with<strong>in</strong> the research tradition of cognitive constructivism (Talja, Tuom<strong>in</strong>en &<br />
Savola<strong>in</strong>en, 2005). The cognitive viewpo<strong>in</strong>t has emerged as a reaction to a biased focus<br />
on users <strong>in</strong> the user oriented research tradition and on systems <strong>in</strong> the system oriented<br />
research tradition. Thus, the cognitive viewpo<strong>in</strong>t aims at a holistic view on the process<br />
of IR <strong>in</strong>teraction <strong>in</strong> order to achieve <strong>in</strong>tegration between the user oriented and the<br />
system driven research traditions (e.g., Ingwersen, 1992, 1996; Ingwersen & Järvel<strong>in</strong>,<br />
2005). The cognitive view emphasizes the cognitive actors <strong>in</strong>teract<strong>in</strong>g <strong>in</strong> <strong>in</strong>formation<br />
seek<strong>in</strong>g and retrieval. With this view of <strong>in</strong>formation seek<strong>in</strong>g and retrieval, the users and<br />
the <strong>in</strong>formation system must be taken <strong>in</strong>to account when test<strong>in</strong>g performance of an<br />
<strong>in</strong>formation system. As a consequence we test the performance of <strong><strong>in</strong>dex<strong>in</strong>g</strong> methods by<br />
<strong>in</strong>volv<strong>in</strong>g real, potential users <strong>in</strong> the search test. Further, we apply an established<br />
evaluation method for the search test, namely simulated search tasks, which have been<br />
4