01.08.2013 Views

Automatic indexing in e-government - VBN - Aalborg Universitet



characteristics of the domain. For this purpose we use a questionnaire to gain an
overview of the organization. Subsequently, focus group interviews are employed in
order to explain and expand the results of the questionnaire survey. The questionnaire
is used to collect data on the employees’ frequency of information seeking, the types of
information needs developed, use of information sources, and metadata preferences in
relation to specific work tasks in the organization. The assumption is that the importance of
information may depend on the work task in question. We refer to this first part of the
empirical foundation for the thesis as the domain study.

The second part of the data collection consists of a search test specifically
investigating the performance of the two indexing methods mentioned above. We use
knowledge gained from the domain study to qualify the design of the search test.
The search test investigates the performance of two test
systems. Both test systems employ automatic indexing: one uses extracted indexing (free-text
indexing) and the other assigned indexing (automatic categorization). Three simulated search jobs and one real
search job form the basis of the test persons’ evaluation of the performance of the test
systems. The relevance of the search results is evaluated by the test persons. Each test
session finishes with a short interview.

1.2 Empirical assumptions

The empirical design of the PhD project has been guided by our
methodological starting point: the cognitive view of information seeking and retrieval
(cf. Ingwersen & Järvelin, 2005). Methodologically, the cognitive viewpoint is
placed within the research tradition of cognitive constructivism (Talja, Tuominen &
Savolainen, 2005). The cognitive viewpoint has emerged as a reaction to a biased focus
on users in the user-oriented research tradition and on systems in the system-oriented
research tradition. Thus, the cognitive viewpoint aims at a holistic view of the process
of IR interaction in order to achieve integration between the user-oriented and the
system-driven research traditions (e.g., Ingwersen, 1992, 1996; Ingwersen & Järvelin,
2005). The cognitive view emphasizes the cognitive actors interacting in information
seeking and retrieval. With this view of information seeking and retrieval, both the users and
the information system must be taken into account when testing the performance of an
information system. As a consequence, we test the performance of indexing methods by
involving real, potential users in the search test. Further, we apply an established
evaluation method for the search test, namely simulated search tasks, which have been

