05.10.2013 Aufrufe

Information und Wissen: global, sozial und frei? - Fachhochschule ...

Information und Wissen: global, sozial und frei? - Fachhochschule ...

Information und Wissen: global, sozial und frei? - Fachhochschule ...

MEHR ANZEIGEN
WENIGER ANZEIGEN

Sie wollen auch ein ePaper? Erhöhen Sie die Reichweite Ihrer Titel.

YUMPU macht aus Druck-PDFs automatisch weboptimierte ePaper, die Google liebt.

26 Pavel Sirotkin<br />

Formula 1. MAP formula with queries Q, relevant documents R, documents D at<br />

rank r and n returned results. rel is a relevance function assigning 1 to relevant results.<br />

Another metric which has enjoyed wide popularity since its introduction<br />

is Discounted Cumulated Gain or DCG for short (Järvelin and Kekäläinen<br />

2002). The more basic measure upon which it is constructed is the Cumulated<br />

Gain, which is a simple sum of the relevance judgements of all results<br />

up to a certain rank. DCG enhances this rather simple method by introducing<br />

“[a] discounting function [...] that progressively reduces the document score<br />

as its rank increases but not too steeply (e.g., as division by rank) to allow for<br />

user persistence in examining further documents” (Järvelin and Kekäläinen<br />

2002, p. 425). In practice, the authors suggest a logarithmic function, which<br />

can be adjusted (by selecting its base) to provide a more or less strong discount,<br />

depending on the expectations of users’ persistence. DCG can be<br />

modified to allow for better inter-query comparison; to this end, a perfect<br />

ranking for known documents is constructed. The DCG of a result list is then<br />

divided by the ideal DCG, producing normalized DCG (nDCG) in the 0...1<br />

range.<br />

Formula 2. DCG with logarithm base b (based on Järvelin and Kekäläinen 2002).<br />

Metric Evaluations<br />

When a new evaluation metric is introduced, it is usually explained what its<br />

advantage over existing metrics is. Mostly, this happens in theoretical terms;<br />

more often than not, an experimental metric evaluation is also given. There<br />

are many studies comparing one metric to another; however, this has the<br />

disadvantage of being a circular confirmation, indicating at best correlation<br />

between metrics.<br />

Another method was used for evaluating different CG metrics (Järvelin<br />

and Kekäläinen 2000; Järvelin and Kekäläinen 2002). Those were used to<br />

evaluate different IR systems, where one was hypothesized to outperform the

Hurra! Ihre Datei wurde hochgeladen und ist bereit für die Veröffentlichung.

Erfolgreich gespeichert!

Leider ist etwas schief gelaufen!