15.08.2013 Views

General Computer Science 320201 GenCS I & II Lecture ... - Kwarc

General Computer Science 320201 GenCS I & II Lecture ... - Kwarc

General Computer Science 320201 GenCS I & II Lecture ... - Kwarc

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Definition 570 Let A be a web page that is hyperlinked from web pages S1, . . . , Sn, then<br />

<br />

PR(S1) PR(Sn)<br />

PR(A) = 1 − d + d + · · ·<br />

C(S1) C(Sn)<br />

where C(W ) is the number of links in a page W and d = 0.85.<br />

c○: Michael Kohlhase 376<br />

Getting the ranking right is a determining factor for success of a search engine. In fact, the early<br />

of Google was based on the pagerank algorithm discussed above (and the fact that they figured<br />

out a revenue stream using text ads to monetize searches).<br />

The final step for a web search engine is answer composition; at least, if the answer is addressed at<br />

a human user. The main task here is to assemble those information fragments that the user needs<br />

to determine whether the hit described contains information relevant to the respective information<br />

need.<br />

Answer Composition in Search Engines<br />

Answers: To present the<br />

search results we need to address:<br />

Hits and their context<br />

format conversion<br />

caching<br />

Advertising: to finance the<br />

service<br />

advertiser can buy search<br />

terms<br />

ads correspond to search<br />

interest<br />

advertiser pays by click.<br />

c○: Michael Kohlhase 377<br />

Due to the gigantic size of the Internet, search engines are extremely resource-hungry web applications.<br />

The precise figures about the computational resources of the large internet companies are<br />

well-kept trade secrets, but the following figure should give an intuition of the scales involved.<br />

216

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!