Einführung in die Informationswissenschaft

Weitere Magazine

Empfehlungen

Info

Das probabilistische Modell • Fortführung des Beispiels Textstatistik 10 / (12 – 10) 10 / 2 • w („Miranda“) = log ---------------------------------------- = log ------------ = log 35 = 1,54 (11 – 10) / (20 – 11 – 12 + 10) 1 / 7 • Algorithmus: Gewichte alle Dokumente der Datenbank, in denen „Miranda“ vorkommt, mit 1,54! HHU Düsseldorf, WS 2004/05 Einführung in die Informationswissenschaft 282
Textstatistik Textstatistik (Relevance Ranking I). Fazit • Zuordnung von Gewichtungswerten bei Suchanfragen durch Nutzer (bei Google: durch Reihenfolge der Suchworte) • Der Übergang zum Relevance Ranking ist sowohl aus Booleschen Systemen als auch aus informationslinguistischen Komponenten automatisch indexierender Systeme möglich. • Historischer Ausgang: These von Luhn: Häufig vorkommende Worte sind für ein Dokument auch wichtig (schlecht: zu häufige und zu seltene Worte) • Grunddaten für Textstatistik: – Position eines Wortes im Text (im Titel wichtiger als in einer Fußnote) – WDF (Gewichtung nach Auftretenshäufigkeit im Dokument: je häufiger, desto wichtiger) – IDF (Gewichtung nach Auftretenshäufigkeit in der gesamten Datenbank: je mehr Dokumente das Wort enthalten, desto unwichtiger) – „einfacher“ Gewichtungswert (Wort im Dokument): P * WDF * IDF HHU Düsseldorf, WS 2004/05 Einführung in die Informationswissenschaft 283
Seite 1 und 2:
Einführung in die Informationswiss
Seite 3 und 4:
Seite 5 und 6:
Seite 7 und 8:
Seite 9 und 10:
Wer befasst sich mit Information Re
Seite 11 und 12:
Konferenzen Information Retrieval
Seite 13 und 14:
Zeitschriften: Information Retrieva
Seite 15 und 16:
van Rijsbergen Information Retrieva
Seite 17 und 18:
Information Retrieval Eine kurze Ge
Seite 19 und 20:
Grundlagen des Information Retrieva
Seite 21 und 22:
Seite 23 und 24:
Seite 25 und 26:
Seite 27 und 28:
Seite 29 und 30:
(verstandenes) Wissen (Nutzer) Empf
Seite 31 und 32:
Seite 33 und 34:
Seite 35 und 36:
Pull- Service Push- Service Grundla
Seite 37 und 38:
Seite 39 und 40:
Seite 41 und 42:
R e c a l l 100 Grundlagen des Info
Seite 43 und 44:
Seite 45 und 46:
Seite 47 und 48:
Seite 49 und 50:
Seite 51 und 52:
Seite 53 und 54:
Seite 55 und 56:
Seite 57 und 58:
Seite 59 und 60:
Seite 61 und 62:
Seite 63 und 64:
Seite 65 und 66:
Seite 67 und 68:
Seite 69 und 70:
Seite 71 und 72:
Seite 73 und 74:
Seite 75 und 76:
Seite 77 und 78:
Beispiel: Firmendossier (Creditrefo
Seite 79 und 80:
Beispiel: Firmendossier (Creditrefo
Seite 81 und 82:
Beispiel: Zeitungsartikel bei Facti
Seite 83 und 84:
Typische Dokumente: WTM (2) Beispie
Seite 85 und 86:
Typische Dokumente: WTM (3) Beispie
Seite 87 und 88:
Beispiel: Grundsatzurteil (Juris) -
Seite 89 und 90:
Dateien Grundlagen des Information
Seite 91 und 92:
Seite 93 und 94:
Seite 95 und 96:
Funktionalität Boolescher Retrieva
Seite 97 und 98:
Seite 99 und 100:
Seite 101 und 102:
Seite 103 und 104:
Seite 105 und 106:
Seite 107 und 108:
Seite 109 und 110:
Seite 111 und 112:
Seite 113 und 114:
Seite 115 und 116:
Seite 117 und 118:
Seite 119 und 120:
Seite 121 und 122:
Seite 123 und 124:
Seite 125 und 126:
Seite 127 und 128:
Seite 129 und 130:
Gewichtetes Retrieval Gewichten von
Seite 131 und 132:
Gewichtetes Retrieval Intellektuell
Seite 133 und 134:
Gewichtetes Retrieval Intellektuell
Seite 135 und 136:
Gewichtetes Retrieval Boolesches Re
Seite 137 und 138:
Gewichtetes Retrieval Exklusionsmen
Seite 139 und 140:
Gewichtetes Retrieval Boolesches Re
Seite 141 und 142:
Gewichten durch „Cracken“ von K
Seite 143 und 144:
Gewichtetes Retrieval Errechnen der
Seite 145 und 146:
Gewichtetes Retrieval Themenkette:
Seite 147 und 148:
Fazit. Gewichtetes Retrieval - Inte
Seite 149 und 150:
Informationslinguistik Linguistisch
Seite 151 und 152:
Informationslinguistik Informations
Seite 153 und 154:
Informationslinguistik Grundansatz:
Seite 155 und 156:
N-Gramme Informationslinguistik •
Seite 157 und 158:
Informationslinguistik N-Gramme. Be
Seite 159 und 160:
N-Gramme Informationslinguistik •
Seite 161 und 162:
Informationslinguistik N-Gramme: We
Seite 163 und 164:
Informationslinguistik Speicherung
Seite 165 und 166:
Informationslinguistik Stoppwortlis
Seite 167 und 168:
Informationslinguistik Informations
Seite 169 und 170:
Informationslinguistik Wortstammbil
Seite 171 und 172:
Informationslinguistik Abtrennen /
Seite 173 und 174:
Porter-Algorithmus. Schritt 2 Infor
Seite 175 und 176:
Porter- Algorithmus Schritt 4 Infor
Seite 177 und 178:
Porter-Algorithmus: Leistungsfähig
Seite 179 und 180:
Kuhlen- Algorithmus Informationslin
Seite 181 und 182:
Informationslinguistik Morphologie
Seite 183 und 184:
Informationslinguistik MORPHY. Beis
Seite 185 und 186:
Informationslinguistik MORPHY. Beis
Seite 187 und 188:
Informationslinguistik Komposition
Seite 189 und 190:
Informationslinguistik Komposition
Seite 191 und 192:
Informationslinguistik Morphologisc
Seite 193 und 194:
Informationslinguistik Grundform A
Seite 195 und 196:
Informationslinguistik Wort-Bedeutu
Seite 197 und 198:
Meronymie Informationslinguistik
Seite 199 und 200:
Informationslinguistik Relationen v
Seite 201 und 202:
Seite 203 und 204:
Seite 205 und 206:
Informationslinguistik Überblick d
Seite 207 und 208:
Informationslinguistik Worte nach m
Seite 209 und 210:
„Thesaurus“ Informationslinguis
Seite 211 und 212:
Informationslinguistik Erkennung vo
Seite 213 und 214:
Informationslinguistik Phrasenerken
Seite 215 und 216:
Informationslinguistik Phrasenerken
Seite 217 und 218:
Phrasenerkennung durch Indikatorbeg
Seite 219 und 220:
• Anaphora - Ellipsen Information
Seite 221 und 222:
- Anaphoraauflösung Informationsli
Seite 223 und 224:
Informationslinguistik - Soundex be
Seite 225 und 226:
Informationslinguistik Phonetische
Seite 227 und 228:
Informationslinguistik Fazit • Ei
Seite 229 und 230:
Relevance Ranking Textstatistik •
Seite 231 und 232: Precision bei Suchmaschinen im WWW
Seite 233 und 234: Textstatistik Übergang von einer B
Seite 235 und 236: Textstatistik Textstatistik. Der Au
Seite 237 und 238: Textstatistik Die These von Luhn: T
Seite 239 und 240: Textstatistik • einfaches Zählen
Seite 241 und 242: Textstatistik Dokumentspezifische W
Seite 243 und 244: Textstatistik Inverse Dokumenthäuf
Seite 245 und 246: Das Längengewicht Textstatistik
Seite 247 und 248: Sortierung nach Gewichtung. Variant
Seite 249 und 250: Textstatistik Sortiervariante 1:
Seite 251 und 252: Textstatistik Sortierung nach Gewic
Seite 253 und 254: Vektorraummodell Textstatistik - Do
Seite 255 und 256: Vektorraummodell Textstatistik •
Seite 257 und 258: Vektorraummodell Textstatistik •
Seite 259 und 260: Textstatistik Vektorraummodell: Ber
Seite 261 und 262: Textstatistik Dok1 (5 ׀ 3); Dok2 (
Seite 263 und 264: Textstatistik Einsatz des Vektorrau
Seite 273 und 274: Textstatistik Vektorraummodell - Pr
Seite 275 und 276: Das probabilistische Modell Textsta
Seite 277 und 278: Das probabilistische Modell Textsta
Seite 279 und 280: Textstatistik Das probabilistische
Seite 281: Textstatistik Das probabilistische
Seite 285 und 286: Textstatistik Textstatistik (Releva
Seite 287 und 288: Link-Topologie Link-Topologie (Rele
Seite 289 und 290: Link-Topologie Zitationen und Refer
Seite 291 und 292: Bibliographic Coupling A „zitiert
Seite 293 und 294: Co-Zitations-Netz (Link-Struktur) L
Seite 295 und 296: Link-Topologie Hubs und Authorities
Seite 299 und 300: Link-Topologie Einsatz des Kleinber
Seite 303 und 304: Link-Topologie HITS und andere Meth
Seite 305 und 306: PageRank Link-Topologie • das WWW
Seite 307 und 308: PageRank r(A) + r(B) + r(C) = 1 da
Seite 309 und 310: PageRank Link-Topologie • Berechn
Seite 311 und 312: Systemarchitektur von Google Indexe
Seite 313 und 314: Link-Topologie Relevance Ranking be
Seite 315 und 316: Re-Ranking Link-Topologie • „ol
Seite 317 und 318: Re-Ranking Link-Topologie • Besch
Seite 319 und 320: Ranking nach Nutzung • Visits - A
Seite 321 und 322: Link-Topologie Usage Statistics •
Alle anzeigen

Einführung in die Informationswissenschaft

Sie wollen auch ein ePaper? Erhöhen Sie die Reichweite Ihrer Titel.

Template löschen?

Als Template speichern?