06.08.2015 Views

A Wordnet from the Ground Up

A Wordnet from the Ground Up - School of Information Technology ...


Chapter 4. Extracting Relation Instances

    and(
      in(cas[0],nom),
      llook(-1,begin,$T,equal(base[$T],{"taki"})),
      equal(base[$+1T],{"jak"}),
      only($+2T,-1,$AR,
        or(
          inter(flex[$AR],{adjective, adjectival participles, adverb, adverbial participles, noun, numeral}),
          in(orth[$AR],{"i","lub","czy","oraz","a",",",":","(",")"})
        )
      ),
      llook($-1T,begin,$N,
        and(
          inter(flex[$N],{noun}),
          equal(base[$N],{"base form of NLU2"}),
          in(cas[$N],{nom,acc,dat,inst,loc,voc})
        )
      ),
      only($+1N,$-1T,$AL,
        or(
          inter(flex[$AL],{adjective, adjectival participles, adverb, adverbial participles, numeral}),
          and(
            inter(flex[$AL],{noun, pronouns}),
            equal(cas[$AL],{gen})
          )
        )
      )
    )

Figure 4.2: The essentials of the TakichJak pattern implementation in JOSKIPI

During the evaluation, an extracted LU pair could be classified as a correct instance of hypernymy (possibly indirect, with longer paths accepted), or as one of two forms of nearly correct instances:

• not the expected hyponym/hypernym order; such pairs occurred more often among the results of the NomToNom pattern in which the direction is not marked by grammatical case;
• small inaccuracies in one of the LUs: it is part of a larger multiword LU, or it has a wrong number value, or it is represented by a wrong root (a tagger error).

All other pairs were classified as incorrect. The results in Table 4.1 have been calculated with the assumption that correct and nearly correct instances are positive. If we excluded the nearly correct class, the results would be about 20% lower. The results would be very low if we only sought direct hypernymy. This clearly suggests that the extracted pairs are not directly helpful in expanding the core plWordNet, but they still are a valuable source of knowledge.
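The scoring convention described above can be illustrated with a minimal Python sketch. It is not the authors' evaluation code: the names `Verdict`, `NEARLY_CORRECT` and `precision` are hypothetical, and the ten judgements are made-up values chosen only to show how counting or excluding the nearly correct class changes the score.

```python
from collections import Counter
from enum import Enum


class Verdict(Enum):
    """Hypothetical labels for a manually judged LU pair."""
    CORRECT = "correct"              # hypernymy, direct or indirect
    WRONG_ORDER = "wrong_order"      # nearly correct: hyponym/hypernym swapped
    INACCURATE_LU = "inaccurate_lu"  # nearly correct: small inaccuracy in one LU
    INCORRECT = "incorrect"          # everything else


# The two "nearly correct" forms named in the text.
NEARLY_CORRECT = {Verdict.WRONG_ORDER, Verdict.INACCURATE_LU}


def precision(verdicts, count_nearly_correct=True):
    """Return the share of judged pairs that count as positive."""
    if not verdicts:
        raise ValueError("no judged pairs")
    counts = Counter(verdicts)
    positive = counts[Verdict.CORRECT]
    if count_nearly_correct:
        positive += sum(counts[v] for v in NEARLY_CORRECT)
    return positive / len(verdicts)


# Made-up judgements for ten extracted pairs (not data from Table 4.1).
judged = (
    [Verdict.CORRECT] * 5
    + [Verdict.WRONG_ORDER]
    + [Verdict.INACCURATE_LU]
    + [Verdict.INCORRECT] * 3
)

lenient = precision(judged)                             # nearly correct counted
strict = precision(judged, count_nearly_correct=False)  # only correct counted
print(f"lenient={lenient:.2f} strict={strict:.2f}")
```

Keeping the two nearly correct forms as separate labels, rather than one merged class, makes it possible to report how many errors come from the relation direction and how many from tagging or multiword issues.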
The extracted pairs show not only the semantic similarity of the LUs in a pair, but also the direction of the relation. Indirect hypernyms can be helpful
