06.08.2015 Views

A Wordnet from the Ground Up

A Wordnet from the Ground Up - School of Information Technology ...

A Wordnet from the Ground Up - School of Information Technology ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

4.5. Hybrid Combinations 151We introduce <strong>the</strong> following notation:• x is a lemma, representing one or more LUs, to be added to <strong>the</strong> wordnet, y, y ′are lemmas <strong>from</strong> <strong>the</strong> wordnet, S, S ′ – wordnet synsets;• hypo(S, n), hyper(S, n) are sets of hyponym or hypernym synsets, respectively,of <strong>the</strong> synset S up to n levels;• dist h (S, S ′ ) is <strong>the</strong> number of hypernymy or hyponymy links (depending on <strong>the</strong>direction) between S and S ′ ;• r is <strong>the</strong> context radius – it defines <strong>the</strong> size of <strong>the</strong> context influencing <strong>the</strong> calculationof <strong>the</strong> lemma-to-synset fit (<strong>the</strong> value was set experimentally to 2);• hMSR (set experimentally to 0.4) is <strong>the</strong> threshold defining highly reliable MSRvalues – it corresponds to <strong>the</strong> observed high and reliable values of MSR;• minMSR (set to 0.1) is <strong>the</strong> MSR value below which associations seem to bebased on weak, accidental clues;• maxSens (set to 5) is <strong>the</strong> maximal number of presented activation areas (possibleattachment areas) – <strong>the</strong> number of correct proposals is mostly low, so wewanted to keep <strong>the</strong> number of attachment areas small in order not to clutter <strong>the</strong>screen.Phase I. Lemma-to-synset calculation1. votes(x, S) = ∑ y∈Sfit(x, y)2. fit(x, S) =δ (h=1)(votes(x, S) + ∑ votes(x, S ′ ))S ′ ∈hypo(S,r)∪hyper(S,r)2 ∗ dist(S, S ′ )where δ : N × N → {0, 1}, such that δ(n, s) = 1if and only if (n ≥ 1.5 ∗ h and s ≤ 2) or (n ≥ 2 ∗ h and s > 2)( ∑3. fit(x, S) = δ (h=hMSR) y∈Sscore(x, y) +∑S ′ ∈hypo(S,r)∪hyper(S,r)Phase II. Identify lemma senses: areas and centres∑ y ∈ S ′ score(x, y))2 ∗ dist(S, S ′ )1. synAtt(x) = {S : S = {S : fit(x, S) ∨ weak fit(x, S)}, and S is aconnected hypernymy graph}2. maxScore(x, S) = score (x, max S∈S score(x, S))

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!