06.08.2015 Views

A Wordnet from the Ground Up

A Wordnet from the Ground Up - School of Information Technology ...

A Wordnet from the Ground Up - School of Information Technology ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

174 Chapter 5. Polish WordNet Today and Tomorrow{bok ‘side’, krawędź ‘edge’, skraj ‘brink’, kraj ‘brink’, kant‘edge’, brzeg ‘margin’, obrzeże ‘margin’}{dobra strona ‘good side’, plus ‘plus’, cnota ‘virtue’, walor‘value’, pozytyw ‘positive’, przymiot ‘attribute’, wartość‘value’, zaleta ‘advantage’}{finanse ‘finances’, fundusz ‘fund’, kapitał ‘capital’, budżet‘budget’, środki finansowe ‘financial means’, fundusze ‘funds’}{grób ‘grave’, mogiła ‘grave’, grobowiec ‘thomb’, nagrobek‘gravestone’, miejsce pochówku ‘place of burial’}{istota ‘essence’, sens ‘sens’, sedno ‘core’, główne zagadnienie‘main issue’, meritum ‘crux’, kwintensencja ‘quintessence’, jądro‘gist’}{nierozdzielność ‘inseparability’, nierozerwalność‘indissolubility’, jednolitość ‘uniformity’, spoistość‘cohesiveness’, nierozłączność ‘inseparability’, jedność ‘unity’,spójność ‘cohession’}The verbal and adjectival synsets are more diverse in size (Table 5.5), but it shouldbe emphasised that <strong>the</strong> verbal part has been only expanded a little, and <strong>the</strong> adjectivalpart is <strong>the</strong> same as in <strong>the</strong> core plWordNet. The numbers of synsets and LUs are muchsmaller than for <strong>the</strong> nominal part, so a lot of <strong>the</strong> more specific LUs have not beenadded yet.Percentage of lemmas belonging to <strong>the</strong> n synsets [%]1 2 3 4 5 6 7 8 9 ≥ 10Nouns 76.70 17.68 3.88 1.08 0.38 0.18 0.07 0.03 0.00 0.00Verbs 79.41 14.87 4.03 1.23 0.29 0.17 0.00 0.00 0.00 0.00Adjectives 72.99 15.90 6.56 2.66 1.10 0.08 0.23 0.15 0.15 0.18Table 5.6: The number of synsets to which a lemma belongsTable 5.6 presents a more detailed picture of <strong>the</strong> lemma polysemy. The numbers ofmonosemous lemmas appear in Table 5.3. Here <strong>the</strong>y are expressed as <strong>the</strong> percentagesin <strong>the</strong> first column. It is worth noticing that <strong>the</strong> percentage of monosemous nominallemmas is lower than for <strong>the</strong> o<strong>the</strong>r two categories, in contrast with <strong>the</strong> much higher

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!