06.08.2015 Views

A Wordnet from the Ground Up

A Wordnet from the Ground Up - School of Information Technology ...

A Wordnet from the Ground Up - School of Information Technology ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

42 Chapter 2. Building a <strong>Wordnet</strong> Coreprocessed by <strong>the</strong> morphological analyser Morfeusz (Woliński, 2006) 8 . The inflectionalproperties of word forms are specified in templates by <strong>the</strong> IPIC tagset codes. In keepingwith <strong>the</strong> definition of <strong>the</strong> lexico-semantic relations as linking synsets (Section 2.2),a relation must be valid for any pair of LUs <strong>from</strong> both <strong>the</strong> source and <strong>the</strong> target synset.The substitution test window lets <strong>the</strong> user choose all possible pairs of LUs (<strong>from</strong> bothsynsets) and generate instances of <strong>the</strong> test. The mechanism of tests makes plWNAppdifferent <strong>from</strong> o<strong>the</strong>r wordnet editing tools such as DEBVisDic, but we have not yetevaluated <strong>the</strong> influence of <strong>the</strong> substitution tests on <strong>the</strong> quality of plWordNet.The structuring of plWNApp into two main screen-perspectives (stemming <strong>from</strong> <strong>the</strong>initial separation of broad synset construction and identification of relation instances)appeared to be incompatible with user expectations. The linguists signalled severaltimes that a serious weakness of <strong>the</strong> application was <strong>the</strong> inability to show on one screenall synsets and all relation instances to which an LU belongs, especially in <strong>the</strong> caseof relations that link LUs directly in plWordNet 9 . This request, however, is difficult tomeet without assuming <strong>the</strong> availability of very large high-resolution monitors. Somelinguists also strongly preferred keyboard interaction, and that discouraged fur<strong>the</strong>rdevelopment of graph-based interface, which would require using <strong>the</strong> mouse. On <strong>the</strong>o<strong>the</strong>r hand, we found a two-perspective GUI quite successful in a recent extension ofplWNApp to <strong>the</strong> WordNet Weaver (Section 4.5.3).In order to improve <strong>the</strong> facilities of browsing LUs and <strong>the</strong> associated relationinstances, we introduced an additional perspective (screen) of synset editing. Thisperspective, used for browsing synsets, has a layout similar to <strong>the</strong> LU perspective:a large list of synsets on <strong>the</strong> left (rich filtering possibilities), and on <strong>the</strong> right <strong>the</strong> tabularview of <strong>the</strong> selected synset relations, plus all synset editing panels. The screen and<strong>the</strong> LU perspective are synchronised: <strong>the</strong> filter setting and <strong>the</strong> selected synset or LU aretransferred back and forth when switching. This facilitates browsing <strong>the</strong> LU relationsof <strong>the</strong> LUs which belong to <strong>the</strong> given synset.After <strong>the</strong> construction of <strong>the</strong> initial broad synsets, we proceeded to <strong>the</strong> next step: <strong>the</strong>formation of a net of lexico-semantic relations, that is to say, <strong>the</strong> proper core plWordNet.Two main problems arose. Synsets were too wide; <strong>the</strong>y included not only <strong>the</strong> expectednear-synonyms, but also hypernyms, co-hyponyms and even meronyms. Secondly,many synsets overlapped; many of such synsets belonged to <strong>the</strong> same domain, but<strong>the</strong>ir construction was separated in time and started with different LUs. The linguistshad to extract hypernyms <strong>from</strong> <strong>the</strong> existing synsets and to divide synsets into moreprecise, smaller sets, using detailed guidelines including a number of substitution tests8 The tests were shown alongside all content-adding actions, so <strong>the</strong>ir content was often obvious. Theusers postulated <strong>the</strong> replacement of obligatory tests with an on-demand presentation of <strong>the</strong> instantiatedtest.9 The logistic and administrative circumstances of <strong>the</strong> project made it very hard to correct that, once<strong>the</strong> implementation has been largely completed.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!