06.08.2015 Views

A Wordnet from the Ground Up

A Wordnet from the Ground Up - School of Information Technology ...

A Wordnet from the Ground Up - School of Information Technology ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

38 Chapter 2. Building a <strong>Wordnet</strong> Corehypernymy structure. Hypernymy among broadly conceived synset had not actuallybeen a primary concern: various o<strong>the</strong>r dependencies had been introduced instead. Itis our experience that it would have been much better to build simultaneously <strong>the</strong>structure of all wordnet relations.At this stage, fur<strong>the</strong>r disambiguation and correction of <strong>the</strong> original list of LUs wasperformed. The linguists worked out synsets wide enough for some hypernyms andquasi-synonyms to be listed as synonyms. The substitution test – <strong>the</strong> possibility ofusing two words in <strong>the</strong> same context – is not always precise enough to help distinguish<strong>the</strong>se relations. Also, expressive and vulgar vocabulary was relocated to hyponymicsynsets, obsolete vocabulary – removed.The team of linguists was located in different cities, so we needed a system tosupport distributed work. A support system should not only enable flexible access andkeep <strong>the</strong> integrity of <strong>the</strong> database, but, we assumed, also protect against inconsistenciesand facilitate some management of <strong>the</strong> work, including options for reporting errors andtracing corrections. Both assumption have been heavily revised by practice, as it willbe discussed shortly.The story of wordnet editors begins with Grinder (Tengi, 1998), a software toolthat checked <strong>the</strong> PWN source files and converted <strong>the</strong>m into <strong>the</strong> lexical database. Linguistshad to edit <strong>the</strong> source files. Syntactic and structural errors, such as pointers tononexistent database elements, were identified only during compilation.The EWN project (Vossen, 2002) constructed Polaris, an editor, and Periscope,a graphical database viewer. Both were commercial tools, tightly coupled with certainproperties of <strong>the</strong> EWN database structure. The limitations of Polaris prompted <strong>the</strong>implementation of a new tool, VisDic, for <strong>the</strong> Czech WordNet project (Horák and Smrž,2004). In VisDic, relation definitions are still written in text windows, but an XMLformat is used and some immediate browsing is possible in <strong>the</strong> tool, for example bidirectionalbrowsing of graphs of semantic relations. VisDic is available for research.VisDic was a monolithic application that worked directly on XML files. DEBVis-Dic (Horák et al., 2006), is a lexical database editor that reimplements and extends<strong>the</strong> functionality of VisDic. It is based on <strong>the</strong> client-server architecture and an XMLdatabase server. Both tools are oriented toward editing a wordnet synchronized withwordnets for o<strong>the</strong>r languages by <strong>the</strong> Interlingua Index (Vossen, 2002). That complicates<strong>the</strong>ir basic structure and user interface. Those characteristic features wentbeyond our needs, and anyhow DEBVisDic was not known to be available yet at <strong>the</strong>start of <strong>the</strong> plWordNet project. We decided, <strong>the</strong>refore, to build our own wordnet editor,plWordNetApp [plWNApp] (Piasecki and Koczan, 2007).The plWordNetApp user interface was intended to support <strong>the</strong> division of work onplWordNet construction into <strong>the</strong> steps originating <strong>from</strong> <strong>the</strong> assumed plan of <strong>the</strong> wholeprocess. The DEBVisDic tool is much more general: we designed plWNApp screens

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!