
A Wordnet from the Ground Up


164 Chapter 4. Extracting Relation Instances

introduced mainly for comparison with other approaches (but it is unnatural from the point of view of the linguists' work);
• Best P ≥1 – one closest attachment site is evaluated (similarly to the P ≥1 in Sec. 4.5.4).

Table 4.6 presents the results with a distinction between strong fitness (marked S) and weak fitness (W). As expected, the accuracy of suggestions based on strong fitness is significantly higher than for weak fitness. Because of the intended use, we assumed that not only direct hits are useful: if the proposal is close enough to the correct place in the plWordNet structure, then it is also a valuable suggestion. The same applies if there is meronymy or holonymy between the suggested and the correct synset.

The results are encouraging. Almost half of the suggestions based on strong fitness are in close proximity to the correct place in the wordnet structure. If making only one suggestion were required, the accuracy would be boosted to 73.58%. For our goal, this is an artificial constraint, but it shows how well the AAA algorithm would behave in a fully unsupervised setting. Our ultimate goal, though, is to create a tool that supports a linguist's work, so the result for the Best P ≥1 strategy shows more meaningful data: for how many words there is at least one useful suggestion. The AAA algorithms suggested at least one strictly correct attachment site for 42.81% of the words, or for 81.96% of the words if we also count close proposals as useful.

Comparison to other ways of automatically expanding a wordnet can be misleading, because our primary goal was to construct a tool that facilitates and streamlines the linguists' work. Still, even if we compare our automatic evaluation with the results in (Widdows, 2003) during comparable tests on the PWN, our results seem better. For example, we had 34.96% for the highest-scored proposal (One S+W in Table 4.6), while Widdows reports a best accuracy of 15% for "correct classifications in the top 4 places" (among the 4 highest-scored proposals). Our comparable result for the top 5 proposals is even higher, 42.81%. The best results reported by Alfonseca and Manandhar (2002) and Witschel (2005) are also at the level of about 15%, but were achieved in tests on a much smaller scale; Witschel also performed tests only in two selected domains. The algorithm of Snow et al. (2006), contrary to ours, can be applied only to probabilistic evidence.

We made two assumptions: attachment based on the activation area, and the simultaneous use of multiple knowledge sources. These assumptions appear to have been successful in boosting the accuracy above the level of the MSR-only decisions (which is roughly represented in our approach by weak fitness).

WNW seems to improve the linguist's efficiency considerably, but longer observation is necessary for a reliable justification.

The AAA algorithm is overburdened with parameters. Further research is required to find either a simplified form or an effective method of parameter optimization.
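The Best P ≥1 style of evaluation described above can be sketched in a few lines: for each test word we check whether at least one of its proposed attachment sites is strictly correct, or at least "close" to the correct synset in the wordnet structure. This is a minimal illustrative sketch, not the thesis implementation; the function name, data layout, and the closeness predicate are assumptions made for the example.

```python
def best_p_geq_1(suggestions, gold, is_close):
    """Hypothetical sketch of a Best P>=1-style evaluation.

    suggestions: dict mapping a word to a list of proposed synset ids
    gold:        dict mapping a word to its correct synset id
    is_close:    predicate (proposed, correct) -> bool, true when the
                 proposal is near the correct place in the structure
    Returns (strict accuracy, accuracy counting close hits as useful).
    """
    strict = close = 0
    for word, proposed in suggestions.items():
        correct = gold[word]
        if correct in proposed:
            strict += 1
        if any(p == correct or is_close(p, correct) for p in proposed):
            close += 1
    n = len(suggestions)
    return strict / n, close / n

# Toy example: one word gets a strictly correct proposal, the other
# only a proposal that we pretend is close in the wordnet graph.
sugg = {"kot": ["animal.n.01", "feline.n.01"], "pies": ["artifact.n.01"]}
gold = {"kot": "feline.n.01", "pies": "dog.n.01"}
near = {("artifact.n.01", "dog.n.01")}  # assumed closeness relation
strict_acc, close_acc = best_p_geq_1(sugg, gold, lambda a, b: (a, b) in near)
# strict_acc is 0.5, close_acc is 1.0
```

In a real setting the `is_close` predicate would be a graph-distance check over hypernymy (plus meronymy/holonymy) links, matching the relaxed notion of a "useful" suggestion used in the evaluation.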
