A Wordnet from the Ground Up

More documents

Recommendations

Info

4.5. Hybrid Combinations 1314.5.1 Classifiers for lexical-semantic relationsMSRs should extract wordnet relation instances well: recall is high (up to the limit ofthe size of the vocabulary of the underlying corpus) since any LU pair gets a value.Yet, high values do not tell us what kind of a relation links a given LU pair. Weneed to attach relation labels to the LU pairs related strongly enough. We also need todetermine when two LUs are connected strongly enough to be in an wordnet relation.No threshold on the MSR values answer this question straight (Section 3.4.5). Now, tolabel LU pairs by relation labels or a catch-all unrelated label is a typical classificationproblem, for which Machine Learning is a tool of choice.Separation into several classes is a harder classification task. Many algorithmswork for two classes only or are better tuned for the two-way scenario. A wordnet’sstructure is fully defined by all its relations, but hypernymy is central, especially fornouns. Our first attempt is a classifier which assigns pairs of LU to the positive classclose hypernymy or near-synonymy or to other. Experience of work on MSRs andpattern-based methods suggests that a finer-grained subdivision of the positive class isvery hard.Snow et al. (2005) proposed a supervised ML method of extracting hypernymyinstances. They started from the idea of a supervised algorithm to combine a largenumber of lexico-syntactic patterns into a binary classifier of hypernymic LU pairs.The patterns had been extracted from a large corpus parsed by the dependency parserMiniPar (Lin, 1993). 752311 noun pairs 〈n i , n j 〉 from PWN 2.0 at a distance nolonger than four dependency links in the parse tree have been identified and classifiedas Known Hypernyms (14387) and Known Non-Hypernyms (737924, the ratio 1:50).This was based on the fact that n j is an ancestor of the first sense of n i in the PWN2.0 hypernymy structure. Only “frequently-used” senses of each noun were taken intoaccount.Patterns were generated from classified noun pairs, as descriptions of the dependencypaths that linked nouns in the occurrences. Such defined patterns are a slightlyextended version of lexico-syntactic patterns in MSR construction (Section 3.4.2).Naïve Bayes and logistic regression algorithms were used to train classifiers on thecollected data. Testing, done on noun pairs labelled in relation to PWN, was to distinguishnon-hypernymic pairs from hypernym pairs at unrestricted distance. The bestF-score in 10-fold cross validation was 0.348.Next, Snow et al. (2005) combined a classifier of coordinate nouns (with a commonhypernymic ancestor) with a hypernym classifier and other classifiers based on suchsources as Wikipedia or PWN. In an evaluation on 5387 manually labelled noun pairs,0.3268 was the best F-score for the corpus-based only models (without the use ofstructural information such as in the Wikipedia).
132 Chapter 4. Extracting Relation InstancesKennedy (2006) analysed several modified versions of this method. The modificationsconcerned the data in the training/test corpus (varying the positive-to-negativepair ratio and the method of undersampling negative examples) and small differencesin the way of formatting dependency paths. An additional classifiers based on a versionof the Supported Vector Machines algorithm (Joachims, 2002) was applied too,achieving the best F-score 0.633 for a combination of a classifier and filtering basedon Roget’s Thesaurus.Zhang et al. (2006) explored different types of syntactic dependencies at differentlevels of granularity in the construction of classifiers to find occurrences of relationshipsbetween named entities. Five main kinds of relationships with 24 different subtypeswere considered. This approach is broadly similar but the different objective makes acomparison of the results difficult.ML methods of extracting hypernymy pairs usually take lexico-syntactic featuresdirectly to build a classifier. Tens of thousands of features are typical, each carryingvery sparse information. Most of such information “tells” the classifier about variousaspects of semantic relatedness. Features that point to specific lexico-semantic relationsare rare. Section 3.4.5 notes that near-synonyms and close hypernyms/hyponyms of anLU u would be expected close to the top of the list of LUs most semantically related tou, generated by a good MSR. An application of a syntactic analyser is also assumed:a deep parser in (Zhang et al., 2006) or a shallow dependency parser in (Snow et al.,2005, Kennedy, 2006). For many languages such tools are not available yet.We propose to extract hypernymy pairs by relaxing both assumption. There aretwo phases (Piasecki et al., 2008):1. extract the generic relation of semantic relatedness modelled by some MSR,2. identify hypernymy instances – pairs of LUs – from the MSR’s results.The first phase can use all kinds of information that describes the semantics ofLUs, depending on the MSR extraction method. The second phase concentrates ongroups of semantically related LUs and applies specialised tests that distinguish specificlexico-semantic relations as subtypes of semantic relatedness. The tasks of the firstphase are preliminary filtering and problem complexity reduction, so during the secondphase a broader variety of ML methods can be used. An MSR of good accuracy can(by way of its high values) associate LUs that extremely rarely occur close by inthe corpus at hand. Note that such occurrences are the precondition on any patternbasedmethod. MSRs condense information otherwise distributed among many lexicosyntacticpatterns; in phase 2 we can concentrate on the most promising pairs.The only assumption is the availability of a highly accurate MSR. During experimentswe used an MSR based on the Rank Weight Function transformation [MSR RW F ],an earlier version of the Generalised RWF presented in Section 3.4.4. MSR RW F dif-
Page 1 and 2:
A Wordnetfrom the Ground Up
Page 3 and 4:
Work financed by the Polish Ministr
Page 7 and 8:
6 Prefaceheartfelt thanks go to all
Page 9:
8 Chapter 1. Motivation, Goals, Ear
Page 12 and 13:
1.1. Motivation 11[a] special form
Page 14 and 15:
1.1. Motivation 13Affect (Strappara
Page 16 and 17:
1.2. The Goals of the plWordNet Pro
Page 18 and 19:
1.2. The Goals of the plWordNet Pro
Page 20 and 21:
1.3. Early Decisions 19Merge Model:
Page 22:
1.3. Early Decisions 214. On the ot
Page 25 and 26:
24 Chapter 2. Building a Wordnet Co
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
Page 37 and 38:
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
Page 45 and 46:
Page 47 and 48:
Page 49 and 50:
48 Chapter 3. Discovering Semantic
Page 51 and 52:
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
Page 59 and 60:
Page 61 and 62:
Page 63 and 64:
Page 65 and 66:
Page 67 and 68:
Page 69 and 70:
Page 71 and 72:
Page 73 and 74:
Page 75 and 76:
Page 77 and 78:
Page 79 and 80:
Page 81 and 82: 80 Chapter 3. Discovering Semantic
Page 103 and 104: 102 Chapter 4. Extracting Relation
Page 131: 130 Chapter 4. Extracting Relation
Page 167 and 168: 166 Chapter 5. Polish WordNet Today
Page 183 and 184:
182 Chapter 5. Polish WordNet Today
Page 186 and 187:
Appendix ATests for Lexico-semantic
Page 188 and 189:
187Test for adjectives (T. IX)1. p1
Page 190 and 191:
189RelatednessTest for nouns (T. XV
Page 192 and 193:
BibliographyAgarwal, Abhaya and Alo
Page 194 and 195:
Bibliography 193on Deep Lexical Acq
Page 196 and 197:
Bibliography 195Derwojedowa, Magdal
Page 198 and 199:
Bibliography 197Grefenstette, Grego
Page 200 and 201:
Bibliography 199Kurc, Roman. (2008)
Page 202 and 203:
Bibliography 201Mohammad, Saif and
Page 204 and 205:
Bibliography 203. (2006) “The pot
Page 206 and 207:
Bibliography 205and Technology 7(1-
Page 208 and 209:
List of Tables2.1 The size of the c
Page 210 and 211:
List of Figures2.1 The LU perspecti
Page 212 and 213:
List of Figures 2114.16 Completely
Page 214 and 215:
Index 213CBC, see Clustering by Com
Page 216 and 217:
Index 215169, 177, 178, 180, 182hyp
Page 218 and 219:
Index 217mutual hypernymy, 24Mutual
Page 220 and 221:
Index 219SUMO, 14Supported Vector M
Page 222:
A language without a wordnet is at
show all

A Wordnet from the Ground Up

Create successful ePaper yourself

Delete template?

Save as template?