3.4. Measures of Semantic Relatedness 67given constraint template. Lists of lexical elements can be freely defined, but <strong>the</strong>y aremostly acquired directly <strong>from</strong> corpora, e.g. a list of all adjectives occurring in a corpus.A co-incidence matrix based on constraints has <strong>the</strong> following scheme:M[w i , c t (x)] (3.4)w i is one of <strong>the</strong> target LUs, c t (x) is <strong>the</strong> template c t lexicalised with <strong>the</strong> LU x.A cell M[w i , c t (x)] stores <strong>the</strong> number of occurrences of w i in <strong>the</strong> corpus whichmet <strong>the</strong> lexico-morphosyntactic constraint c t (x). In order to simplify <strong>the</strong> description,we will refer to <strong>the</strong> constraints as features which describe <strong>the</strong> target LUs semantically,and to <strong>the</strong> cells as feature values.A constraint c t (x) is activated for <strong>the</strong> given occurrence of w i during matrix constructionwhen x occurs in <strong>the</strong> given sentence. Constraints are applied to morphosyntacticallytagged text. They can test token annotations in some positions referred toby offsets to <strong>the</strong> context match centre (<strong>the</strong> position of w i ) and can iterate across <strong>the</strong>whole sentence. All JOSKIPI-based constraints return Boolean values that depend on<strong>the</strong> given w i position and <strong>the</strong> surrounding sentence.Constraints of several types were tested for <strong>the</strong> description of nouns (Piasecki et al.,2007b). In <strong>the</strong> end, four types were selected as producing an MSR with <strong>the</strong> best resultsin WBST+H:AdjC – modification by a specific adjective or a specific adjectival participle,NcC – co-ordination with a a specific noun,NmgC – modification by a specific noun in <strong>the</strong> genitive case,VsbC – occurrence of a specific verb for which a given noun can be its subject,The AdjC constraint presented in a schematic form in Figure 3.5 is a example ofa constraint strongly based on morphosyntactic agreement – here on case, number andgender. Such constraints are relatively easy to recognise and have high accuracy inrecognition, see Table 3.6, discussed later. In AdjC, first we are looking for a particularadjective or an adjectival participle (specified by <strong>the</strong> base form) to <strong>the</strong> left of <strong>the</strong> targetLU N in <strong>the</strong> position 0:• <strong>the</strong> llook operator implements searching for tokens that meet <strong>the</strong> conditiongiven as its last argument,• $A is a variable used for iteration (all variable names start with ‘$’),• in and inter are set operators of inclusion and intersection,
68 Chapter 3. Discovering Semantic Relatednessor(and(llook(-1,-5,$A,and( in(flex[$A],{adj,pact,ppas}),inter(base[$A],{"particular base form "}),agrpp(0,$A,{nmb,gnd,cas},3))),or(only($A,-1,$Ad, in(flex[$Ad],{adjectival and adverbial grammatical classes,numerals and punctuation })),and(in(cas[0],nom,acc,dat,loc,inst,voc),<strong>the</strong>re is no o<strong>the</strong>r verb <strong>the</strong>n "być" between -1 and $A positionsnot(llook(-1,$A,$S,and(in(flex[$S], {nominal grammatical classes }),in(cas[$S],{nom,acc,dat,loc,inst,voc}),not( llook($S,$A,$P,equal(flex[$P],{prep})) ))))))),a symmetrical condition for <strong>the</strong> right context)Figure 3.5: Parts of a lexico-morphosyntactic constraint which describes nominal LUs via adjectivalmodification (AdjC)• flex returns a grammatical class 12 of <strong>the</strong> specified token,• adj, pact, ppas are mnemonics for grammatical classes of adjective and twoadjectival participles,• agrpp(0,$A,nmb,gnd,cas,3) is an operator that tests agreement between twospecified positions and according to <strong>the</strong> given list of grammatical categories 13 .After <strong>the</strong> lexical element A has been found and its position stored in $A, we need totest if no tokens between N and A make <strong>the</strong> modification of N by A impossible. Forexample, A may belong to a different noun phrase than N, so agreement is accidental.In <strong>the</strong> following steps of <strong>the</strong> constraint AdjC, <strong>the</strong>n, we test two situations that validate<strong>the</strong> modification:12 In <strong>the</strong> tagset of IPIC (Przepiórkowski, 2004), word forms are divided into 32 grammatical class,a division more fine-grained than parts of speech; this is motivated largely by morphological, derivationaland syntactic properties of word forms.13 The last parameter has a technical meaning for more advance uses of agrpp. It describes <strong>the</strong> numberof categories.
- Page 1 and 2:
A Wordnetfrom the Ground Up
- Page 3 and 4:
Work financed by the Polish Ministr
- Page 7 and 8:
6 Prefaceheartfelt thanks go to all
- Page 9:
8 Chapter 1. Motivation, Goals, Ear
- Page 12 and 13:
1.1. Motivation 11[a] special form
- Page 14 and 15:
1.1. Motivation 13Affect (Strappara
- Page 16 and 17:
1.2. The Goals of the plWordNet Pro
- Page 18 and 19: 1.2. The Goals of the plWordNet Pro
- Page 20 and 21: 1.3. Early Decisions 19Merge Model:
- Page 22: 1.3. Early Decisions 214. On the ot
- Page 25 and 26: 24 Chapter 2. Building a Wordnet Co
- Page 27 and 28: 26 Chapter 2. Building a Wordnet Co
- Page 29 and 30: 28 Chapter 2. Building a Wordnet Co
- Page 31 and 32: 30 Chapter 2. Building a Wordnet Co
- Page 33 and 34: 32 Chapter 2. Building a Wordnet Co
- Page 35 and 36: 34 Chapter 2. Building a Wordnet Co
- Page 37 and 38: 36 Chapter 2. Building a Wordnet Co
- Page 39 and 40: 38 Chapter 2. Building a Wordnet Co
- Page 41 and 42: 40 Chapter 2. Building a Wordnet Co
- Page 43 and 44: 42 Chapter 2. Building a Wordnet Co
- Page 45 and 46: 44 Chapter 2. Building a Wordnet Co
- Page 47 and 48: 46 Chapter 2. Building a Wordnet Co
- Page 49 and 50: 48 Chapter 3. Discovering Semantic
- Page 51 and 52: 50 Chapter 3. Discovering Semantic
- Page 53 and 54: 52 Chapter 3. Discovering Semantic
- Page 55 and 56: 54 Chapter 3. Discovering Semantic
- Page 57 and 58: 56 Chapter 3. Discovering Semantic
- Page 59 and 60: 58 Chapter 3. Discovering Semantic
- Page 61 and 62: 60 Chapter 3. Discovering Semantic
- Page 63 and 64: 62 Chapter 3. Discovering Semantic
- Page 65 and 66: 64 Chapter 3. Discovering Semantic
- Page 67: 66 Chapter 3. Discovering Semantic
- Page 71 and 72: 70 Chapter 3. Discovering Semantic
- Page 73 and 74: 72 Chapter 3. Discovering Semantic
- Page 75 and 76: 74 Chapter 3. Discovering Semantic
- Page 77 and 78: 76 Chapter 3. Discovering Semantic
- Page 79 and 80: 78 Chapter 3. Discovering Semantic
- Page 81 and 82: 80 Chapter 3. Discovering Semantic
- Page 83 and 84: 82 Chapter 3. Discovering Semantic
- Page 85 and 86: 84 Chapter 3. Discovering Semantic
- Page 87 and 88: 86 Chapter 3. Discovering Semantic
- Page 89 and 90: 88 Chapter 3. Discovering Semantic
- Page 91 and 92: 90 Chapter 3. Discovering Semantic
- Page 93 and 94: 92 Chapter 3. Discovering Semantic
- Page 95 and 96: 94 Chapter 3. Discovering Semantic
- Page 97 and 98: 96 Chapter 3. Discovering Semantic
- Page 99 and 100: 98 Chapter 3. Discovering Semantic
- Page 101 and 102: 100 Chapter 3. Discovering Semantic
- Page 103 and 104: 102 Chapter 4. Extracting Relation
- Page 105 and 106: 104 Chapter 4. Extracting Relation
- Page 107 and 108: 106 Chapter 4. Extracting Relation
- Page 109 and 110: 108 Chapter 4. Extracting Relation
- Page 111 and 112: 110 Chapter 4. Extracting Relation
- Page 113 and 114: 112 Chapter 4. Extracting Relation
- Page 115 and 116: 114 Chapter 4. Extracting Relation
- Page 117 and 118: 116 Chapter 4. Extracting Relation
- Page 119 and 120:
118 Chapter 4. Extracting Relation
- Page 121 and 122:
120 Chapter 4. Extracting Relation
- Page 123 and 124:
122 Chapter 4. Extracting Relation
- Page 125 and 126:
124 Chapter 4. Extracting Relation
- Page 127 and 128:
126 Chapter 4. Extracting Relation
- Page 129 and 130:
128 Chapter 4. Extracting Relation
- Page 131 and 132:
130 Chapter 4. Extracting Relation
- Page 133 and 134:
132 Chapter 4. Extracting Relation
- Page 135 and 136:
134 Chapter 4. Extracting Relation
- Page 137 and 138:
136 Chapter 4. Extracting Relation
- Page 139 and 140:
138 Chapter 4. Extracting Relation
- Page 141 and 142:
140 Chapter 4. Extracting Relation
- Page 143 and 144:
142 Chapter 4. Extracting Relation
- Page 145 and 146:
144 Chapter 4. Extracting Relation
- Page 147 and 148:
146 Chapter 4. Extracting Relation
- Page 149 and 150:
148 Chapter 4. Extracting Relation
- Page 151 and 152:
150 Chapter 4. Extracting Relation
- Page 153 and 154:
152 Chapter 4. Extracting Relation
- Page 155 and 156:
154 Chapter 4. Extracting Relation
- Page 157 and 158:
156 Chapter 4. Extracting Relation
- Page 159 and 160:
158 Chapter 4. Extracting Relation
- Page 161 and 162:
160 Chapter 4. Extracting Relation
- Page 163 and 164:
162 Chapter 4. Extracting Relation
- Page 165 and 166:
164 Chapter 4. Extracting Relation
- Page 167 and 168:
166 Chapter 5. Polish WordNet Today
- Page 169 and 170:
168 Chapter 5. Polish WordNet Today
- Page 171 and 172:
170 Chapter 5. Polish WordNet Today
- Page 173 and 174:
172 Chapter 5. Polish WordNet Today
- Page 175 and 176:
174 Chapter 5. Polish WordNet Today
- Page 177 and 178:
176 Chapter 5. Polish WordNet Today
- Page 179 and 180:
178 Chapter 5. Polish WordNet Today
- Page 181 and 182:
180 Chapter 5. Polish WordNet Today
- Page 183 and 184:
182 Chapter 5. Polish WordNet Today
- Page 186 and 187:
Appendix ATests for Lexico-semantic
- Page 188 and 189:
187Test for adjectives (T. IX)1. p1
- Page 190 and 191:
189RelatednessTest for nouns (T. XV
- Page 192 and 193:
BibliographyAgarwal, Abhaya and Alo
- Page 194 and 195:
Bibliography 193on Deep Lexical Acq
- Page 196 and 197:
Bibliography 195Derwojedowa, Magdal
- Page 198 and 199:
Bibliography 197Grefenstette, Grego
- Page 200 and 201:
Bibliography 199Kurc, Roman. (2008)
- Page 202 and 203:
Bibliography 201Mohammad, Saif and
- Page 204 and 205:
Bibliography 203. (2006) “The pot
- Page 206 and 207:
Bibliography 205and Technology 7(1-
- Page 208 and 209:
List of Tables2.1 The size of the c
- Page 210 and 211:
List of Figures2.1 The LU perspecti
- Page 212 and 213:
List of Figures 2114.16 Completely
- Page 214 and 215:
Index 213CBC, see Clustering by Com
- Page 216 and 217:
Index 215169, 177, 178, 180, 182hyp
- Page 218 and 219:
Index 217mutual hypernymy, 24Mutual
- Page 220 and 221:
Index 219SUMO, 14Supported Vector M
- Page 222:
A language without a wordnet is at