Onto.PT: Towards the Automatic Construction of a Lexical Ontology ...

Chapter 6. Thesaurus Enrichment

6.2 Evaluation of the assignment procedure

The evaluation of the assignment procedure has two main goals. First, it quantifies the performance of the assignment algorithm. Second, it enables the selection of the most adequate settings, including the similarity measure and the best threshold σ to use in the integration of the synpairs of PAPEL/CARTÃO in TeP.

6.2.1 The gold resource

To compare the performance of the assignment algorithm using different settings, we randomly selected 355 noun synpairs of PAPEL 2.0 (Gonçalo Oliveira et al., 2010b) and had two human annotators assign them to the synsets of TeP 2.0 (Maziero et al., 2008). Before their selection, we made sure that all 355 synpairs had at least one candidate synset in TeP. The manually assigned synpairs constitute a small gold collection, used to evaluate the procedure with different settings. Even though the creation of this resource was a time-consuming task, we now have a reference that helps us understand the behaviour of the algorithm. Furthermore, it is now possible to repeat this kind of evaluation as many times as needed.

Lexical-semantic knowledge is typically subjective and thus hard to evaluate. Its evaluation depends heavily on the vocabulary range and intuition of the human annotator. Moreover, when it comes to the division of words into senses, there is no consensus even among expert lexicographers, because word senses are most of the time fuzzy and because language evolves every day (see section 4.3). In order to minimise this problem, both annotators manually selected the assignments for the same 355 synpairs. On average, there were 4.31 candidate synsets for each synpair, with a standard deviation of 3.27. Also on average, the first annotator assigned each synpair to 2.03±1.37 synsets, while, for the second, this number was 2.64±2.30.
Their assignments matched in 70% of the cases and their kappa agreement was 0.43, which means fair/moderate agreement (Landis and Koch, 1977; Green, 1997) and shows, once again, how subjective it is to evaluate this kind of knowledge.

6.2.2 Scoring the assignments

In order to select the best assignment settings, we performed an extensive comparison of the assignment performance, using different similarity measures (introduced in section 6.1.2) and different thresholds σ. In all the experimentation runs, we used all the noun synpairs in CARTÃO, which includes PAPEL 3.0 and the synpairs extracted from Wiktionary.PT and DA, to establish the synonymy network for computing similarities. More about the size of this network and its coverage by TeP can be found in section 6.4.1.

The evaluation score of each setting was obtained using typical information retrieval measures, namely precision, recall and F-score. For a synpair in the set of assigned synpairs, p_i ∈ P, these measures are computed as follows:

$$\mathit{Precision}_i = \frac{|\mathit{Selected}_i \cap \mathit{Correct}_i|}{|\mathit{Selected}_i|} \qquad \mathit{Precision} = \frac{1}{|P|} \sum_{i=1}^{|P|} \mathit{Precision}_i$$
$$\mathit{Recall}_i = \frac{|\mathit{Selected}_i \cap \mathit{Correct}_i|}{|\mathit{Correct}_i|} \qquad \mathit{Recall} = \frac{1}{|P|} \sum_{i=1}^{|P|} \mathit{Recall}_i$$

$$F_\beta = (1 + \beta^2) \times \frac{\mathit{Precision} \times \mathit{Recall}}{(\beta^2 \times \mathit{Precision}) + \mathit{Recall}}$$

Besides the typical F1-score, we computed F0.5, which favours precision over recall. We prefer to have a more reliable resource, rather than a larger resource with lower correctness. Furthermore, the synpairs not assigned to synsets will have a second chance of being integrated in the thesaurus, during the clustering step.

Since there could be more than one adequate synset for a synpair, in addition to the aforementioned measures, we computed a relaxed recall (RelRecall). For a single synpair, RelRecall is 1 if at least one correct synset is selected:

$$\mathit{RelRecall}_i = \begin{cases} 1, & \text{if } |\mathit{Selected}_i \cap \mathit{Correct}_i| > 0 \\ 0, & \text{otherwise} \end{cases} \qquad \mathit{RelRecall} = \frac{1}{|P|} \sum_{i=1}^{|P|} \mathit{RelRecall}_i$$

Using RelRecall, we may as well compute the relaxed Fβ, RelFβ:

$$\mathit{RelF}_\beta = (1 + \beta^2) \times \frac{\mathit{Precision} \times \mathit{RelRecall}}{(\beta^2 \times \mathit{Precision}) + \mathit{RelRecall}}$$

6.2.3 Comparing different assignment settings

Tables 6.1, 6.2, and 6.3 present the evaluation scores of assignments using different settings, respectively against the references of annotator 1, annotator 2, and the intersection between annotator 1 and annotator 2. For each synpair, the intersection reference includes just the synsets selected by both annotators, and consequently yields lower scores. Also, although we ran this evaluation over a wider range of values for σ, for the sake of simplicity, we only present those most relevant for understanding the behaviour of the algorithm.

In the tables, we have also included the scores obtained if all the candidates were selected (All), which can be seen as a baseline. Another baseline for precision is the random chance of selecting a correct candidate, which is 59.4%, 67.8% and 48.8%, respectively for annotator 1, annotator 2 and the intersection.
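As a concrete illustration of the measures defined in section 6.2.2, the following is a minimal Python sketch, not the implementation used in this work. The function name `macro_scores`, the dictionary layout and the toy synpair and synset identifiers are hypothetical. It also assumes that P is the set of synpairs in the gold reference and that a synpair with no selected synset contributes 0 to precision, which the text does not state.

```python
from typing import Dict, Set

# Hypothetical layout: synpair id -> set of synset ids.
Assignments = Dict[str, Set[str]]


def f_beta(precision: float, recall: float, beta: float) -> float:
    """F-beta from already averaged precision and recall."""
    denom = beta ** 2 * precision + recall
    return (1 + beta ** 2) * precision * recall / denom if denom else 0.0


def macro_scores(selected: Assignments, correct: Assignments,
                 beta: float = 1.0) -> Dict[str, float]:
    """Average per-synpair precision, recall and relaxed recall over P."""
    n = len(correct)  # Assumption: P is the set of synpairs in the gold reference.
    precision = recall = rel_recall = 0.0
    for pair, gold in correct.items():
        chosen = selected.get(pair, set())
        hits = len(chosen & gold)
        # Assumption: an empty selection counts as 0 precision.
        precision += hits / len(chosen) if chosen else 0.0
        recall += hits / len(gold) if gold else 0.0
        rel_recall += 1.0 if hits > 0 else 0.0
    precision, recall, rel_recall = precision / n, recall / n, rel_recall / n
    return {
        "precision": precision,
        "recall": recall,
        "rel_recall": rel_recall,
        "f": f_beta(precision, recall, beta),
        "rel_f": f_beta(precision, rel_recall, beta),
    }


# Toy data, invented for illustration only.
gold = {"pair_a": {"s1", "s2"}, "pair_b": {"s3"}}
chosen = {"pair_a": {"s1"}, "pair_b": {"s3", "s4"}}
scores = macro_scores(chosen, gold, beta=0.5)
print(scores)
```

Passing the full candidate sets as `selected` would reproduce the All baseline under the same assumptions, since every correct synset is then selected and recall is 1 by construction.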
The similarity measures are an indicator for the synset assignment, and they are applied in two different modes:

• Best: only the best candidate synset with similarity equal to or higher than σ is selected. More than one synset may be selected, but only if there is a tie.

• All: all the synsets with similarity equal to or higher than σ are selected.

As expected, better precisions are obtained with higher values of σ. The best precision (around 82% and 92%) is consistently obtained with the cosine measure, mode All, and σ = 0.35. The best recall is also unsurprising: it is, of course, 100% for the baseline using all candidate synsets.
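The two selection modes can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions rather than the code used in the experiments: the function `select_synsets`, its argument layout (a mapping from candidate synset identifier to its similarity with the synpair) and the example values are all hypothetical.

```python
from typing import Dict, Set


def select_synsets(similarities: Dict[str, float], sigma: float,
                   mode: str = "best") -> Set[str]:
    """Select candidate synsets for one synpair under the Best or All mode."""
    # Keep only candidates whose similarity is equal to or higher than sigma.
    eligible = {synset: sim for synset, sim in similarities.items() if sim >= sigma}
    if mode == "all":
        return set(eligible)
    if mode == "best":
        if not eligible:
            return set()  # no candidate reaches the threshold
        top = max(eligible.values())
        # Several synsets are returned only when they tie for the top similarity.
        return {synset for synset, sim in eligible.items() if sim == top}
    raise ValueError(f"unknown mode: {mode!r}")


# Invented similarity values for four hypothetical candidate synsets.
sims = {"s1": 0.5, "s2": 0.5, "s3": 0.4, "s4": 0.1}
print(select_synsets(sims, sigma=0.35, mode="best"))
print(select_synsets(sims, sigma=0.35, mode="all"))
```

With these values, Best returns the two tied synsets and All additionally returns `s3`. Raising σ above 0.5 leaves the synpair unassigned in both modes, which is how a higher threshold trades recall for precision.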
