PhD thesis - School of Informatics - University of Edinburgh
Chapter 3. Tracking English Inclusions in German
The F-score for the space travel test data is almost 7 points lower than that obtained for the development set, a drop caused by both lower precision and lower recall. Although the classifier overgenerates on the development set for this particular domain, the fact that the scores on the unseen test data are relatively consistent across all three domains is a positive result. Moreover, each data set is relatively small, which makes it difficult to draw firm conclusions. In fact, the test and development data on space travel differ slightly in nature, as can be seen in Table 3.1. While both sets contain a similar percentage of English inclusions (2.8% versus 3%), those in the test set are repeated far less often than those in the development set, which is reflected in their type-token ratios of 0.33 and 0.15, respectively. The higher scores on the space travel development data could therefore be due to the larger number of repetitions of English inclusions in that set.
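The type-token ratio referred to above is simply the number of distinct inclusion types divided by the total number of inclusion tokens. A minimal sketch of the computation (the token lists below are invented for illustration, not drawn from the thesis data):

```python
def type_token_ratio(tokens):
    """Distinct types divided by total tokens; lower values mean more repetition."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0

# Invented toy data: heavy repetition drives the ratio down, as in the
# space travel development set (0.15) versus its test set (0.33).
repetitive = ["Shuttle", "Shuttle", "Shuttle", "Crew", "Crew", "Countdown"]
varied = ["Shuttle", "Crew", "Countdown", "Launch", "Orbit", "Docking"]
print(type_token_ratio(repetitive))  # 3 types / 6 tokens = 0.5
print(type_token_ratio(varied))      # 6 types / 6 tokens = 1.0
```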
The full-system F-scores for the EU test data are considerably higher than for the development set (84.42 versus 65.22 points). This is not surprising, since the EU development data contains only 30 different English inclusions, fewer than 1% of all types, which made it an unusual data set on which to evaluate the classifier. Error analysis was therefore focused mainly on the output for the other two data sets. The EU test data, on the other hand, contains 86 different English inclusions, i.e. nearly three times as many types as the development data (see Table 3.1). Considering that the English inclusion classifier performs as well on the unseen EU test data as on the other two test sets, it can be concluded that the system design decisions and post-processing rules are general enough to apply to documents from different domains.
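The F-scores quoted throughout (e.g. 84.42 versus 65.22) are the standard balanced F-measure, the harmonic mean of precision and recall. A brief sketch of the metric (the function and example values are illustrative, not the thesis's evaluation code):

```python
def f_score(precision, recall, beta=1.0):
    """Balanced F-measure: the harmonic mean of precision and recall when beta == 1."""
    if precision == 0 and recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)

# Illustrative values only: with precision 0.85 and recall 0.84,
# the balanced F-score lands between the two, at about 84.5 points.
print(round(100 * f_score(0.85, 0.84), 2))
```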
The best overall F-scores on all six data sets are obtained when combining the full system with a second consistency-checking run (Internet test data: F=84.78, space travel test data: F=84.66, EU test data: F=84.68). This second run essentially ensures that all English inclusions found in the first run are classified consistently within each document, by applying an automatically generated on-the-fly gazetteer; this setup was explained in more detail in Section 3.3.6. The results listed in Table 3.14 show that the improvement in F-score is always caused by an increase in recall that outweighs a smaller decrease in precision. While this improvement is essential for document classification, particularly when comparing different classifiers, it is unlikely to be beneficial when performing language classification on tokens in individual sentences, for example during the text analysis of a TTS synthesis system.
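A consistency-checking pass of this kind can be sketched as follows. The token/label representation and the "EN"/"DE" tags are assumptions made for illustration; the thesis's actual implementation (Section 3.3.6) may differ:

```python
def consistency_check(document):
    """Second pass: collect every token labelled English in the first pass
    into an on-the-fly, per-document gazetteer, then relabel all other
    occurrences of those tokens as English as well."""
    gazetteer = {token for token, lang in document if lang == "EN"}
    return [(token, "EN" if token in gazetteer else lang)
            for token, lang in document]

# An occurrence missed in the first pass is recovered (recall rises), at the
# risk of relabelling a genuine German homograph (precision dips slightly).
doc = [("Das", "DE"), ("Shuttle", "EN"), ("und", "DE"), ("Shuttle", "DE")]
print(consistency_check(doc))
```

Building the gazetteer per document, rather than globally, matches the description above: consistency is enforced within each document, so a token used as English in one text does not force that label onto every other text.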