PhD thesis - School of Informatics - University of Edinburgh

More documents

Recommendations

Info

Chapter 4. System Extension to a New Language 113 Post-processing Accuracy Precision Recall F-score Δ F None 96.74% 82.91% 62.98% 71.59 16.57 Single letters 97.75 90.51% 72.52% 80.52 7.16 Ambiguous words 97.83 91.60% 74.14% 81.95 5.73 Person names 98.12% 86.53% 84.08% 85.29 2.39 Function words 98.21% 91.36% 81.54% 86.17 1.51 Currencies etc. 98.30% 91.08% 81.85% 86.22 1.46 Abbreviations 98.39% 90.87% 83.77% 87.18 0.50 Full System - CC 98.45% 91.60% 84.08% 87.68 - Table 4.4: Evaluation of the post-processing module with one type of post-processing removed at a time on the French development data. Δ F represents the change in F-score compared to the full English inclusion classifier without consistency checking (CC). post-processing are added to a gazetteer. This gazetteer is then checked on the fly to assure that tokens that were not already previously tagged by the system are classified correctly as well. Consistency checking is therefore mainly aimed at identifying En- glish inclusions which the POS tagger did not tag correctly. For example, the word Google was once incorrectly tagged as a present tense verb (VER:pres) and could therefore not be classified by the system initially. However, since the same token was also listed in the on-the-fly gazetteer which was generated for the particular document it occurred in, consistency checking resulted in the correct classification. Table 4.5 presents the performance of the full French and German systems with optional consistency checking on both the development and test data. The results show that consistency checking does not have the same effect on the French as it does on the German data. It only yields a small improvement in F-score of 0.45 points on the French development data but no improvement on the French test data. One reason for this discrepancy between languages could be the POS tagging of English inclusions. While English inclusions in the German development data are assigned on average 1.2 POS tags by TnT, the TreeTagger tags the English inclusions in the French development data only with 1.1 different POS tags. The latter is therefore slightly more consistent. The second reason is that English inclusions are repeated less often in the French data than in the German which is demonstrated in their TTRs (0.34 in French
Chapter 4. System Extension to a New Language 114 development and test sets versus 0.29 and 0.25 in German development and test sets, see Table 4.1). This means that the classifier is already less likely to miss inclusions which minimises the effect of consistency checking for French. Test set Development set Method Acc P R F Acc P R F French data FS 98.10% 88.59% 84.11% 86.29 98.45% 91.60% 84.08% 87.68 FS+CC 98.08% 88.35% 84.30% 86.28 98.49% 91.39% 85.09% 88.13 German data FS 97.93% 92.13% 75.82% 83.18 98.07% 93.48% 73.31% 82.17 FS+CC 98.13% 91.58% 78.92% 84.78 98.25% 92.75% 77.37% 84.37 Table 4.5: Evaluation of the full system (FS) with optional consistency checking (CC). 4.5 Chapter Summary This chapter described how the English inclusion classifier was successfully converted to a new language, French. The extended system is able to process either German or French text for identifying English inclusions. The system is a pipeline made up of sev- eral modules, including pre-processing, a lexicon, a search-engine, a post-processing and a document consistency checking module. The extension of the core system was carried out in only one person week and resulted in a system performance of 71.59 points in F-score on the French development data. A further week was spent on im- plementing the post-processing module which boosted the F-score to 87.68 points. A third week was required to select external language resources plus collect and annotate French evaluation data in the domain of internet and telecoms. The performance drop between the French development set and the unseen test sets is relatively small (1.85 in F-score) which means that the system does not seriously over- or undergenerate for this domain but results in an equally high performance on new data. This chapter also demonstrated that the English inclusion classifier is easily extendable to a new language in a relative short period of time and without having to
Page 1 and 2:
Automatic Detection of English Incl
Page 3 and 4:
these parsers with the annotation-f
Page 5 and 6:
Declaration I declare that this the
Page 7 and 8:
3.3.5 Post-processing Module . . .
Page 9 and 10:
A.2.2 Kappa Coefficient . . . . . .
Page 11 and 12:
5.6 Average relative token frequenc
Page 13 and 14:
3.16 Most frequent English inclusio
Page 15 and 16:
Chapter 1. Introduction 2 siderable
Page 17 and 18:
Chapter 1. Introduction 4 Chapter 3
Page 19 and 20:
Chapter 1. Introduction 6 1.1 Relat
Page 21 and 22:
Chapter 2. Background and Theory 8
Page 23 and 24:
Page 25 and 26:
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
Page 37 and 38:
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
Page 45 and 46:
Page 47 and 48:
Page 49 and 50:
Page 51 and 52:
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
Page 59 and 60:
Chapter 3 Tracking English Inclusio
Page 61 and 62:
Chapter 3. Tracking English Inclusi
Page 63 and 64:
Page 65 and 66:
Page 67 and 68:
Page 69 and 70:
Page 71 and 72:
Page 73 and 74:
Page 75 and 76: Chapter 3. Tracking English Inclusi
Page 113 and 114: Chapter 4 System Extension to a New
Page 115 and 116: Chapter 4. System Extension to a Ne
Page 125: Chapter 4. System Extension to a Ne
Page 129 and 130: Chapter 5 Parsing English Inclusion
Page 131 and 132: Chapter 5. Parsing English Inclusio
Page 159 and 160: Chapter 6 Other Potential Applicati
Page 161 and 162: Chapter 6. Other Potential Applicat
Page 177 and 178:
Chapter 6. Other Potential Applicat
Page 179 and 180:
Page 181 and 182:
Page 183 and 184:
Page 185 and 186:
Page 187 and 188:
Chapter 7 Conclusions and Future Wo
Page 189 and 190:
Chapter 7. Conclusions and Future W
Page 191 and 192:
Appendix A. Evaluation Metrics and
Page 193 and 194:
Page 195 and 196:
Page 197 and 198:
Page 199 and 200:
Appendix B. Guidelines for Annotati
Page 201 and 202:
Page 203 and 204:
Page 205 and 206:
Appendix C TIGER Tags and Labels C.
Page 207 and 208:
Appendix C. TIGER Tags and Labels 1
Page 209 and 210:
Appendix C. TIGER Tags and Labels 1
Page 211 and 212:
Bibliography 198 Andersen, G. (2005
Page 213 and 214:
Bibliography 200 Bresnan, J. (2001)
Page 215 and 216:
Bibliography 202 Damashek, M. (1995
Page 217 and 218:
Bibliography 204 Finkel, J., Dingar
Page 219 and 220:
Bibliography 206 Hachey, B., Alex,
Page 221 and 222:
Bibliography 208 Kirkness, A. (1984
Page 223 and 224:
Bibliography 210 and Technology (In
Page 225 and 226:
Bibliography 212 Poplack, S. (1988)
Page 227 and 228:
Bibliography 214 Sokol, D. K. (2000
Page 229:
Bibliography 216 Yang, W. (1990). A
show all

PhD thesis - School of Informatics - University of Edinburgh

Create successful ePaper yourself

Delete template?

Save as template?