PhD thesis - School of Informatics - University of Edinburgh
PhD thesis - School of Informatics - University of Edinburgh
PhD thesis - School of Informatics - University of Edinburgh
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
3.3.5 Post-processing Module . . . . . . . . . . . . . . . . . . . . 63<br />
3.3.6 Document Consistency Checking . . . . . . . . . . . . . . . 66<br />
3.3.7 Output . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66<br />
3.4 Evaluation and Analysis . . . . . . . . . . . . . . . . . . . . . . . . 68<br />
3.4.1 Evaluation <strong>of</strong> the Tool Output . . . . . . . . . . . . . . . . . 68<br />
3.4.2 Evaluation <strong>of</strong> Individual System Modules . . . . . . . . . . . 69<br />
3.4.3 Evaluation on Unseen Data . . . . . . . . . . . . . . . . . . . 80<br />
3.5 Parameter Tuning Experiments . . . . . . . . . . . . . . . . . . . . . 85<br />
3.5.1 Task-based Evaluation <strong>of</strong> Different POS taggers . . . . . . . . 85<br />
3.5.2 Task-based Evaluation <strong>of</strong> Different Search Engines . . . . . . 89<br />
3.6 Machine Learning Experiments . . . . . . . . . . . . . . . . . . . . . 92<br />
3.6.1 In-domain Experiments . . . . . . . . . . . . . . . . . . . . . 92<br />
3.6.2 Cross-domain Experiments . . . . . . . . . . . . . . . . . . . 95<br />
3.6.3 Learning Curve . . . . . . . . . . . . . . . . . . . . . . . . . 96<br />
3.7 Chapter Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98<br />
4 System Extension to a New Language 100<br />
4.1 Time Spent on System Extension . . . . . . . . . . . . . . . . . . . . 101<br />
4.2 French Development and Test Data Preparation . . . . . . . . . . . . 102<br />
4.3 System Module Conversion to French . . . . . . . . . . . . . . . . . 104<br />
4.3.1 Pre-processing Module . . . . . . . . . . . . . . . . . . . . . 105<br />
4.3.2 Lexicon Module . . . . . . . . . . . . . . . . . . . . . . . . 107<br />
4.3.3 Search Engine Module . . . . . . . . . . . . . . . . . . . . . 107<br />
4.3.4 Post-processing Module . . . . . . . . . . . . . . . . . . . . 108<br />
4.4 French System Evaluation . . . . . . . . . . . . . . . . . . . . . . . 109<br />
4.4.1 Evaluation on Test and Development Data . . . . . . . . . . . 109<br />
4.4.2 Evaluation <strong>of</strong> the Post-Processing Module . . . . . . . . . . . 112<br />
4.4.3 Consistency Checking . . . . . . . . . . . . . . . . . . . . . 112<br />
4.5 Chapter Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114<br />
5 Parsing English Inclusions 116<br />
5.1 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117<br />
5.2 Data Preparation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118<br />
5.2.1 Data Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119<br />
vi