
Unsupervised Estimation of Word Usage Similarity

Marco Lui,♠♥ Timothy Baldwin,♠♥ and Diana McCarthy♣

♠ NICTA Victoria Research Laboratory
♥ Dept of Computing and Information Systems, The University of Melbourne
♣ Dept of Theoretical and Applied Linguistics, University of Cambridge

mhlui@unimelb.edu.au, tb@ldwin.net, diana@dianamccarthy.co.uk

Abstract

We present a method to estimate word use similarity independent of an external sense inventory. This method utilizes a topic-modelling approach to compute the similarity in usage of a single word across a pair of sentences, and we evaluate our method in terms of its ability to reproduce a human-annotated ranking over sentence pairs. We find that our method outperforms a bag-of-words baseline, and that for certain words there is very strong correlation between our method and human annotators. We also find that lemma-specific models do not outperform general topic models, despite the fact that results with the general model vary substantially by lemma. We provide a detailed analysis of the results, and identify open issues for future research.

1 Introduction

Automated Word Usage Similarity (Usim) is the task of determining the similarity in use of a particular word across a pair of sentences. It is related to the tasks of word sense disambiguation (WSD) and word sense induction (WSI), but differs in that Usim does not pre-suppose a predefined sense inventory. It also captures the fact that word senses may not always be distinct, and that the applicability of word senses is not necessarily mutually exclusive. In Usim, we consider pairs of sentences at a time, and quantify the similarity of the sense of the target word being used in each sentence. An example of a sentence pair (SPAIR) using similar but not identical senses of the word dry is given in Figure 1.

Usim is a relatively new NLP task, partly due to the lack of resources for its evaluation. Erk et al. (2009) recently produced a corpus of sentence pairs annotated for usage similarity judgments, allowing Usim to be formulated as a distinct task from the related tasks of word sense disambiguation and word sense induction.

    Part c) All this has been a little dry so far: now for some fun.

    For people who knew him, it was typical of his dry humor, but some in the audience thought he was tipsy.

Figure 1: Example of an SPAIR judged by annotators to use similar but not identical senses of the word dry.

In this work, we propose a method to estimate word usage similarity in an entirely unsupervised fashion through the use of a topic model. We make use of the well-known Latent Dirichlet Allocation (LDA) model (Blei et al., 2003) to model the distribution of topics in a sentence, then examine the similarity between sentences on the basis of the similarity between their topic distributions.
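The comparison step can be sketched as follows. Here the per-sentence topic distributions are hypothetical stand-ins for what an LDA inference step would produce, and the choice of Jensen-Shannon divergence as the distance between distributions is an illustrative assumption, not necessarily the paper's exact configuration:

```python
import math

def kl(p, q):
    """Kullback-Leibler divergence KL(p || q) for discrete distributions (log base 2)."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric and bounded in [0, 1] with log base 2."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def usage_similarity(topics_s1, topics_s2):
    """Similarity of two usages as 1 minus the JSD of their sentence topic distributions."""
    return 1.0 - js_divergence(topics_s1, topics_s2)

# Hypothetical topic distributions inferred by an LDA model for the two
# sentences of a SPAIR (values are illustrative, not from the paper):
s1 = [0.7, 0.2, 0.1]
s2 = [0.6, 0.3, 0.1]
print(round(usage_similarity(s1, s2), 3))
```

Under this formulation, identical topic distributions score 1.0 and distributions with disjoint support score 0.0, giving a bounded similarity that can be ranked against the human annotations.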

Our main contributions in this work are: (1) we introduce a method to compute word usage similarity in an unsupervised setting based on topic modelling; (2) we show that our method performs better than the bag-of-words modelling approach; (3) we find that each lemma has a distinct optimum parametrization of the approach that does not generalize across parts of speech; and (4) we demonstrate empirically that per-lemma topic models do not perform differently from global topic models.
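For concreteness, a bag-of-words comparison of the kind the topic-model method is evaluated against can be sketched as cosine similarity over word-count vectors; the whitespace tokenization and raw-count weighting here are simplifying assumptions, not the paper's exact baseline:

```python
import math
from collections import Counter

def bow_cosine(sent1, sent2):
    """Cosine similarity between simple bag-of-words count vectors."""
    a = Counter(sent1.lower().split())
    b = Counter(sent2.lower().split())
    dot = sum(a[w] * b[w] for w in set(a) & set(b))
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

# The two sentences of the Figure 1 SPAIR (punctuation stripped):
s1 = "all this has been a little dry so far now for some fun"
s2 = "for people who knew him it was typical of his dry humor"
print(round(bow_cosine(s1, s2), 3))
```

Because such a baseline only sees surface word overlap, two usages of dry in lexically disjoint contexts score near zero even when their senses are close, which is the weakness the topic-distribution comparison is intended to address.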

2 Background

Polysemy is a linguistic phenomenon whereby the same word has different meanings depending on the context in which it is used. For example, the use of the word charge in the phrase charge a battery is different from its use in the phrase charge a hill, and also distinct from its use in charge in court.

Marco Lui, Timothy Baldwin and Diana McCarthy. 2012. Unsupervised Estimation of Word Usage Similarity. In Proceedings of Australasian Language Technology Association Workshop, pages 33–41.
