Automatic Mapping Clinical Notes to Medical - RMIT University

More documents

Recommendations

Info

$journal of latex class files, vol. 6, no. 1, january ... - RMIT University$

TIME ACC COVER FAIL RATE % secs % % % n ≤ 40 n > 40 Original 91.9 — 86.92 99.29 0.621 1.961 punctuation + repair 63.5 30.9 86.89 99.58 0.399 0.654 Table 2: Parsing performance on Section 23 with constraints and chart repair. tempts. This allows us to further exploit the tight integration of the supertagger and parser by adding one lexical category at a time until a parse of the sentence is found. Chart repair alone gives an 11% improvement in speed. Constraints improve efficiency by avoiding the construction of sub-derivations that will not be used. They have a significant impact on parsing speed and coverage without reducing the accuracy, provided the constraints are identified with sufficient precision. When both methods are used the speed increases by 30-35%, the failure rate decreases by 40-65%, both for sentences of length 1-40 and 41+, while the accuracy is not decreased. The result is an even faster state-of-the-art widecoverage CCG parser. 12 Acknowledgments We would like to thank the anonymous reviewers for their feedback. This research was funded under Australian Research Council Discovery grants DP0453131 and DP0665973. References Srinivas Bangalore and Aravind Joshi. 1999. Supertagging: An approach to almost parsing. Computational Linguistics, 25(2):237–265. Adam L. Berger, Stephen Della Pietra, and Vincent J. Della Pietra. 1996. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1):39–71. Fabio Ciravegna, Alberto Lavelli, and Giorgio Satta. 1997. Efficient full parsing for Information Extraction. In Proceedings of the Meeting of the Working Groups on Automatic Learning and Natural Language of the Associazione Italiana per l’Intelligenza Artificale (AI*IA), Turin, Italy. Stephen Clark and James R. Curran. 2004. The importance of supertagging for wide-coverage CCG parsing. In 20th International Conference on Computational Linguistics, pages 282–288, Geneva, Switzerland. Stephen Clark and James R. Curran. 2006. Wide-coverage statistical parsing with CCG and log-linear models. (submitted). 8 Stephen Clark. 2002. A supertagger for Combinatory Categorial Grammar. In Proceedings of the TAG+ Workshop, pages 19–24, Venice, Italy. Ronald Cole, Joseph Mariani, Hans Uszkoreit, Annie Zaenen, and Victor Zue. 1997. Survey of the state of the art in human language technology. Cambridge University Press, New York, NY, USA. Jay Earley. 1970. An efficient context-free parsing algorithm. Communications of the ACM, 13(2):94–102. Ralph Grishman. 1997. Information Extraction: Techniques and challenges. In SCIE ’97: International Summer School on Information Extraction, pages 10–27, London, UK. Springer-Verlag. Julia Hockenmaier and Mark Steedman. 2006. CCGbank - a corpus of CCG derivations and dependency structures extracted from the Penn Treebank. (submitted). Dan Klein and Christopher D. Manning. 2003. A* parsing: Fast exact Viterbi parse selection. In Proceedings of Human Language Technology and the North American Chapter of the Association for Computational Linguistics Conference, pages 119–126, Edmond, Canada. Klass Sikkel and Anton Nijholt. 1997. Parsing of Context- Free languages. In Grzegorz Rozenberg and Arto Salomaa, editors, Handbook of Formal Languages, Volume 2: Linear Modelling: Background and Application, pages 61–100. Springer-Verlag, New York. Mark Steedman. 2000. The Syntactic Process. The MIT Press, Cambridge, MA. K. Vijay-Shanker and David J. Weir. 1990. Polynomial time parsing of combinatory categorial grammars. In Proceedings of the 28th Annual Meeting on Association for Computational Linguistics, pages 1–8, Pittsburgh, Pennsylvania. Daniel H. Younger. 1967. Recognition and parsing of contex-free languages in time n 3 . Information and Control, 10(2):189–208.
Improved Default Sense Selection for Word Sense Disambiguation Tobias HAWKER and Matthew HONNIBAL Language Technology Research Group School of Information Technologies University of Sydney {toby,mhonn}@it.usyd.edu.au Abstract small, and most words will have only a few examples. The low inter-annotator agreement exac- Supervised word sense disambiguation erbates this problem, as the already small samples has proven incredibly difficult. Despite are thus also somewhat noisy. These difficulties significant effort, there has been little suc- mean that different sense frequency heuristics can cess at using contextual features to accu- significantly change the performance of a ‘baserately assign the sense of a word. Instead, line’ system. In this paper we discuss several such few systems are able to outperform the de- heuristics, and find that most outperform the comfault sense baseline of selecting the highmonly used first-sense strategy, one by as much as est ranked WordNet sense. In this pa- 1.3%. per, we suggest that the situation is even worse than it might first appear: the highest ranked WordNet sense is not even the best default sense classifier. We evaluate several default sense heuristics, using supersenses and SemCor frequencies to achieve significant improvements on the WordNet ranking strategy. The sense ranks in WordNet are derived from semantic concordance texts used in the construction of the database. Most senses have explicit counts listed in the database, although sometimes the counts will be reported as 0. In these cases, the senses are presumably ranked by the lexicographer’s intuition. Usually these counts are higher than the frequency of the sense in the SemCor 1 Introduction sense-tagged corpus (Miller et al., 1993), although not always. This introduces the first alternative Word sense disambiguation is the task of select- heuristic: using SemCor frequencies where availing the sense of a word intended in a usage conable, and using the WordNet sense ranking when text. This task has proven incredibly difficult: there are no examples of the word in SemCor. We in the SENSEVAL 3 all words task, only five of find that this heuristic performs significantly better the entered systems were able to use the contex- than the first-sense strategy. tual information to select word senses more ac- Increasing attention is also being paid to coarse curately than simply selecting the sense listed as grained word senses, as it is becoming obvious most likely in WordNet (Fellbaum, 1998). Suc- that WordNet senses are too fine grained (Hovy cessful WSD systems mostly fall back to the first- et al., 2006). Kohomban and Lee (2005) explore sense strategy unless the system was very confi- finding the most general hypernym of the sense dent in over-ruling it (Hoste et al., 2001). being used, as a coarser grained WSD task. Sim- Deciding which sense of a word is most likely, ilarly, Ciaramita and Altun (2006) presents a sys- irrespective of its context, is therefore a crucial tem that uses sequence tagging to assign ‘super- task for word sense disambiguation. The decision senses’ — lexical file numbers — to words. Both is complicated by the high cost (Chklovski and of these systems compare their performance to a Mihalcea, 2002) and low inter-annotator agree- baseline of selecting the coarse grained parent of ment (Snyder and Palmer, 2004) of sense-tagged the first-ranked fine grained sense. Ciaramita and corpora. The high cost means that the corpora are Altun also use this baseline as a feature in their Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), pages 9–15. 9
Page 1: Proceeding of the Australasian Lang
Page 4 and 5: Program Chairs: Committees Lawrence
Page 6 and 7: Towards the Evaluation of Referring
Page 8 and 9: The cat ate the hat that I made NP/
Page 10 and 11: Figure 2: Cells affected by adding
Page 12 and 13: were used because the accuracy for
Page 16 and 17: model. We explore different estimat
Page 18 and 19: S.S. Source Sense Source S.S. Acc.
Page 20 and 21: acy. The difference in correctly as
Page 22 and 23: Experiments with Sentence Classific
Page 24 and 25: datasets with skewed distribution t
Page 26 and 27: words produced by the feature selec
Page 28 and 29: Sentence Class NB DT SVM Apology 1.
Page 30 and 31: Computational Semantics in the Natu
Page 32 and 33: S[sem = ] -> NP[sem=?subj] VP[sem=?
Page 34 and 35: Any string which cannot be decompos
Page 36 and 37: satisfy(Formula,model(D,F),G,pos):n
Page 38 and 39: Classifying Speech Acts using Verba
Page 40 and 41: The principles are dichotomous —
Page 42 and 43: ance. Several additional example ut
Page 44 and 45: our results. In classifying only th
Page 46 and 47: Word Relatives in Context for Word
Page 48 and 49: create a collection of training exa
Page 50 and 51: Query Sense The nave was rebuilt in
Page 52 and 53: Algorithm Avg S2LS Avg S3LS Avg S2L
Page 54 and 55: B. Snyder and M. Palmer. 2004. The
Page 56 and 57: it is even used as a black box that
Page 58 and 59: ecognition in QA, so we will not us
Page 60 and 61: BPER ILOC IPER BLOC BLOC BDATE BLOC
Page 62 and 63: of noise introduced by the addition
Page 64 and 65:
2 Existing annotated corpora Much o
Page 66 and 67:
Our FUSE|tel spectrum of HD|sta 738
Page 68 and 69:
N Word Correct Tagged 23 OH mol non
Page 70 and 71:
L ATEX typesetting information into
Page 72 and 73:
in English, neither the determiner
Page 74 and 75:
con 4 (Sagot et al., 2006) and Morp
Page 76 and 77:
Feature type Positions/description
Page 78 and 79:
Miriam Butt, Helge Dyvik, Tracy Hol
Page 80 and 81:
ently mostly solved by employing hu
Page 82 and 83:
Figure 1: Augmented SCT Lexicon Tok
Page 84 and 85:
medical terms can be composed by ad
Page 86 and 87:
References Aronson, A. R. (2001). E
Page 88 and 89:
technique to most question types, i
Page 90 and 91:
User Question Analyser QA System Se
Page 92 and 93:
Figure 2: Precision Figure 3: Cover
Page 94 and 95:
Analysis and Review. Springer-Verla
Page 96 and 97:
2 Background 2.1 Pronoun Reference
Page 98 and 99:
ous accounts of the effects of cohe
Page 100 and 101:
Table 1: Overall Results for Object
Page 102 and 103:
References Jennifer Arnold, Janet E
Page 104 and 105:
In order for appropriate documents
Page 106 and 107:
y Michael West. He found that his t
Page 108 and 109:
low-scoring documents are “Click
Page 110 and 111:
a) b) You must complete at least 9
Page 112 and 113:
uct). In this paper, we will only c
Page 114 and 115:
indicate the categories of substruc
Page 116 and 117:
Argument lowering Dependent geach
Page 118 and 119:
who [u : np] WH(np, s, s/?gq) λP.
Page 120 and 121:
algorithms, and report the performa
Page 122 and 123:
2.3 Coverage of the Human Data Out
Page 124 and 125:
and minimal subsets of the data. Of
Page 126 and 127:
output. Finally, and related to the
Page 128 and 129:
identifies, to avoid overwhelming t
Page 130 and 131:
y the student is first parsed, usin
Page 132 and 133:
a given word sequence Sorig as a pe
Page 134 and 135:
A problem arises with this scheme w
Page 136 and 137:
Example 1: Original Headline: Europ
Page 138 and 139:
|relations(sentence1) ∩ relations
Page 140 and 141:
2685 True False 1275 Bleu Dep Bleu
Page 142 and 143:
Features Acc. C1-prec. C1-recall C1
Page 144 and 145:
Given the lack of research in VSD u
Page 146 and 147:
ased features: Type 1 Original sibl
Page 148 and 149:
Preposition of the adjunct semantic
Page 150 and 151:
Algorithm 1 The algorithm of combin
Page 152 and 153:
References Timothy Baldwin and Fran
Page 154 and 155:
dependencies in German with respect
Page 156 and 157:
wij moeten dit probleem aanpakken w
Page 158 and 159:
a brevity penalty of BLEU n = 1 n =
Page 160 and 161:
plicitly matching the syntax of the
Page 162 and 163:
Probabilities improve stress-predic
Page 164 and 165:
Towards Cognitive Optimisation of a
Page 166 and 167:
Natural Language Processing and XML
Page 168 and 169:
Extracting Patient Clinical Profile
show all

Automatic Mapping Clinical Notes to Medical - RMIT University

Create successful ePaper yourself

Delete template?

Save as template?