
formation was determined using the Connexor parser. This information was used to identify the main utterance subject where dependency information was not available.

• Morphology: Morphological tags, also generated by Connexor, were used to distinguish between first and third person pronouns, as well as between singular and plural forms of first person pronouns. Additionally, we used morphological tags from Connexor to identify imperative verbs.

• Hand-constructed word lists: Several of the features used relate to closed sets of common lexical items (e.g., verbs of permission, interrogative words, variations of “yes” and “no”). For these features, we employ simple hand-constructed lists, using online thesauri to expand our lists from an initial set of seed words. While some of the lists are not exhaustive, they seem to help our results and involved only a small amount of effort; none took more than an hour to construct.
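As a minimal illustration of such closed-set features (the seed words and method names below are hypothetical examples, not the paper's actual lists):

```java
import java.util.List;
import java.util.Set;

public class WordListFeatures {
    // Illustrative seed lists only; the paper's lists are hand-built
    // and expanded with online thesauri.
    static final Set<String> INTERROGATIVES =
            Set.of("who", "what", "when", "where", "why", "how", "which");
    static final Set<String> YES_NO_VARIANTS =
            Set.of("yes", "yeah", "yep", "no", "nope", "nah");

    // Binary feature: does the utterance contain any word from a closed set?
    static boolean hasWordFrom(List<String> tokens, Set<String> lexicon) {
        for (String t : tokens) {
            if (lexicon.contains(t.toLowerCase())) {
                return true;
            }
        }
        return false;
    }
}
```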

Feature                                        Likely VRM
1st person singular subject                    D,Q
1st person plural subject                      D,C
3rd person subject                             E,Q
2nd person subject                             A,Q,I,R
Inverted subject-verb order                    Q
Imperative verb                                A
Verbs of permission, prohibition, obligation   A
Interrogative words                            Q
Non-lexical content                            K
Yes/No variants                                K
Terms of evaluation                            I
Utterance length                               all
First word                                     all
Last token                                     all
Bi-grams                                       all

Table 5: Features used in VRM Classifier

6 Results<br />

Our classification results using several different learning algorithms and variations in feature sets are summarised in Table 6. We experimented with using only the linguistic features suggested by Stiles, using only the additional features we identified, and using a combination of all features shown in Table 5. All our results were validated using stratified 10-fold cross-validation.

We used supervised learning methods implemented in Weka (Witten and Frank, 2005) to train our classifier. Through experimentation, we found that Weka’s Support Vector Machine implementation (SMO) provided the best classification performance. Encouragingly, other relatively simple approaches, such as a Bayesian Network classifier using the K2 hill-climbing search algorithm, also performed reasonably well.
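A minimal sketch of how such an experiment can be run through Weka's Java API (the ARFF file name and random seed are illustrative; the paper does not describe its exact invocation):

```java
import weka.classifiers.Evaluation;
import weka.classifiers.functions.SMO;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

import java.util.Random;

public class VrmSvmExperiment {
    public static void main(String[] args) throws Exception {
        // Load the utterance feature vectors (hypothetical file name).
        Instances data = DataSource.read("vrm-features.arff");
        // The VRM category is assumed to be the last attribute.
        data.setClassIndex(data.numAttributes() - 1);

        // Weka's SMO support vector machine implementation.
        SMO svm = new SMO();

        // 10-fold cross-validation; Weka stratifies the folds
        // automatically for nominal class attributes.
        Evaluation eval = new Evaluation(data);
        eval.crossValidateModel(svm, data, 10, new Random(1));

        System.out.printf("Accuracy: %.2f%%%n", eval.pctCorrect());
        System.out.printf("Mean absolute error: %.3f%n", eval.meanAbsoluteError());
    }
}
```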

The baseline against which we compare our classifier’s performance is a OneR (one rule) classifier using an identical feature set. This baseline system is a one-level decision tree; that is, it classifies using rules that test only the single most discriminative feature. As shown in Table 6, the accuracy of this baseline varies from 42.76% to 49.27%, depending on the exact features used. Regardless of the features or algorithm used, our classifier performs significantly better than the baseline system.
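For comparison, the baseline can be evaluated with the same cross-validation wiring; a sketch using Weka's OneR implementation (again with an illustrative file name):

```java
import weka.classifiers.Evaluation;
import weka.classifiers.rules.OneR;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

import java.util.Random;

public class OneRBaseline {
    public static void main(String[] args) throws Exception {
        Instances data = DataSource.read("vrm-features.arff"); // hypothetical file name
        data.setClassIndex(data.numAttributes() - 1);

        // OneR induces a one-level decision tree over the single most
        // discriminative feature, giving a simple accuracy floor.
        OneR baseline = new OneR();
        Evaluation eval = new Evaluation(data);
        eval.crossValidateModel(baseline, data, 10, new Random(1));

        System.out.printf("Baseline accuracy: %.2f%%%n", eval.pctCorrect());
    }
}
```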

Algorithm    Feature Set     Accuracy   Mean Abs Error
SVM          All             79.75%     0.19
SVM          Only Stiles’    60.82%     0.20
SVM          No Stiles’      74.49%     0.19
Bayes Net    All             78.51%     0.06
Bayes Net    Only Stiles’    60.16%     0.12
Bayes Net    No Stiles’      75.68%     0.07
Baseline     All             49.27%     0.36
Baseline     Only Stiles’    49.27%     0.36
Baseline     No Stiles’      42.76%     0.38

Table 6: VRM classifier results

Another tunable parameter was the level of pruning of n-grams from our feature set according to their frequency of occurrence. Heuristically, we determined that a cut-off of 5 (i.e., only n-grams that occur five or more times in our corpus of utterances were included as features) gave us the highest accuracy for the learning algorithms tested.
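The cut-off amounts to a simple frequency filter over n-gram counts; a hypothetical sketch of that step (the paper does not describe its feature-extraction code):

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class NgramPruning {
    // Count bi-grams over a corpus of tokenised utterances and keep only
    // those occurring at least minCount times (the paper's cut-off is 5).
    static List<String> pruneBigrams(List<List<String>> utterances, int minCount) {
        Map<String, Integer> counts = new HashMap<>();
        for (List<String> tokens : utterances) {
            for (int i = 0; i + 1 < tokens.size(); i++) {
                counts.merge(tokens.get(i) + " " + tokens.get(i + 1), 1, Integer::sum);
            }
        }
        List<String> kept = new ArrayList<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            if (e.getValue() >= minCount) {
                kept.add(e.getKey());
            }
        }
        return kept;
    }
}
```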

7 Discussion<br />

This work appears to be the first attempt to automatically classify utterances according to their literal meaning with VRM categories. There are thus no direct comparisons to be easily drawn for
