Automatic Mapping Clinical Notes to Medical - RMIT University

More documents

Recommendations

Info

$journal of latex class files, vol. 6, no. 1, january ... - RMIT University$

2 Background 2.1 Pronoun Reference Resolution In this paper I will be focusing on the interpretation of unbound pronouns. Much has been written on this area of anaphora resolution and only a cursory overview is possible in this paper (see Hirst (1981) for a comprehensive overview of earlier work and Mitkov (2002) for an overview of more recent work). In this section, I will describe a generalized model of pronoun resolution and how salience plays a role in this process as well as discuss in some detail how salience is determined. When interpreting pronouns in discourse, readers search a list of previously evoked entities in memory. Following Karttunen (1976) and Heim (1982), I will call these discourse referents (or just referents, for short). The list of discourse referents is ranked according to salience. Two basic approaches may be taken to salience ranking: a categorical approach in which at most one referent is salient and all others are then, by definition, not salient; or a gradient approach in which referents are ranked along a salience continuum. In computational implementations of pronoun resolution algorithms, a gradient approach is often used, perhaps by necessity (cf., Lappin and Leass (1994)). However, psycholinguistic studies are often not so explicit about the approach taken and the results of most studies can be explained in terms of a categorical salience ranking. For instance, Gernsbacher and Hargreaves (1988) present a model of comprehension in which order-of-mention determines salience ranking, but their experimental evidence only compares first and second mentioned entities. In another case, Hudson-D’Zmura and Tanenhaus (1997) seek to verify the basic predictions of Centering Theory (Grosz and Sidner, 1986; Grosz et al., 1995), one aspect of which is a syntactic hierarchy: subjects > objects > others. However, their experimental evidence really only demonstrates a categorical ranking: subjects > others. The difference between the categorical and gradient approaches is important in the present study because if salience is categorical, then pronominal reference should show no preference among non-salient entities. On the other hand, if salience is gradient, then it should be possible to observe preferences even among non-salient (or perhaps more accurately here, less salient) entities. I should note however, that while it is clear that 90 those who take a gradient point of view must rule out a categorical point of view, I do not intend to imply that those studies that have a categorical approach (implied or otherwise) necessarily rule out a gradient approach. Many of those investigators may in fact be amenable to it. However, actual evidence of the necessity of a gradient approach in theory remains somewhat scarce. It is hoped that the present study will add to this evidence. 2.2 Computing Salience Returning then to the model of pronoun resolution, the list of referents is first pruned to remove incompatible referents based on morphosyntactic features (Arnold et al., 2000; Boland et al., 1998). Then search for a referent should proceed with respect to the ranking (either categorical or gradient) of the referents in the list. But what determines this ranking? One of the most dominant factors has been shown to be syntactic prominence. However, in Rose (2005), I have argued that (in English, at least) syntactic and semantic information are often conflated. I will therefore discuss both of these factors below. In addition, another significant factor is coherence relations, also discussed this further below. 2.2.1 Syntactic Prominence Several ways of determining the syntactic prominence of evoked entities have been discussed in the literature. These include left-to right ordering in which discourse referents introduced earlier are more prominent than those introduced later (Gernsbacher and Hargreaves, 1988), depth-first hierarchical search in which referents introduced higher and leftward in the syntactic tree are more prominent (Hobbs, 1978), and grammatical role in which referents introduced as subjects are more prominent than those introduced in other roles (Grosz et al., 1995). These different approaches typically make the same predictions when dealing with syntactically simpler constructions (i.e, no subordinate clauses or complex noun phrases), but may make different predictions with more complex constructions. The stimuli used in the experiment described below are all relatively syntactically simple and so in this paper I will not evaluate the differences among these various approaches. 1 1 See Rose (2005) for detailed discussion of these different approaches and a psycholinguistic and corpus linguistic comparison of hierarchical and grammatical role approaches.
However, for expository purposes, I will use grammatical role labels in discussion below. When ranking referents according to the grammatical role in which they have been introduced, a categorical ranking predicts that the referent introduced as subject is salient and that any other referent is non-salient. This is the point of view implicitly taken in Stevenson et al. (2000), for example, in which they argue that a pronoun should refer to the referent introduced as subject of the preceding utterance. 2 A gradient salience approach, however, requires a more detailed prominence hierarchy such as that in (2). This is the point of view taken in most studies using the Centering framework (Grosz and Sidner, 1986; Grosz et al., 1995). An even more detailed gradient salience approach may be taken in which the points on the prominence hierarchy carry different (relative) weights. This is the approach taken in many practical applications such as the pronoun resolution algorithm of Lappin and Leass (1994). (2) subject > object > oblique > others 2.2.2 Semantic Prominence In English, syntactic role and semantic role are often conflated. That is, syntactic subjects are often semantic agents while syntactic objects are often semantic patients, and so on. Thus, it could be that the kind of pronoun resolution preferences previously observed and usually attributed to syntactic prominence effects might actually be attributed to semantic prominence effects. In other words, perhaps subject-preference is actually agent-preference. In order to investigate this question, in Rose (2005), I used argument-reordering constructions: constructions which allow the order of arguments to vary—hence effecting a different relative syntactic prominence of discourse referents—with no (or minimal) change in their semantic role. For instance, so-called psychological-verbs have the alternate forms shown in (3)-(4). (3) The audience admired the acrobats. (4) The acrobats amazed the audience. 2 More precisely, Stevenson et al. (2000), using the Centering framework (Grosz and Sidner, 1986; Grosz et al., 1995), argue that the backward-looking center, Cb, should refer to the subject of the preceding utterance. This is a simplification of the original Centering proposal in which it was suggested that the Cb refer to the highest-ranking member of the set of forward-looking centers in the previous utterance which is realized in the current utterance. 91 In (3), AUDIENCE is realized in a more syntactically prominent position than is ACROBATS: that is, in subject position. The reverse is true in (4). On the other hand, the semantic roles remain the same in both: ACROBATS is the stimulus while AUDIENCE is the experiencer. If syntactic prominence is most important, then a subsequent pronoun they should pick out AUDIENCE in (3) and ACROBATS in (4). On the other hand, if semantic prominence is most important, then a subsequent pronoun should pick out the same discourse referent in both alternatives. Assuming experiencer is higher on a semantic prominence hierarchy than stimulus (cf., thematic hierarchies in Jackendoff (1972); Speas (1990); inter alia), then this would be AUDIENCE. In Rose (2005), I compared the effects of syntactic and semantic prominence on the salience of discourse referents in psycholinguistic experiments using two argument-reordering constructions: tough-constructions and spray/loadconstructions. Results show that both syntactic and semantic prominence contribute to the salience of discourse referents. This suggests that experiments of this sort should carefully control for both syntactic and semantic prominence. In the present experiment, I do so by using an argumentreordering construction for the test stimuli. 2.2.3 Coherence Relations Several investigators have theorized and observed that pronoun interpretation preferences differ when there is a causal connection between utterances compared to when there is a narrative connection (Hobbs, 1978; Kehler, 2002; Stevenson et al., 2000). For instance, in the narrative relation shown above in (1) (repeated below as (5)), the preference is for the pronoun to refer to LUKE. However, in (6) in which the utterances are related by a causal connection, the preference is for the pronoun to refer to MAX. (5) Lukei hit Maxj and then he i/#j ran home. (6) Lukei hit Maxj because he #i/j ran home. Therefore, when investigating pronoun resolution, it is also necessary to take into account the influence of coherence relations by either controlling for these relations or making them another point of investigation. In the present study, I will take the latter course of action in order to see how coherence relations might influence pronoun resolution to competing non-salient entities. Previ-
Page 1:
Proceeding of the Australasian Lang
Page 4 and 5:
Program Chairs: Committees Lawrence
Page 6 and 7:
Towards the Evaluation of Referring
Page 8 and 9:
The cat ate the hat that I made NP/
Page 10 and 11:
Figure 2: Cells affected by adding
Page 12 and 13:
were used because the accuracy for
Page 14 and 15:
TIME ACC COVER FAIL RATE % secs % %
Page 16 and 17:
model. We explore different estimat
Page 18 and 19:
S.S. Source Sense Source S.S. Acc.
Page 20 and 21:
acy. The difference in correctly as
Page 22 and 23:
Experiments with Sentence Classific
Page 24 and 25:
datasets with skewed distribution t
Page 26 and 27:
words produced by the feature selec
Page 28 and 29:
Sentence Class NB DT SVM Apology 1.
Page 30 and 31:
Computational Semantics in the Natu
Page 32 and 33:
S[sem = ] -> NP[sem=?subj] VP[sem=?
Page 34 and 35:
Any string which cannot be decompos
Page 36 and 37:
satisfy(Formula,model(D,F),G,pos):n
Page 38 and 39:
Classifying Speech Acts using Verba
Page 40 and 41:
The principles are dichotomous —
Page 42 and 43:
ance. Several additional example ut
Page 44 and 45:
our results. In classifying only th
Page 46 and 47: Word Relatives in Context for Word
Page 48 and 49: create a collection of training exa
Page 50 and 51: Query Sense The nave was rebuilt in
Page 52 and 53: Algorithm Avg S2LS Avg S3LS Avg S2L
Page 54 and 55: B. Snyder and M. Palmer. 2004. The
Page 56 and 57: it is even used as a black box that
Page 58 and 59: ecognition in QA, so we will not us
Page 60 and 61: BPER ILOC IPER BLOC BLOC BDATE BLOC
Page 62 and 63: of noise introduced by the addition
Page 64 and 65: 2 Existing annotated corpora Much o
Page 66 and 67: Our FUSE|tel spectrum of HD|sta 738
Page 68 and 69: N Word Correct Tagged 23 OH mol non
Page 70 and 71: L ATEX typesetting information into
Page 72 and 73: in English, neither the determiner
Page 74 and 75: con 4 (Sagot et al., 2006) and Morp
Page 76 and 77: Feature type Positions/description
Page 78 and 79: Miriam Butt, Helge Dyvik, Tracy Hol
Page 80 and 81: ently mostly solved by employing hu
Page 82 and 83: Figure 1: Augmented SCT Lexicon Tok
Page 84 and 85: medical terms can be composed by ad
Page 86 and 87: References Aronson, A. R. (2001). E
Page 88 and 89: technique to most question types, i
Page 90 and 91: User Question Analyser QA System Se
Page 92 and 93: Figure 2: Precision Figure 3: Cover
Page 94 and 95: Analysis and Review. Springer-Verla
Page 98 and 99: ous accounts of the effects of cohe
Page 100 and 101: Table 1: Overall Results for Object
Page 102 and 103: References Jennifer Arnold, Janet E
Page 104 and 105: In order for appropriate documents
Page 106 and 107: y Michael West. He found that his t
Page 108 and 109: low-scoring documents are “Click
Page 110 and 111: a) b) You must complete at least 9
Page 112 and 113: uct). In this paper, we will only c
Page 114 and 115: indicate the categories of substruc
Page 116 and 117: Argument lowering Dependent geach
Page 118 and 119: who [u : np] WH(np, s, s/?gq) λP.
Page 120 and 121: algorithms, and report the performa
Page 122 and 123: 2.3 Coverage of the Human Data Out
Page 124 and 125: and minimal subsets of the data. Of
Page 126 and 127: output. Finally, and related to the
Page 128 and 129: identifies, to avoid overwhelming t
Page 130 and 131: y the student is first parsed, usin
Page 132 and 133: a given word sequence Sorig as a pe
Page 134 and 135: A problem arises with this scheme w
Page 136 and 137: Example 1: Original Headline: Europ
Page 138 and 139: |relations(sentence1) ∩ relations
Page 140 and 141: 2685 True False 1275 Bleu Dep Bleu
Page 142 and 143: Features Acc. C1-prec. C1-recall C1
Page 144 and 145: Given the lack of research in VSD u
Page 146 and 147:
ased features: Type 1 Original sibl
Page 148 and 149:
Preposition of the adjunct semantic
Page 150 and 151:
Algorithm 1 The algorithm of combin
Page 152 and 153:
References Timothy Baldwin and Fran
Page 154 and 155:
dependencies in German with respect
Page 156 and 157:
wij moeten dit probleem aanpakken w
Page 158 and 159:
a brevity penalty of BLEU n = 1 n =
Page 160 and 161:
plicitly matching the syntax of the
Page 162 and 163:
Probabilities improve stress-predic
Page 164 and 165:
Towards Cognitive Optimisation of a
Page 166 and 167:
Natural Language Processing and XML
Page 168 and 169:
Extracting Patient Clinical Profile
show all

Automatic Mapping Clinical Notes to Medical - RMIT University

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?