Designing an Anaphora Resolution Algorithm for Route Instructions

discourse deictic anaphors Precision was 63.6% and Recall 70%. One of the most common classification errors was that discourse deictic or vague anaphors were classified as individual anaphors because an individual antecedent was available.

2.3.3 The PHORA-Algorithm

In this section, I describe the PHORA-algorithm developed by Byron (2002), which uses salience calculations as criteria to resolve individual anaphors but also takes advantage of the semantic constraints that the predicative context of the anaphor places on its antecedent. As discourse deictic anaphors usually refer to less salient abstract entities, semantic constraints can be used to find their referents. Byron makes use of the observation that pronouns which do not refer to the most salient item are typically constrained by their context: the context makes the anaphor incompatible with the more salient referent while indicating the intended one. Taking semantic constraints into account therefore helps pronoun resolution algorithms to identify the referents of discourse deictic anaphors, a task that is generally judged to be difficult and has been excluded from most pronoun resolution studies until now.

Following Webber (1991), who suggests that each discourse unit (context) has a ‘pseudo-DE’ that ‘stands proxy’ for its propositional context, the first action of the algorithm is to build up the discourse entities and the discourse proxies for the current discourse unit (DU_n). To resolve a pronoun in the following discourse unit (DU_n+1), the algorithm first calculates the most general semantic type (T) that satisfies the constraints of the predicative context of the pronoun. Then, the algorithm checks the discourse entities in salience order to find a referent that matches the features of the pronoun. Depending on the type of anaphor (either personal or demonstrative pronoun), the algorithm uses different search orders.
Finally, each discourse entity is tested against the type constraint. Every entity that matches the type constraint of the anaphor (i.e. its semantic type is either T itself or a subtype of T) is a possible referent. In general, only one possible referent is left after testing the discourse entities.
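The resolution step described above can be illustrated with a minimal Python sketch. It is not Byron's implementation: the toy type hierarchy, the `Entity` class and the `resolve` function are hypothetical names introduced here, and the concrete search orders (personal pronouns try mentioned entities first, demonstrative pronouns try activated entities first) are a simplifying assumption. The sketch only shows how a type constraint T, an agreement check and a salience-ordered search combine to pick the first compatible referent.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical toy hierarchy (child -> parent); NOT Byron's actual type system.
SUPERTYPE = {
    "ROUTE": "ABSTRACT",
    "EVENT": "ABSTRACT",
    "ENGINE": "PHYSICAL",
    "PERSON": "PHYSICAL",
    "ABSTRACT": "ANY",
    "PHYSICAL": "ANY",
}


def is_subtype(candidate: str, required: str) -> bool:
    """True if candidate equals required or lies below it in the hierarchy."""
    current: Optional[str] = candidate
    while current is not None:
        if current == required:
            return True
        current = SUPERTYPE.get(current)
    return False


@dataclass
class Entity:
    name: str
    number: str    # "sg" or "pl"
    sem_type: str  # a key or value of SUPERTYPE


def resolve(pronoun_kind: str, number: str, required_type: str,
            mentioned: List[Entity], activated: List[Entity]) -> Optional[Entity]:
    """Return the first entity that agrees in number and satisfies the type constraint.

    `mentioned` is assumed to be sorted by salience already. The search
    orders below are an assumed simplification of the real ones.
    """
    if pronoun_kind == "personal":
        order = mentioned + activated
    else:  # demonstrative
        order = activated + mentioned
    for entity in order:
        if entity.number == number and is_subtype(entity.sem_type, required_type):
            return entity
    return None


# Illustrative data only.
mentioned = [Entity("engine E1", "sg", "ENGINE"), Entity("the driver", "sg", "PERSON")]
activated = [Entity("proxy for the previous clause", "sg", "EVENT")]

# A demonstrative whose context requires an abstract object picks the proxy.
print(resolve("demonstrative", "sg", "ABSTRACT", mentioned, activated))
# A personal pronoun whose context requires a physical object picks the most salient match.
print(resolve("personal", "sg", "PHYSICAL", mentioned, activated))
```

Because the first match wins, the order in which the two lists are concatenated decides the outcome whenever more than one entity satisfies the constraint.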
The discourse model of the PHORA-algorithm contains two kinds of entities: mentioned entities and activated entities. Mentioned entities are referred to by noun phrases (e.g. proper names, descriptive noun phrases, demonstrative, personal, possessive and reflexive noun phrases). Entities referred to by NPs are inserted into the discourse model as soon as the corresponding NP is interpreted. By default, the left-most noun phrase in each clause is the ‘focused’ mentioned entity of the current discourse unit. All discourse entities have the following set of attributes: input (linguistic constituent), number (singular or plural), type (e.g. PERSON or ENGINE), composition (heterogeneous or homogeneous), specificity (individual or kind), and interpretation (referent or proxy associated with this discourse entity).

Activated entities are proxies for non-noun-phrase referents such as entire sentences, nominals, clauses etc. They are evoked after each clause is interpreted, and one clause can trigger multiple proxies. While mentioned entities remain in the discourse model for the complete discourse, activated entities only remain in the discourse model until the interpretation of the next clause is finished. In contrast to Eckert & Strube (2000), discourse deictic reference evokes a mentioned entity, which is treated like any other mentioned entity and therefore remains in the discourse model for the entire discourse.

Semantic type constraints for the pronouns are determined at the beginning of the pronoun's resolution process. The criteria resemble Eckert & Strube’s list of A- and I-Incompatibility, except that the set of types is more complex. Byron’s algorithm does not only distinguish between individuals and abstract objects: its type system is so detailed that equative constructions like “that’s a good route” constrain the anaphor to be an entity of the type ROUTE.
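The structure of this discourse model can be sketched as a small Python data structure. This is an illustration under stated assumptions, not Byron's implementation: the class and method names (`DiscourseEntity`, `DiscourseModel`, `mention`, `end_clause`) are hypothetical, the attribute names paraphrase the attribute list above, and updating the focus only when a clause is finished is a simplification.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DiscourseEntity:
    constituent: str                      # the "input" attribute: linguistic constituent
    number: str                           # "sg" or "pl"
    sem_type: str                         # e.g. "PERSON", "ENGINE", "ROUTE"
    composition: str                      # "homogeneous" or "heterogeneous"
    specificity: str                      # "individual" or "kind"
    interpretation: Optional[str] = None  # referent or proxy tied to this entity


class DiscourseModel:
    """Hypothetical container separating mentioned from activated entities."""

    def __init__(self) -> None:
        self.mentioned: List[DiscourseEntity] = []  # kept for the whole discourse
        self.activated: List[DiscourseEntity] = []  # proxies of the last finished clause
        self.focus: Optional[DiscourseEntity] = None
        self._leftmost_np: Optional[DiscourseEntity] = None

    def mention(self, entity: DiscourseEntity) -> None:
        """Insert an NP entity as soon as its noun phrase is interpreted."""
        self.mentioned.append(entity)
        if self._leftmost_np is None:
            self._leftmost_np = entity  # first NP of the clause = left-most NP

    def end_clause(self, proxies: List[DiscourseEntity]) -> None:
        """Finish a clause: set the focus and replace the old proxies with new ones."""
        # Assumption: a clause without any NP leaves the focus empty.
        self.focus = self._leftmost_np
        self._leftmost_np = None
        # Proxies evoked by the previous clause expire here.
        self.activated = list(proxies)


dm = DiscourseModel()
engine = DiscourseEntity("engine E1", "sg", "ENGINE", "homogeneous", "individual")
dm.mention(engine)
proxy = DiscourseEntity("clause 1", "sg", "EVENT", "homogeneous", "individual")
dm.end_clause([proxy])   # the proxy is available while clause 2 is interpreted
dm.end_clause([])        # once clause 2 is finished, the proxy is gone
print(len(dm.mentioned), len(dm.activated))
```

In this sketch a discourse deictic reference would be recorded by calling `mention` with a new entity whose `interpretation` points to the proxy, which is why it survives after the proxy itself has expired.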
Since the first referent that matches the semantic type constraints and the agreement features of the pronoun is the preferred referent, the search order of the algorithm plays an important role. The algorithm therefore employs different search orders for personal and demonstrative pronouns, which reflects the fact that demonstrative pronouns refer more often to activated (non-salient) entities whilst personal pronouns usually refer to mentioned entities (cf.
