file - ChaSen - 奈良先端科学技術大学院大学

More documents

Recommendations

Info

expects to be evaluated in the answer.This chapter introduces a leading study on question-answering that expectsdescriptive answers and another leading study on the classification based on thediscursive features of the descriptive answer. Then there follows a report onan experiment automatically categorizing descriptive answers from actual Q&Aarticles in a Web service based on an analysis of discursive features.In the following section, I introduce related work of descriptive answers. Section4.3 presents the result of description type annotation to answers in actualweb question-answering service. Subsequently, Section 4.4 describes the resultof categorization based on description type of answer. Using machine learning,I explore feasibility of automatic categorization and effective features specifyingdescription types. Finally, Section 4.5 discuss limitation of my approach and thenext steps and summarize contributions in this chapter.4.2 Related work4.2.1 Question requiring descriptive answersI have learned from experience that there are more questions that lead to answersdescribed with sentences and texts than those to answers with a few words. Thesurvey of Q&A articles conducted in this study also indicated a high frequencyof descriptive answers (cf. Section 4.3). There are some leading studies that calla descriptive answer a “long-answer” because it is composed of long texts ratherthan words and phrases, and an answer of words and phrases a “short-answer.”[13] They also focus on the descriptive features of the answers.It is not easy to precisely define a descriptive answer and make an exhaustivelist of all description types that belong to the class of such answers. Some questiontypes that require a descriptive answer have been proposed, such as the Definition,Reason, Reputation, Opinion, Method, and so forth. When describing answersto these questions, many facts are listed to give definitions and reasons, and theprocedure is itemized, which results in a description that tends to be composedof several sentences and longer than the answers to other types of question.In recent years, I have seen many papers on questions asking definitions [13,48
30, 77, 154] and those asking reputations and opinions [58, 71]. The number ofpapers on reasons [55, 57] and methods [6, 40, 79, 135–137, 184] is increasing, butthere are still not many.Since descriptive answers often consist of several sentences, it is possible toclassify the answers by their discursive features and explanatory strategy, to whichthe conventional general discourse analysis method can be applied.4.2.2 Discourse analysisDiscourse analysis has a long history, is very extensive, and encompasses manystudy cases. The scope extends from the analysis of natural interaction [49, 180,181] to that with literal “reading” [189]. Here, I introduce cases that deal with explanatorywritten texts. Textual discourse analysis identifies text segment typessuch as clauses, sentences, and paragraphs, and the logical and rhetorical relationsamong them [16, 45, 46, 55, 57, 65, 177, 183, 190]. The Rhetorical StructureTheory(RST) [16] is one of the most often used discourse analysis methods in naturallanguage processing. Mann et al. built a bottom-up dependency tree calleda rhetorical structure by defining logical and rhetorical relations between clausesand fixing the dependency among the clauses. Based on their idea, Marcu et al.proposed a method for automatically generating a rhetorical structure tree fromthe corpus [85]. Rhetorical structure tags based on RST have been appended tosome large corpora [110].Some previous work studied Japanese corpus annotation based on descriptiontype. For instance, in annotation by human, there are categorization of definitionof word in dictionary [150], annotation of causal relation between sentences[57], and in annotation by computer, automatic tagging to definition statementsof words in web pages [31]. Those work mentioned problems of this kind ofannotation as follows;• Huge amount of corpus are required to prove statistically any hypotheses,because the number of annotated tags for description types per an articleare relatively a few comparing other linguistic annotation.• Low efficiency of annotation due to read the long context of expression whenassigning a tag to the expression.49
Page 1:
NAIST-IS-DD0061208Doctoral Disserta
Page 4 and 5:
This thesis studies two fundamental
Page 6 and 7:
F 0.8 F 0.7 , , , , , iv
Page 8 and 9:
3.4.4 Experimental settings . . . .
Page 10 and 11:
List of Tables3.1 Definitions of Qu
Page 12 and 13: List of Figures1.1 Division of Quer
Page 15 and 16: Chapter 1Introduction1.1 Motivation
Page 17 and 18: The Number of Question per QuerySin
Page 19: answers. Although different distrib
Page 22 and 23: ComputerHumanComputerHumanSpecializ
Page 24 and 25: Blog PageMultipleSentenceQueryQ1Q2e
Page 26 and 27: Data FlowQuestion TypeIdentificatio
Page 28 and 29: I will introduce fundamental techno
Page 31 and 32: Chapter 3Question Type Identificati
Page 33 and 34: used for question sentence type ide
Page 35 and 36: (1) We pay car commuter employees a
Page 37 and 38: Table 3.2. Classified Given Questio
Page 39 and 40: Figure 3.2. Combinations of Questio
Page 41 and 42: Plants can grow indoors. In additio
Page 43 and 44: this thesis. This Chapter proposes
Page 45 and 46: as Inside/Outside [113, 116] and St
Page 47 and 48: Step 1 Segment a question article i
Page 49 and 50: elements be Θ. Then the degree of
Page 51 and 52: Table 3.5. Transition of Question T
Page 53 and 54: Table 3.6. Summary of Experimental
Page 55 and 56: Table 3.8. Results of Chunking Vary
Page 57 and 58: However when using IOB2/IOE2/IOBES,
Page 59 and 60: 3.6 Related workIdentification of t
Page 61: Chapter 4Categorization of Descript
Page 65 and 66: elements were usually eliminated. H
Page 67 and 68: Table 4.1. The Definitions of Descr
Page 69 and 70: 4.3.3 Overview of datasetsTable 4.2
Page 71 and 72: Table 4.4. Categorization of n Obje
Page 73 and 74: 4.3.6 DiscussionIn the field of nat
Page 75 and 76: 4.4 Description type based answer c
Page 77 and 78: R = |Rc||Ra|(4.10)Varying combinati
Page 79 and 80: Chapter 5Extraction of ProceduralEx
Page 81 and 82: Table 5.2. Domain and Type of List.
Page 83 and 84: Figure 5.2. Collection of Lists fro
Page 85 and 86: Table 5.3. Types of Tags.Tag types
Page 87 and 88: domain with the document set in the
Page 89 and 90: Table 5.6. Result of Close-Domain.C
Page 91 and 92: Table 5.9. Comparison of SVM and De
Page 93: Sentence : “ [menyu] w o s ent a
Page 96 and 97: • It is applicable in case that m
Page 98 and 99: • For extraction of procedural ex
Page 100 and 101: [10] Regina Barzilay. Information F
Page 102 and 103: [29] Yoav Freund and Robert E. Scha
Page 104 and 105: [47] Chiori Hori, Takaaki Hori, Hid
Page 106 and 107: [63] Mingzhe Jin. Authorship attrib
Page 108 and 109: [84] Christopher D. Manning and Hin
Page 110 and 111: of International World Wide Web Con
Page 112 and 113:
[122] Akihiro Shinmori, Manabu Okum
Page 114 and 115:
[141] Akihiro Tamura, Hiroya Takamu
Page 116 and 117:
[161] Yudong Yang and HongJiang Zha
Page 118 and 119:
[187] . n-gram . , Vol. 23,No. 5,
Page 120 and 121:
- (Evaluation) Yes-No Yes-No (Ho
Page 122 and 123:
(Analysis) : (Fact) : (Instance)
Page 125:
List of PublicationJournal Papers[1
show all

file - ChaSen - 奈良先端科学技術大学院大学

Create successful ePaper yourself

Delete template?

Save as template?