file - ChaSen - 奈良先端科学技術大学院大学

More documents

Recommendations

Info

tion type identification.Extraction of Descriptive AnswerAchieving a descriptive answer required in question-answering (DQA) poses manydifficulties. Examples of descriptive answers (DAs) include a How-to, a Condition,a Definition, an Opinion, a Reason and so forth. These answer types define typesof questions. How do we extract these answers from their source articles? Firstly,we have to determine the DA boundary in a source article, Answer Segment.Secondly, we have to set various parameters to select relevant answers for theuser query from variants of correct answers, such as fineness and concreteness ofdescription, coverage of related information, degree of cross-reference between relateddocuments, required document structure, subjectivity, or credibility, basedon experience or speculation. Even if we suitably establish these conditions, wecan consider multiple relevant answers according to the discourse structures intheir answers. For instance, when we examine “Cut, boil and fill a bowl.” isthis a mere list of actions or a procedure? To deal with this type of problemcorrectly, we have to be able to recognize discourse relations, including, logicalrelations: parallelism, causality, supposition; temporal relation such as the orderof actions, spatial relations such as the role and location of the agent, rhetoricalrelations such as exemplification and definition. Simple bag of words featuresare insufficient for extracting the exact answer. Unfortunately, by current naturallanguage processing (NLP), it is too difficult to solve all these problems.There are two possible alternatives of condition setting of DQA. The first oneis a restriction of a specific domain, such as cooking recipe [40, 121]. The secondis restriction on the style of answers [13, 30, 31]. In some cases, we can exploitthe style of description frequently appearing in an answer type to narrow downanswer candidates. For instance, if we wish to know the meaning of Soba, “”, the answer style could mimic the description style of a dictionary, such as”Soba : Thin Japanese noodles made from buckwheat flour.” Therefore, if wemake preparations beforehand regarding the lexical and semantic patterns andthen match the patterns to answer candidates, there are fewer and more relevantanswer candidates to sort through. If we could also find a style that is dominantin a descriptive answer type, the style would possibly work well to identify correct4
answers. Although different distributions of description style regarding differentdomains are predictable, some style can be considered to appear in various domains.Thus we can expect the feature of style in one domain to be also effectivein other domains. What styles are frequently used in descriptive answers? Howshould a style of description, that is description type, be defined? Because we aimat extraction of answers from articles in documents, do we have to take accountof linguistic expressions to define types of description style? Description type isnot equal to general document style or format but are not individual writing styleeither. We intend to find description types that can be used to accurately extracteach type of answer.As another solution to the difficulty of DQA, we could take account of exploitinghuman annotated semantic meta-data in the case of difficulty in extractingthe answer only using NLP, such as the example of a list and the procedurementioned above. What style in a Q&A corpus can be annotated as semanticmeta-data with high inter-annotator agreement? As the first step toward solvingthis problem, we performed description style annotation for Q&A articlesand studied the annotation results, clarifying features of the description style ofthe answer. Using the features of style, we conducted experiments of extractingarticles of a descriptive answer type, that is procedural expression from theWeb pages. Additionally, we explored the effective features of the extraction ofprocedural expressions.1.3 Guide to remaining chaptersWe overview previous studies of question-answering and related researches inChapter 2. Chapter 3 looks at multiple sentence query processing, and focuseson question segment extraction and question type identification for multi-sentencequeries. Chapter 4 and Chapter 5 are devoted to answer extraction. We discussannotation of description type to Q&A corpus in Chapter 4, and explore someexpected description type resulted in annotation experiment. In Chapter 5, wepropose the methodology of extraction of procedural expression from the Webpages using description type and machine learning, and show the effectiveness ofthe approach. Finally the thesis is concluded in Chapter 6.5
Page 1: NAIST-IS-DD0061208Doctoral Disserta
Page 4 and 5: This thesis studies two fundamental
Page 6 and 7: F 0.8 F 0.7 , , , , , iv
Page 8 and 9: 3.4.4 Experimental settings . . . .
Page 10 and 11: List of Tables3.1 Definitions of Qu
Page 12 and 13: List of Figures1.1 Division of Quer
Page 15 and 16: Chapter 1Introduction1.1 Motivation
Page 17: The Number of Question per QuerySin
Page 22 and 23: ComputerHumanComputerHumanSpecializ
Page 24 and 25: Blog PageMultipleSentenceQueryQ1Q2e
Page 26 and 27: Data FlowQuestion TypeIdentificatio
Page 28 and 29: I will introduce fundamental techno
Page 31 and 32: Chapter 3Question Type Identificati
Page 33 and 34: used for question sentence type ide
Page 35 and 36: (1) We pay car commuter employees a
Page 37 and 38: Table 3.2. Classified Given Questio
Page 39 and 40: Figure 3.2. Combinations of Questio
Page 41 and 42: Plants can grow indoors. In additio
Page 43 and 44: this thesis. This Chapter proposes
Page 45 and 46: as Inside/Outside [113, 116] and St
Page 47 and 48: Step 1 Segment a question article i
Page 49 and 50: elements be Θ. Then the degree of
Page 51 and 52: Table 3.5. Transition of Question T
Page 53 and 54: Table 3.6. Summary of Experimental
Page 55 and 56: Table 3.8. Results of Chunking Vary
Page 57 and 58: However when using IOB2/IOE2/IOBES,
Page 59 and 60: 3.6 Related workIdentification of t
Page 61 and 62: Chapter 4Categorization of Descript
Page 63 and 64: 30, 77, 154] and those asking reput
Page 65 and 66: elements were usually eliminated. H
Page 67 and 68: Table 4.1. The Definitions of Descr
Page 69 and 70:
4.3.3 Overview of datasetsTable 4.2
Page 71 and 72:
Table 4.4. Categorization of n Obje
Page 73 and 74:
4.3.6 DiscussionIn the field of nat
Page 75 and 76:
4.4 Description type based answer c
Page 77 and 78:
R = |Rc||Ra|(4.10)Varying combinati
Page 79 and 80:
Chapter 5Extraction of ProceduralEx
Page 81 and 82:
Table 5.2. Domain and Type of List.
Page 83 and 84:
Figure 5.2. Collection of Lists fro
Page 85 and 86:
Table 5.3. Types of Tags.Tag types
Page 87 and 88:
domain with the document set in the
Page 89 and 90:
Table 5.6. Result of Close-Domain.C
Page 91 and 92:
Table 5.9. Comparison of SVM and De
Page 93:
Sentence : “ [menyu] w o s ent a
Page 96 and 97:
• It is applicable in case that m
Page 98 and 99:
• For extraction of procedural ex
Page 100 and 101:
[10] Regina Barzilay. Information F
Page 102 and 103:
[29] Yoav Freund and Robert E. Scha
Page 104 and 105:
[47] Chiori Hori, Takaaki Hori, Hid
Page 106 and 107:
[63] Mingzhe Jin. Authorship attrib
Page 108 and 109:
[84] Christopher D. Manning and Hin
Page 110 and 111:
of International World Wide Web Con
Page 112 and 113:
[122] Akihiro Shinmori, Manabu Okum
Page 114 and 115:
[141] Akihiro Tamura, Hiroya Takamu
Page 116 and 117:
[161] Yudong Yang and HongJiang Zha
Page 118 and 119:
[187] . n-gram . , Vol. 23,No. 5,
Page 120 and 121:
- (Evaluation) Yes-No Yes-No (Ho
Page 122 and 123:
(Analysis) : (Fact) : (Instance)
Page 125:
List of PublicationJournal Papers[1
show all

file - ChaSen - 奈良先端科学技術大学院大学

Create successful ePaper yourself

Delete template?

Save as template?