file - ChaSen - 奈良先端科学技術大学院大学

• The annotated expressions vary, so rules that state description types cannot be acquired. It is therefore necessary to gather a corpus automatically and to extract the features of description types.

Several notable studies have been conducted on discourse analysis of the Japanese language [51, 95, 170, 175]. However, previous work on actual answer corpora for Japanese question answering is rarely found. Maynard [88] explored the structures of answers in the Q&A of radio programs and tried to typify them.

4.2.3 Answering procedures

Many recent studies on open-domain question answering respond with definitions, reasons, and reputations. However, only a few studies have addressed question answering that responds with methods. Studies on method retrieval restricted to particular text styles and domains, such as searching for patents [32, 122] and cooking recipes [40, 121, 125], have been conducted for a long time. Questions related to all procedures were addressed by an expert system [9]. However, only a few studies have been conducted on question answering that responds by searching for methods in an open-domain text set such as Web texts [5, 135–137, 163]. Additionally, this kind of question-answering system requires a more flexible and more machine-operable approach because of the diversity and changeable nature of the information resources.

Recently, the most successful approach has been to combine many shallow clues in the texts and occasionally in other linguistic resources. In this approach, the performance of passage retrieval and categorization is vital for the performance of the entire system. In particular, the productiveness of the knowledge of expressions corresponding to each question type, which is principally exploited in retrieval and categorization, is important. In this sense, the requirements for categorization in such applications differ from those in previous categorization tasks.

In text categorization research, feature selection has been discussed [120, 130, 132, 162]. However, most of this research dealt with categorization into taxonomies related to domain and genre. The features used are primarily content words, such as nouns, verbs, and adjectives; functional words and frequent formative elements were usually eliminated. However, some particular areas of text categorization, for example authorship identification, have suggested the feasibility of text categorization with functional expressions on an axis different from that of document topics [63, 147, 187].

4.3 Annotating description types of answers

As stated at the beginning of this chapter, the classification of answers has often been discussed together with the classification of questions. However, there are no established categories of descriptive answers, and the relationships between classification categories and question categories have not been clarified either. Therefore, I conducted an experiment to classify answers using classification categories that are based on the discursive features of general texts and that were proposed in leading studies. The classification was performed by tagging the answer articles. I tried to clarify the necessary conditions for categories of descriptive answers and for the methods of tagging them.

4.3.1 Description types

To further explore the description types of answers, this thesis considers a framework for solving four problems, including those described in the last section. For the first problem, collecting a corpus for annotating description types, and the second problem, reducing the annotation cost of tagging, this thesis supposes a network environment in which anonymous annotators tag articles with description types. To realize this kind of annotation framework, I have to know, at the least, which description types can be stably assigned by non-professional annotators. This thesis assumed annotation instructions and definitions of description types at the level of a book on technical writing for general readers, and then investigated the feasibility of annotating such discursive features of text. For the third problem, this thesis relies on machine-learning-based approaches to automatically acquire rules for specifying description types from a tagged corpus. Finally, for the fourth problem, feature analysis of answers in Japanese question answering, I annotated answers in an actual web Q&A service with description types and examined the features of the description types.
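The two ideas above, categorizing text by functional expressions rather than content words and acquiring rules for description types from a tagged corpus, can be illustrated with a small sketch. It is not the method used in this thesis. It assumes a multinomial naive Bayes classifier with add-one smoothing, and the function-word list, the two labels ("procedure" and "reason"), and the English toy corpus are all hypothetical stand-ins for Japanese functional expressions and the actual description types.

```python
import math
from collections import Counter, defaultdict

# Hypothetical closed list of functional expressions (English stand-ins).
FUNCTION_WORDS = {"first", "then", "next", "because", "since",
                  "so", "is", "a", "the", "to"}


def functional_features(text):
    """Keep only functional expressions; content words are discarded."""
    return [tok for tok in text.lower().split() if tok in FUNCTION_WORDS]


def train(tagged_corpus):
    """Count functional expressions per description type."""
    token_counts = defaultdict(Counter)
    type_counts = Counter()
    for text, label in tagged_corpus:
        type_counts[label] += 1
        token_counts[label].update(functional_features(text))
    return token_counts, type_counts


def classify(text, model):
    """Return the description type with the highest smoothed log score."""
    token_counts, type_counts = model
    total_docs = sum(type_counts.values())
    vocab_size = len(FUNCTION_WORDS)
    best_label, best_score = None, float("-inf")
    for label in sorted(type_counts):
        # Log prior of the type plus add-one smoothed log likelihoods.
        score = math.log(type_counts[label] / total_docs)
        denom = sum(token_counts[label].values()) + vocab_size
        for tok in functional_features(text):
            score += math.log((token_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical tagged corpus: (answer text, description type).
TAGGED = [
    ("first open the lid then pour the water", "procedure"),
    ("first press the button next wait", "procedure"),
    ("it fails because the cable is loose", "reason"),
    ("prices rose since demand is high so stock ran out", "reason"),
]

model = train(TAGGED)
print(classify("first remove the cover then clean the filter", model))   # procedure
print(classify("the screen flickers because the driver is old", model))  # reason
```

Neither test sentence shares a content word with the training texts of its type, so the decision rests only on the functional expressions. This is the sense in which such features lie on an axis different from document topic.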
NAIST-IS-DD0061208, Doctoral Dissertation
