Automatic Mapping Clinical Notes to Medical - RMIT University

More documents

Recommendations

Info

$journal of latex class files, vol. 6, no. 1, january ... - RMIT University$

References Aronson, A. R. (2001). Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proc AMIA Symp 17: 21. Brennan, P. F. and A. R. Aronson (2003). Towards linking patients and clinical information: detecting UMLS concepts in e-mail. Journal of Biomedical Informatics 36(4/5): 334-341. Brown, P. J. B. and P. Sönksen (2000). Evaluation of the Quality of Information Retrieval of Clinical Findings from a Computerized Patient Database Using a Semantic Terminological Model. Journal of the American Medical Informatics Association 7: 392-403. Chapman, W. W., W. Bridewell, et al. (2001). A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries. Journal of Biomedical Informatics 34(5): 301-310. Chute, C. G. and P. L. Elkin (1997). A clinically derived terminology: qualification to reduction. Proc AMIA Annu Fall Symp 570: 4. Elkin, P. L., S. H. Brown, et al. (2005). A controlled trial of automated classification of negation from clinical notes. BMC Medical Informatics and Decision Making 2005(5): 13. Friedman, C., P. O. Alderson, et al. (1994). A general natural-language text processor for clinical radiology. Journal of the American Medical Informatics Association 1(2): 161-174. Friedman, C., L. Shagina, et al. (2004). Automated Encoding of Clinical Documents Based on Natural Language Processing. J Am Med Inform Assoc 11: 392-402. Hazlehurst, B., H. R. Frost, et al. (2005). MediClass: A System for Detecting and Classifying Encounterbased Clinical Events in Any Electronic Medical Record, American Medical Informatics Association. Hersh, W. R. and D. Hickam (1995). Information retrieval in medicine: The SAPHIRE experience. Journal of the American Society for Information Science 46(10): 743-747. Huang, Y., H. J. Lowe, et al. (2005). Improved Identification of Noun Phrases in Clinical Radiology Reports Using a High-Performance Statistical Natural Language Parser Augmented with the UMLS Specialist Lexicon, American Medical Informatics Association. Lindberg, D. A., B. L. Humphreys, et al. (1993). The Unified Medical Language System. Methods Inf Med 32(4): 281-91. Liu H., Johnson S. B. and Friedman C. (2002), Automatic Resolution of Ambiguous Terms Based on Machine Learning and Conceptual Relations in the 80 UMLS. Journal of the American Medical Informatics Association, Vol. 9, No. 6, Pages 621-636. Mutalik, P. G., A. Deshpande, et al. (2001). Use of General-purpose Negation Detection to Augment Concept Indexing of Medical DocumentsA Quantitative Study Using the UMLS, Journal of the American Medical Informatics Association 2001. Nadkarni, P., R. Chen, et al. (2001). UMLS Concept Indexing for Production Databases. Journal of the American Medical Informatics Association 8: 80- 91. Reynar, J. C. and A. Ratnaparkhi (1997). A maximum entropy approach to identifying sentence boundaries. Proceedings of the Fifth Conference on Applied Natural Language Processing: 16–19. SNOMED International. (2006). SNOMED International: The Systematized Nomenclature of Medicine. http://www.snomed.org. Last accessed 08-09- 2006. Stearns, M. Q., C. Price, et al. (2001). SNOMED clinical terms: overview of the development process and project status. Proc AMIA Symp 662(6). Association for Computing Machinery. 1983. Computing Reviews, 24(11):503-512. Tsuruoka, Y., Y. Tateishi, et al. Developing a robust part-of-speech tagger for biomedical text. Lecture notes in computer science: 382-392. Zou, Q., W. W. Chu, et al. (2003). IndexFinder: A Method of Extracting Key Concepts from Clinical Texts for Indexing. Proc AMIA Symp 763: 7.
Pseudo Relevance Feedback Using Named Entities for Question Answering Luiz Augusto Pizzato and Diego Mollá CLT - Macquarie University Sydney, Australia {pizzato, diego}@ics.mq.edu.au Abstract Relevance feedback has already proven its usefulness in probabilistic information retrieval (IR). In this research we explore whether a pseudo relevance feedback technique on IR can improve the Question Answering task (QA). The basis of our exploration is the use of relevant named entities from the top retrieved documents as clues of relevance. We discuss two interesting findings from these experiments: the reasons the results were not improved, and the fact that today’s metrics of IR evaluation on QA do not reflect the results obtained by a QA system. 1 Introduction Probabilistic Information Retrieval estimates the documents’ probability of relevance using a small set of keywords provided by a user. The estimation of these probabilities is often assisted by the information contained in documents that are known to be relevant for every specific query. The technique of informing the IR system which documents or information are relevant to a specific query is known as relevance feedback. As reported by Ruthven and Lalmas (2003), relevance feedback techniques have been used for many years, and they have been shown to improve most probabilistic models of Information Retrieval (IR). Relevance feedback is considered as pseudo (or blind) relevance feedback when there is an assumption that the top documents retrieved have a higher precision and that their terms represent the subject expected to be retrieved. In other words, it is assumed that the documents on the top of the retrieval list are relevant to the query, and informa- Cécile Paris ICT - CSIRO Sydney, Australia Cecile.Paris@csiro.au tion from these documents is extracted to generate a new retrieval set. In this paper we explore the use of a pseudo relevance feedback technique for the IR stage of a Question Answering (QA) system. It is our understanding that most questions can be answered using an arbitrary number of documents when querying an IR system using the words from the topic and the question. Because QA is normally a computationally demanding task, mostly due to the online natural language processing tools used, we believe that IR can help QA by providing a small list of high-quality documents, i.e. documents from where the QA system would be able to find the answer. In this sense, documents containing answers for a question in a sentence structure that can be easily processed by a QA system will be highly relevant. In this work, we describe an experiment using a pseudo relevance feedback technique applied over a probabilistic IR system to try to improve the performance of QA systems. Since most types of factoid questions are answered by named entities, we assume that documents addressing the correct topic but not containing any named entity of the expected answer class would have a low probability of relevance regarding QA. Therefore, we hypothesise that documents containing named entities of the correct class have higher probability of relevance than those not containing them. The relevance feedback applied to QA differs from the one applied to general IR in the sense that QA deals more with the presence of a passage that can answer a certain question than with the presence of its topic. In this sense, our technique focuses on feeding terms into the IR engine that could represent an answer for the questions. Despite the fact that it is possible to apply the Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), pages 81–88. 81
Page 1:
Proceeding of the Australasian Lang
Page 4 and 5:
Program Chairs: Committees Lawrence
Page 6 and 7:
Towards the Evaluation of Referring
Page 8 and 9:
The cat ate the hat that I made NP/
Page 10 and 11:
Figure 2: Cells affected by adding
Page 12 and 13:
were used because the accuracy for
Page 14 and 15:
TIME ACC COVER FAIL RATE % secs % %
Page 16 and 17:
model. We explore different estimat
Page 18 and 19:
S.S. Source Sense Source S.S. Acc.
Page 20 and 21:
acy. The difference in correctly as
Page 22 and 23:
Experiments with Sentence Classific
Page 24 and 25:
datasets with skewed distribution t
Page 26 and 27:
words produced by the feature selec
Page 28 and 29:
Sentence Class NB DT SVM Apology 1.
Page 30 and 31:
Computational Semantics in the Natu
Page 32 and 33:
S[sem = ] -> NP[sem=?subj] VP[sem=?
Page 34 and 35:
Any string which cannot be decompos
Page 36 and 37: satisfy(Formula,model(D,F),G,pos):n
Page 38 and 39: Classifying Speech Acts using Verba
Page 40 and 41: The principles are dichotomous —
Page 42 and 43: ance. Several additional example ut
Page 44 and 45: our results. In classifying only th
Page 46 and 47: Word Relatives in Context for Word
Page 48 and 49: create a collection of training exa
Page 50 and 51: Query Sense The nave was rebuilt in
Page 52 and 53: Algorithm Avg S2LS Avg S3LS Avg S2L
Page 54 and 55: B. Snyder and M. Palmer. 2004. The
Page 56 and 57: it is even used as a black box that
Page 58 and 59: ecognition in QA, so we will not us
Page 60 and 61: BPER ILOC IPER BLOC BLOC BDATE BLOC
Page 62 and 63: of noise introduced by the addition
Page 64 and 65: 2 Existing annotated corpora Much o
Page 66 and 67: Our FUSE|tel spectrum of HD|sta 738
Page 68 and 69: N Word Correct Tagged 23 OH mol non
Page 70 and 71: L ATEX typesetting information into
Page 72 and 73: in English, neither the determiner
Page 74 and 75: con 4 (Sagot et al., 2006) and Morp
Page 76 and 77: Feature type Positions/description
Page 78 and 79: Miriam Butt, Helge Dyvik, Tracy Hol
Page 80 and 81: ently mostly solved by employing hu
Page 82 and 83: Figure 1: Augmented SCT Lexicon Tok
Page 84 and 85: medical terms can be composed by ad
Page 88 and 89: technique to most question types, i
Page 90 and 91: User Question Analyser QA System Se
Page 92 and 93: Figure 2: Precision Figure 3: Cover
Page 94 and 95: Analysis and Review. Springer-Verla
Page 96 and 97: 2 Background 2.1 Pronoun Reference
Page 98 and 99: ous accounts of the effects of cohe
Page 100 and 101: Table 1: Overall Results for Object
Page 102 and 103: References Jennifer Arnold, Janet E
Page 104 and 105: In order for appropriate documents
Page 106 and 107: y Michael West. He found that his t
Page 108 and 109: low-scoring documents are “Click
Page 110 and 111: a) b) You must complete at least 9
Page 112 and 113: uct). In this paper, we will only c
Page 114 and 115: indicate the categories of substruc
Page 116 and 117: Argument lowering Dependent geach
Page 118 and 119: who [u : np] WH(np, s, s/?gq) λP.
Page 120 and 121: algorithms, and report the performa
Page 122 and 123: 2.3 Coverage of the Human Data Out
Page 124 and 125: and minimal subsets of the data. Of
Page 126 and 127: output. Finally, and related to the
Page 128 and 129: identifies, to avoid overwhelming t
Page 130 and 131: y the student is first parsed, usin
Page 132 and 133: a given word sequence Sorig as a pe
Page 134 and 135: A problem arises with this scheme w
Page 136 and 137:
Example 1: Original Headline: Europ
Page 138 and 139:
|relations(sentence1) ∩ relations
Page 140 and 141:
2685 True False 1275 Bleu Dep Bleu
Page 142 and 143:
Features Acc. C1-prec. C1-recall C1
Page 144 and 145:
Given the lack of research in VSD u
Page 146 and 147:
ased features: Type 1 Original sibl
Page 148 and 149:
Preposition of the adjunct semantic
Page 150 and 151:
Algorithm 1 The algorithm of combin
Page 152 and 153:
References Timothy Baldwin and Fran
Page 154 and 155:
dependencies in German with respect
Page 156 and 157:
wij moeten dit probleem aanpakken w
Page 158 and 159:
a brevity penalty of BLEU n = 1 n =
Page 160 and 161:
plicitly matching the syntax of the
Page 162 and 163:
Probabilities improve stress-predic
Page 164 and 165:
Towards Cognitive Optimisation of a
Page 166 and 167:
Natural Language Processing and XML
Page 168 and 169:
Extracting Patient Clinical Profile
show all

Automatic Mapping Clinical Notes to Medical - RMIT University

Create successful ePaper yourself

Delete template?

Save as template?