
Automatic Mapping Clinical Notes to Medical - RMIT University


Table 3: Class-by-class performance

Sentence Class     NB             DT             SVM
Apology            1.000 (0.000)  1.000 (0.000)  1.000 (0.000)
Instruction        0.593 (0.109)  0.619 (0.146)  0.675 (0.126)
Instruction-item   0.718 (0.097)  0.582 (0.141)  0.743 (0.127)
Others             0.117 (0.249)  0.411 (0.283)  0.559 (0.282)
Question           0.413 (0.450)  1.000 (0.000)  1.000 (0.000)
Request            0.896 (0.042)  0.930 (0.046)  0.940 (0.047)
Response-ack       0.931 (0.061)  0.902 (0.037)  0.942 (0.057)
Salutation         0.908 (0.029)  0.972 (0.028)  0.981 (0.020)
Signature          0.370 (0.362)  0.960 (0.064)  0.986 (0.045)
Specification      0.672 (0.211)  0.520 (0.218)  0.829 (0.151)
Statement          0.837 (0.042)  0.843 (0.040)  0.880 (0.035)
Suggestion         0.619 (0.206)  0.605 (0.196)  0.673 (0.213)
Thanking           1.000 (0.000)  1.000 (0.000)  1.000 (0.000)
Url                0.870 (0.071)  0.970 (0.041)  0.988 (0.025)

SUGGESTION and STATEMENT were often misclassified as one another. This means that there were not enough distinguishing features to clearly separate these classes. The highest confusion was between INSTRUCTION and STATEMENT, and indeed, sentences of the form “the driver must be installed before the device will work” can be interpreted as both an instruction and a general statement. This suggests that the usage of some of these sentence classes may need to be revised.
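The kind of class confusion described above is typically read off a confusion matrix. The following is a minimal, stdlib-only sketch of that bookkeeping; the label sequences are invented toy data, not the paper's corpus:

```python
# Minimal sketch: building a confusion matrix to inspect class confusion.
# The label sequences below are invented toy data, not the paper's corpus.
from collections import Counter

labels = ["Instruction", "Statement", "Suggestion"]
y_true = ["Suggestion", "Statement", "Instruction",
          "Statement", "Suggestion", "Instruction"]
y_pred = ["Statement", "Statement", "Statement",
          "Statement", "Suggestion", "Instruction"]

# counts[(t, p)] = number of sentences with true class t predicted as p
counts = Counter(zip(y_true, y_pred))

# Rows are true classes, columns are predicted classes;
# off-diagonal cells reveal confusions such as Suggestion -> Statement.
cm = [[counts[(t, p)] for p in labels] for t in labels]
for row_label, row in zip(labels, cm):
    print(row_label, row)
```

Off-diagonal mass concentrated in a single column (here, Statement) is exactly the symptom the paragraph describes: the features available do not separate those classes.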

4 Conclusions

We have presented a set of experiments involving sentence classification. While the successful deployment of classification algorithms for sentences has been demonstrated previously, this kind of classification has received far less attention than the classification of complete documents. In particular, to the best of our knowledge, the usefulness of feature selection for sentence classification has not been investigated.

There are many types of documents where individual sentences carry important information regarding communicative acts between parties. In our experiments this corresponds to email responses to technical help-desk inquiries. However, there are many more examples of such documents, including different kinds of email (both personal and professional), newsgroup and forum discussions, on-line chat, and instant messaging. Sentence classification is therefore a useful task that deserves more investigation. In particular, such investigations need to relate their results to the better-established ones from text classification experiments, and thus highlight the significant differences between the two tasks.

Our results confirm some observations made<br />


from text classification. The SVM classification algorithm generally outperforms other common algorithms, and is largely insensitive to feature selection. Further, the effect of non-trivial feature selection algorithms is mainly observed when aggressive selection is required. When a less aggressive selection is acceptable (that is, when more features are retained), a simple and computationally cheap frequency-based selection is adequate. Our results also show some important differences between text and sentence classification. Sentences are much smaller than documents and carry less textual information. This means that in pruning the feature space one needs to be very careful not to eliminate strongly discriminative features, especially when there is a large skew in the class distribution. We saw that lemmatization and stopword removal proved detrimental, whereas they have been demonstrated to provide a useful dimensionality reduction in text classification. This difference between sentences and documents may also be responsible for obscuring the effect of a particular feature selection method (BNS), which has been demonstrated to outperform others when there is a large distribution skew. We conclude from these observations that while feature selection is useful for reducing the dimensionality of the classification task, and can even improve the performance of some classifiers, the extent of its usefulness is not as large as in text classification.
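The "computationally cheap frequency-based selection" mentioned above can be sketched in a few lines: rank tokens by corpus frequency and keep only the top k as features. The function names and toy sentences below are illustrative assumptions, not the paper's implementation:

```python
# Minimal sketch of frequency-based feature selection for sentence
# classification. Function names and the corpus are invented for
# illustration; they are not the paper's implementation or data.
from collections import Counter

def select_by_frequency(sentences, k):
    """Return the k most frequent tokens across the corpus."""
    freq = Counter(tok for s in sentences for tok in s.lower().split())
    return [tok for tok, _ in freq.most_common(k)]

def vectorize(sentence, features):
    """Binary bag-of-words vector over the retained features."""
    toks = set(sentence.lower().split())
    return [1 if f in toks else 0 for f in features]

corpus = [
    "please install the driver",
    "the driver must be installed",
    "thank you for your help",
]
features = select_by_frequency(corpus, 4)
print(features)
print(vectorize("install the driver first", features))
```

With sentences this short, an aggressive cut (small k) can easily discard a rare but strongly discriminative token, which is the risk the paragraph warns about.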

Acknowledgments

This research was supported in part by grant LP0347470 from the Australian Research Council and by an endowment from Hewlett-Packard. The authors also thank Hewlett-Packard for the extensive help-desk data, Ingrid Zukerman, Peter
