19.11.2014 Views

Localized Supervised Metric Learning on ... - Researcher - IBM

Localized Supervised Metric Learning on ... - Researcher - IBM

Localized Supervised Metric Learning on ... - Researcher - IBM

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

which c<strong>on</strong>firms the importance of leveraging label informati<strong>on</strong><br />

into the distance metric. We also observe that<br />

Wavelet features improve the performance significantly<br />

for LSML, where the classificati<strong>on</strong> error drops by half<br />

(from about 15% to less than 7%.)<br />

Table 1. Classificati<strong>on</strong> error comparis<strong>on</strong><br />

Expert PCA LSML<br />

Statistic features 0.2295 0.2131 0.1475<br />

Wavelet features NA 0.2295 0.0656<br />

Table 2. Precisi<strong>on</strong>@10 retrieval results<br />

Expert PCA LSML<br />

Statistic features 0.6120 0.5355 0.6557<br />

Wavelet features NA 0.5410 0.7869<br />

Sensitivity Analysis There are two parameters in the<br />

study: 1) the number of neighbors k in the k-NN classifier<br />

and 2) the dimensi<strong>on</strong>ality d of the resulting lowdimensi<strong>on</strong>al<br />

space (after PCA and LSML). Figure 3<br />

shows the reuslts of sentivity analysis <strong>on</strong> these two parameters.<br />

Figure 3(a) plots classificati<strong>on</strong> error vs. k for<br />

all methods. Small k leads to lower classificati<strong>on</strong> error,<br />

which c<strong>on</strong>firms the need for a localized distance metric.<br />

Figure 3(b) plots classificati<strong>on</strong> error vs. dimensi<strong>on</strong>ality<br />

d for all methods except Expert, which c<strong>on</strong>firms the<br />

stability of LSML w.r.t. to different d.<br />

6. C<strong>on</strong>clusi<strong>on</strong> and Discussi<strong>on</strong><br />

We have presented a method for deriving semantically<br />

sound similarity measures for retrieving patients<br />

represented by multi-dimensi<strong>on</strong>al time series. Our<br />

method uses both statistical and wavelet based features<br />

to capture the characteristics of patients, and leverages<br />

localized supervised metric learning to incorporate<br />

physicians’ expert domain knowledge. Experiments using<br />

the MIMIC II database dem<strong>on</strong>strates the efficacy of<br />

this appraoch. In future work we plan to explore ways<br />

to explicitly incorporate temporal characteristics of the<br />

data to further improve metric learning in this particular<br />

c<strong>on</strong>text.<br />

References<br />

[1] MIMIC II Database.<br />

http://physi<strong>on</strong>et.org/physiobank/database/mimic2db/.<br />

[2] X. Chen, D. Xu, G. Zhang, and R. Mukkamala. Forecasting<br />

acute hypotensive episodes in intensive care<br />

patients based <strong>on</strong> a peripheral arterial blood pressure<br />

waveform. Computers in Cardiology, 36, 2000.<br />

[3] I. Daubechies. Ten Lectures <strong>on</strong> Wavelets. SIAM,<br />

Philadelphia, 1992.<br />

(a) Stable with different k<br />

(b) Stable with different d<br />

Figure 3. LSML is stable with different parameter<br />

values.<br />

[4] K. Fukunaga. Introducti<strong>on</strong> to Statistical Pattern Recogniti<strong>on</strong>.<br />

Academic Press, San Diego, California, 1990.<br />

[5] J. Goldberger, S. Roweis, G. Hint<strong>on</strong>, and R. Salakhutdinov.<br />

Neighborhood comp<strong>on</strong>ent analysis. In NIPS, 2005.<br />

[6] Y. Jia, F. Nie, and C. Zhang. Trace ratio problem revisited.<br />

IEEE Transacti<strong>on</strong>s <strong>on</strong> Neural Networks, 2009.<br />

[7] M. Saeed and R. Mark. A novel method for the efficient<br />

retrieval of similar multiparameter physiologic time series<br />

using wavelet-based symbolic representati<strong>on</strong>s. In<br />

American Medical Informatics Associati<strong>on</strong>, 2006.<br />

[8] M. Sugiyama. Dimensi<strong>on</strong>ality reducti<strong>on</strong> of multimodal<br />

labeled data by local fisher discriminant analysis. J.<br />

Mach. Learn. Res., 8, 2007.<br />

[9] F. Wang, J. Sun, T. Li, and N. Anerousis. Two heads<br />

better than <strong>on</strong>e: <str<strong>on</strong>g>Metric</str<strong>on</strong>g>+active learning and its applicati<strong>on</strong>s<br />

for it service classificati<strong>on</strong>. In ICDM, 2009.<br />

[10] F. Wang and C. Zhang. Feature extracti<strong>on</strong> by maximizing<br />

the neighborhood margin. In CVPR, 2007.<br />

[11] E. P. Xing, A. Y. Ng, M. I. Jordan, and S. Russell.<br />

Distance metric learning, with applicati<strong>on</strong> to clustering<br />

with side-informati<strong>on</strong>. In NIPS, 2002.<br />

[12] L. Yang. Distance metric learning: A comprehensive<br />

survey. Technical report, Michgan State University,<br />

2006.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!