18.01.2015 Views

PPT - Key Laboratory of Data Engineering and Knowledge Engineering, Ministry of Education (数据工程与知识工程教育部重点实验室)



Comparison between different models

LDA
• Document-level topic distribution
  – Suffers from the sparsity of the document
• Models the generation of each word
  – Ignores context

Mixture of Unigrams
• Corpus-level topic distribution
  – Alleviates document sparsity
• Single-topic assumption for each document
  – Too strong an assumption

BTM
• Corpus-level topic distribution
  – Alleviates document sparsity
• Models the generation of word pairs
  – Leverages context
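
The "word pairs" that BTM models are usually called biterms: unordered pairs of words that co-occur in the same text. As a rough illustration only (not the authors' implementation), the Python sketch below enumerates such pairs for one tokenized document. The function name extract_biterms and the optional window parameter are assumptions introduced for this example.

```python
from itertools import combinations


def extract_biterms(tokens, window=None):
    """Return the unordered word pairs (biterms) found in one tokenized text.

    tokens: list of word strings for a single document.
    window: hypothetical optional limit; when set, only positions i < j
            with j - i < window are paired. None pairs every two positions.
    """
    biterms = []
    for i, j in combinations(range(len(tokens)), 2):
        if window is not None and j - i >= window:
            continue  # the two words are too far apart to count as context
        # Sort each pair so that (a, b) and (b, a) are the same biterm.
        biterms.append(tuple(sorted((tokens[i], tokens[j]))))
    return biterms


doc = ["apple", "store", "iphone"]
print(extract_biterms(doc))
# [('apple', 'store'), ('apple', 'iphone'), ('iphone', 'store')]
```

Pooling the biterms of all documents gives one corpus-level collection of pairs, which is how a corpus-level topic distribution can be estimated even when each individual document contains only a few words.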
