18.01.2015 Views

PPT - 数据工程与知识工程教育部重点实验室

PPT - 数据工程与知识工程教育部重点实验室

PPT - 数据工程与知识工程教育部重点实验室

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Biterm Topic Model(BTM) <br />

• Biterm: co-occurred word pairs in short text <br />

– "visit apple store" -> "visit apple", "visit store", "apple store“!<br />

• Model the generation of biterms with latent topic structure <br />

– a topic ~ a probability distribution over words <br />

– a corpus ~ a mixture of topics <br />

– a biterm ~ two i.i.d sample drawn from one topic

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!