18.01.2015 Views

PPT - 数据工程与知识工程教育部重点实验室

PPT - 数据工程与知识工程教育部重点实验室

PPT - 数据工程与知识工程教育部重点实验室

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Short texts are prevalent <br />

Uncovering the topics of short texts is crucial for a<br />

wide range of content analysis tasks <br />

Data Source Average Word Count<br />

(removing stop words)<br />

Weibo Sina weibo ~9<br />

Questions Baidu Zhidao ~6<br />

Web page titles Logs ~5<br />

Query Query log ~3

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!