18.01.2015 Views

PPT - Key Laboratory of Data Engineering and Knowledge Engineering, Ministry of Education (数据工程与知识工程教育部重点实验室)



Results

• Datasets
  – 20-newsgroup
    • 18,846 documents
    • 26,214 distinct words
    • 20 related categories
• Baseline methods
  – Probabilistic model: LDA
  – Non-probabilistic model: NMF
  – Sparse topic model: STC

Time Efficiency

Topic Sparsity

Classification Accuracy
