Information Extraction and Retrieval - IIIT Hyderabad
Classic IR Models - Basic Concepts
• Not all terms are equally useful for representing the
document contents: less frequent terms identify
a narrower set of documents
• The importance of each index term is represented by
a weight associated with it
• Let
– k_i be an index term
– d_j be a document
– w_ij be a weight associated with the pair (k_i, d_j)
• The weight w_ij quantifies the importance of the index
term k_i for describing the contents of document d_j
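
The slide does not say how w_ij is computed. As a minimal sketch, the Python below assumes one common choice, term frequency times inverse document frequency (tf-idf), so that terms occurring in fewer documents receive larger weights. The function name term_weights, the whitespace tokenizer, and the sample documents are illustrative assumptions, not part of the original material.

```python
import math
from collections import Counter

def term_weights(docs):
    """Return {(k_i, j): w_ij} for every term k_i occurring in document j.

    Assumed scheme (not from the slide): w_ij = tf_ij * log(N / df_i),
    where tf_ij is the count of k_i in d_j, N is the number of documents,
    and df_i is the number of documents containing k_i.
    """
    n_docs = len(docs)
    # Naive tokenizer: lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in docs]
    # Document frequency: count each term once per document.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = {}
    for j, tokens in enumerate(tokenized):
        tf = Counter(tokens)
        for term, count in tf.items():
            idf = math.log(n_docs / df[term])
            weights[(term, j)] = count * idf
    return weights

docs = [
    "information retrieval models",
    "information extraction systems",
    "classic retrieval models rank documents",
]
w = term_weights(docs)

# "extraction" occurs in one document, "information" in two,
# so the rarer term gets the larger weight in document 1.
print(w[("extraction", 1)] > w[("information", 1)])
```

Pairs (k_i, d_j) where the term does not occur in the document are simply absent from the dictionary, which corresponds to w_ij = 0. Under this assumed scheme a term that appears in every document also gets weight 0, since it does not help distinguish one document from another.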