Information Retrieval Techniques for non-textual media - Prof. A ...
Information Retrieval Techniques for non-textual media - Prof. A ...
Information Retrieval Techniques for non-textual media - Prof. A ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Web content<br />
• The <strong>media</strong>n size of HTM/HTML pages was 8 KB, but the<br />
mean was 605 KB.<br />
• About 23% included images and<br />
• 4% contained movies or animations, and about 20%<br />
contained Javascript applications.<br />
• There are about 2.9 million active weblogs ('blogs'),<br />
containing about 81 GB of in<strong>for</strong>mation.<br />
• About 62 billion emails are sent daily, on the Internet<br />
and elsewhere<br />
• The average email is about 59 kilobytes in size, thus<br />
the annual flow of emails worldwide is 667,585<br />
terabytes.