02.11.2014 Views

untangling_the_web


DOCID: 4046925

UNCLASSIFIED//FOR OFFICIAL USE ONLY

• Public information on the deep web is at least 400 to 550 times larger than the commonly defined World Wide Web.
• The deep web contains 7,500 terabytes of information compared to nineteen terabytes of information in the surface web.
• The deep web contains nearly 550 billion individual documents compared to the one billion of the surface web.
• More than 200,000 deep web sites presently exist.
• Sixty of the largest deep web sites collectively contain about 750 terabytes of information, sufficient by themselves to exceed the size of the surface web forty times.
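The figures in this list can be cross-checked against one another with simple division. The following is a minimal sketch in Python; the variable names are illustrative and not part of the original text, and the inputs are only the numbers quoted above.

```python
# Figures quoted in the list above (variable names are illustrative).
deep_web_tb = 7_500               # terabytes in the deep web
surface_web_tb = 19               # terabytes in the surface web
deep_web_docs = 550_000_000_000   # documents in the deep web
surface_web_docs = 1_000_000_000  # documents in the surface web
top_sixty_tb = 750                # terabytes held by the sixty largest deep web sites

# Ratio of deep web to surface web by storage size.
size_ratio = deep_web_tb / surface_web_tb

# Ratio of deep web to surface web by document count.
doc_ratio = deep_web_docs / surface_web_docs

# How many times the sixty largest sites exceed the surface web in size.
top_sixty_ratio = top_sixty_tb / surface_web_tb

print(f"Size ratio:      {size_ratio:.1f}")
print(f"Document ratio:  {doc_ratio:.1f}")
print(f"Top-sixty ratio: {top_sixty_ratio:.1f}")
```

By storage size the ratio comes to roughly 395, just under the low end of the "400 to 550" range, while the document count gives 550, the high end. The sixty largest sites come to about 39.5 times the surface web, which matches the rounded "forty times."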

Therefore, it is vital to maintain a good set of bookmarks for a wide variety of research tools beyond search engines. Specialized search tools, such as database finders, email lookup tools, and online telephone and fax directories, are good first additions to a robust set of research tools.

If you don't want to "be found," never post to Usenet newsgroups. Once you do, expect to be spammed and to appear in directories, such as email lookup databases. Your only real solution at this point is to get a new Internet account.
