02.11.2014 Views

untangling_the_web

untangling_the_web

untangling_the_web

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

DOClD: 4046925<br />

UNCLASSIFIEDNFOR OFFl61AL USE m.LY<br />

This will show <strong>the</strong> cached version of <strong>the</strong> page that contains only text:<br />

•<br />

.! ' _I . I ! __ . I " r I ' _ ' _ _ _ _ __<br />

This is G 0 'I 9 I e's tsxt-only cache of hllp:flwww.korea·dpr.com/as retrieved on Oct 25 ,20 05 04:54:39 GMT.<br />

-:=; .~ ,) 'I I e's cache is <strong>the</strong> snapshot thaI we took of <strong>the</strong> age as we crawled <strong>the</strong> <strong>web</strong>.<br />

i'i,. ~ .Y. may have changed since that time. Click <strong>the</strong> current page without highlighting.<br />

Ci:d , here for <strong>the</strong> full o: a c h ~ d page with images inc<br />

To ::nk t~ IJI ~.J,) k m~r h this page, use <strong>the</strong> following ur<br />

http: }I_.9oo9'le . e o,,", , e. r c h~ q :: c.J. ch e ; OT-.J_ rD 'YbollwJ: ,1- d p r . coJrl!+dprk.lhl =enJ:~tri p =1<br />

_. __ ._.- - - ." ,.. _ '_ ' _" ' -" " " ' - - - -<br />

n,"",,,,,,, ,r.r.I"""" have been highlighted: ~Ii;'r.~j<br />

Googk-is nn:1hn q(!iliaM<br />

Getting around <strong>the</strong> 32-word limit. For years Google had a 10-word limit for search<br />

queries, meaning that anything more than that, and Google would drop those terms<br />

from your query. However, Google expanded <strong>the</strong> number of terms searched to a 32­<br />

word limit. While <strong>the</strong> casual Google searcher will probably never notice <strong>the</strong><br />

difference, professional researchers certainly will. There are many times when<br />

researchers need to search for long phrases (error codes, for example), exclude<br />

large numbers of terms to avoid unwanted results, run complex Google API<br />

searches, run queries of multiple sites, etc., and that darned 10-word interfered with<br />

<strong>the</strong> search. While <strong>the</strong>re are a number of work-arounds all were unsatisfactory.<br />

Allowing more search terms is a big improvement, but I am sorry to report that <strong>the</strong><br />

new 32-word limit only applies at present to main Google search, Google Images,<br />

Froogle and <strong>the</strong> Google Web API, while <strong>the</strong> 10-word limit is still in effect for Google<br />

Groups and Google News. This is especially disappointing vis-a-vis Google Groups<br />

because it has long been one of <strong>the</strong> best sources of information about complicated<br />

computer error codes and o<strong>the</strong>r computer arcana. Perhaps <strong>the</strong> folks at Google will<br />

see fit to expand <strong>the</strong> 32-word limit to include Google Groups,<br />

You can, however, still use <strong>the</strong> wildcard to trick Google Groups into searching<br />

more than 10 keywords. Google will riot count wildcards as search terms, so<br />

inserting a wildcard into a phrase will let you search for more than 10 terms. I have<br />

found this most useful when searching for a long phrase such as a computer error<br />

message, which may frequently run well over 10 words. By simply removing <strong>the</strong><br />

"little words" such as an, you can easily search for <strong>the</strong> entire error message.<br />

Here's an example of an error message containing more than 10 terms:<br />

Windows Socket Error: An Invalid Argument was supplied (10022), on API 'connect'<br />

84 UNCLASSIFIEDHFeR OFFl61J!cL tJS! Ol'4Lt

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!