02.11.2014 Views

untangling_the_web

untangling_the_web

untangling_the_web

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

DOCID: 4046925<br />

UNCLASSIFIEDOFOR OFFl6IAL b1S~ QNboY<br />

~ [originurlextension:pdf "white paper"] finds pages indexed by Yahoo that are<br />

in PDF format and contain <strong>the</strong> phrase ["white paper"] anywhere in <strong>the</strong> text,<br />

title, or urI. .<br />

To search by specific type of file, use <strong>the</strong> syntax originurlextension: plus one of<br />

<strong>the</strong>se or any file extension, such as cgi, log, zip, etc. Because this workaround<br />

is not a true filetype search, you can search on any file extension.<br />

o htm or html-standard <strong>web</strong> page<br />

o pdf -Adobe Acrobat<br />

o xls-MS Excel<br />

o ppt-MS PowerPoint<br />

o doc-MS Word<br />

o txt-text<br />

o xml, rdf, rss-RSS or XML feeds 52<br />

Search roller. Searchroller uses a JavaScript to let you create a neat little search<br />

query bookrnarklet'" for your future use. The bookmarklet comprises a set of<br />

domains you like to search on routinely but don't want to type in each time. For<br />

example, perhaps you'd like to search simultaneously on a whole group of news<br />

sites. Tara Calishain's script lets you input <strong>the</strong> uris for <strong>the</strong> news' sites once, <strong>the</strong>n<br />

save <strong>the</strong>m to your Favorites or Bookmarks. Each time you click on <strong>the</strong> bookmarklet,<br />

a screen will appear asking you to enter a query term or terms, <strong>the</strong>n <strong>the</strong> bookmarklet<br />

will automatically go to Yahoo and run that query against all <strong>the</strong> urls you have<br />

previously selected. It's a great timesaver when you consider this is a typical<br />

Searchroller bookmarklet query, although it could be much longer:<br />

[iraq (site:cnn.com OR site:msnbc.com OR site:usatoday.com] OR<br />

[site:nytimes.com OR site:washingtonpost.com OR site:bbc.co.uk )]<br />

Search roller<br />

http://www.researchbuzz.org/2aa4/1a/new yahoo<br />

hack searchroller fO.shtml<br />

Artificial Proximity Search. Since Yahoo's APls are so new and as yet not fully<br />

exploited, clever folks like Tara Calishain have come up with ways to force Yahoo to<br />

perform new types of searches. The proximity search lets you input one search term<br />

and look for it from 1 to 5 "spaces" (really, words) from a second search term. For<br />

example, I can search for henry within two words of thoreau and find many instances<br />

52 In order to read RSS or XML feeds, you need a reader or aggregator to parse this type of data.<br />

53 A bookmarklet is a tiny JavaScript application contained in a bookmark that can be saved and used<br />

<strong>the</strong> same way you use normal bookmarks. Bookmarklets do not require users to download and install<br />

software. For more on bookmarklets, visit .<br />

116 UNCLASSIFIEDHFOR OFFICIAL tJSE OI~L f

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!