02.11.2014 Views

untangling_the_web

untangling_the_web

untangling_the_web

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

DID: 4046925<br />

UNCLASSIFIEDNFOR OFFIOIAL l:ISE Ot.LY<br />

Yahoo Hacks<br />

While Google hacks-tips, tricks, techniques, and scripts that make Google more<br />

powerful and useful-are plentiful and fairly well documented, <strong>the</strong> same cannot be<br />

said (yet) for Yahoo Hacks, despite <strong>the</strong> fact that O'Reilly published a Yahoo Hacks<br />

book in late 2005. Part of <strong>the</strong> reason for this was <strong>the</strong> absence of Yahoo APls, a<br />

problem Yahoo recognized and rectified with its Developer site.<br />

Yahoo Developer Network<br />

Yahoo Developer Network Blog<br />

http://developer.yahoo.net/<br />

http://developer.yahoo.netlblog/<br />

While many of <strong>the</strong> hacks, mostly employing some form of API, are geared toward<br />

maps, Yahoo launched a <strong>web</strong>page devoted exclusively to Yahoo and "mixed" API<br />

applications.<br />

Yahoo Search Application Gallery<br />

http://developer.yahoo.netlsearch/applications.html<br />

I recommend you pay special attention to <strong>the</strong> following applications that use Yahoo<br />

APls, although you may find o<strong>the</strong>rs even more useful to you:<br />

Link Harvester<br />

http://www.linkhounds.com/link-harvester/<br />

This is a very powerful-but very slow-tool for examining links to a domain or a<br />

specific urI. The example below shows <strong>the</strong> links to [www.mfa.gov.cn]. Link Harvester<br />

does <strong>the</strong> following:<br />

• quickly finds almost every single site linking into a domain or page.<br />

• scrapes past <strong>the</strong> 1,000 search result limit by making domain filtering a<br />

snap.<br />

~ grabs number of pages indexed.<br />

• grabs links to any page.<br />

• grabs total inbound links, home page links, and deep link ratio.<br />

• tool is fast and free. which is great considering all it does.<br />

• grabs C block IP address information.<br />

• tool provides links to Wayback Machine and Whois Source (now Domain<br />

Tools) next to each domain.<br />

• free & open source<br />

UNCLASSIFIEDNFOR OFFIOIAL USE ONLY 113

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!