
untangling_the_web


DID: 4046925

UNCLASSIFIED//FOR OFFICIAL USE ONLY

directories, Whois databases, NIH PubMed, SEC EDGAR, Amazon's "Search Inside the Book" feature, digital library collections. The Domain Name System/Service (DNS) itself is the largest distributed database ever created and is freely accessible to any user via a simple query (NSLookup). Also, don't overlook mailing lists, newsgroups, and other non-web segments of the Internet.
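As a rough illustration of the "simple query" mentioned above, the sketch below asks the system resolver for the addresses behind a hostname, much as NSLookup does. The function name `lookup_addresses` and its injectable `resolver` parameter are assumptions made for this example; they do not come from the original text.

```python
import socket


def lookup_addresses(hostname, resolver=socket.getaddrinfo):
    """Return the sorted, de-duplicated IP addresses for a hostname.

    `resolver` defaults to the standard library's socket.getaddrinfo. It is
    a parameter only so the function can be tried without network access.
    """
    records = resolver(hostname, None)
    # Each record is (family, type, proto, canonname, sockaddr);
    # the address string is the first element of sockaddr.
    return sorted({record[4][0] for record in records})
```

Calling `lookup_addresses("www.ftc.gov")` performs a live DNS query through the operating system's resolver; the addresses returned will vary by time and network.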

Tip 11: Configure and Use Two Browsers

If you spend much time on the Internet, especially viewing non-US sites, you will probably encounter certain webpages that will not display at all, will not display properly, and/or will not print in the browser you're using. However, if you open the page in the other browser, it may be fine. So don't despair if a page isn't displaying or printing as it should. Chances are there are simply problems with the way the page was created and it will look fine in the other browser.

Tip 12: Try URL Guessing

It works more frequently than you would imagine. For example, I found the Iranian Ministry of Foreign Affairs by guessing www.mfa.gov.ir. And guess what the address for the Russian Ministry of Internal Affairs (MVD) is? Yes, it's www.mvd.ru. No search engine indexed either of these sites at the time I first found them.
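The guessing described above can be made a little more systematic. The following is a minimal sketch that combines an organization's abbreviation with a country code to produce candidate hostnames to try by hand. The function `guess_urls` and the default list of second-level labels are illustrative assumptions, not a method given in the text.

```python
def guess_urls(abbreviation, country_code, second_levels=("gov", "")):
    """Build candidate hostnames such as www.mfa.gov.ir or www.mvd.ru.

    `second_levels` is a hypothetical list of labels to try between the
    abbreviation and the country code; "" means no such label.
    """
    abbreviation = abbreviation.strip().lower()
    country_code = country_code.strip().lower().lstrip(".")
    candidates = []
    for second_level in second_levels:
        labels = ["www", abbreviation, second_level, country_code]
        # Skip empty labels so the short form has no doubled dot.
        candidates.append(".".join(label for label in labels if label))
    return candidates
```

For instance, `guess_urls("mfa", "ir")` yields both `www.mfa.gov.ir` and `www.mfa.ir`. Whether any candidate actually resolves still has to be checked in a browser.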

Tip 13: Change URLs to Find "Hidden" Webpages

Sometimes a simple change inside a long URL will disclose interesting pages deep within a website. For example, look at these two pages from the Federal Trade Commission:

http://www.ftc.gov/opa/2006/02/

http://www.ftc.gov/opa/2007/02/

Simply by changing the portion of the URL that indicates year and month, you can view the FTC News Releases for a specific date. This is a simple example of a technique that can be used to uncover "hidden" webpages. It's especially useful on sites that update on a regular schedule, e.g., sites for press releases or news.
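When a site follows a year-and-month pattern like the FTC addresses above, the variations can be generated rather than typed. This is a sketch under the assumption that the path has the form `/opa/<year>/<two-digit month>/`. The names `FTC_TEMPLATE` and `monthly_urls` are invented for the example, and the pages themselves may no longer exist at these addresses.

```python
# Hypothetical template modeled on the FTC addresses shown in the text.
FTC_TEMPLATE = "http://www.ftc.gov/opa/{year:04d}/{month:02d}/"


def monthly_urls(template, year, months=range(1, 13)):
    """Return one URL per month by filling the year and month fields."""
    return [template.format(year=year, month=month) for month in months]
```

`monthly_urls(FTC_TEMPLATE, 2006)` produces twelve candidate addresses, one per month of that year. The same function works for any site whose URLs can be written as a template with `year` and `month` fields.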

Tip 14: Be on the Lookout for URL Errors

Not surprisingly, many URLs listed on webpages are incorrect. Among the most common mistakes are misspellings, putting a backslash (\) where a slash (/) should be, and including or excluding the L in HTML, e.g.:

http://www.examlpe.com/pathname\bigmistake.html
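Two of the mistakes listed above, the backslash and the stray or missing L in HTML, are mechanical enough to correct automatically. The sketch below proposes corrected variants of a suspect address; `repair_candidates` is a name made up for this illustration, and it makes no attempt to fix misspellings, which still need a human eye.

```python
def repair_candidates(url):
    """Return plausible corrections for a mistyped URL, original excluded."""
    # Backslashes are never path separators in a web address.
    fixed = url.replace("\\", "/")
    candidates = [fixed]
    # Try the other common file extension as well.
    if fixed.endswith(".html"):
        candidates.append(fixed[:-1])  # .html -> .htm
    elif fixed.endswith(".htm"):
        candidates.append(fixed + "l")  # .htm -> .html
    # Keep the order, drop duplicates and the unchanged input.
    result = []
    for candidate in candidates:
        if candidate != url and candidate not in result:
            result.append(candidate)
    return result
```

Each returned address is only a guess to try in turn; an empty list means none of these mechanical fixes applied.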
