02.11.2014 Views

untangling_the_web

untangling_the_web

untangling_the_web

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

DID: 4046925<br />

UNCLASSIFIEDNFOR OFFlOhlcL ~SE ONLY<br />

tags in <strong>the</strong> results list. Gigablast claims to be indexing all "generic" meta tags.<br />

In addition, it can display <strong>the</strong> meta tags in <strong>the</strong> results list. Doing this requires<br />

adding commands to <strong>the</strong> URL of <strong>the</strong> results list. At <strong>the</strong> end of <strong>the</strong> uri, add a<br />

&dt= followed by <strong>the</strong> word(s) for <strong>the</strong> meta tags, followed by a colon, and <strong>the</strong>n a<br />

number to represent how many characters from each meta tag should be<br />

displayed. So, for example, adding &dt=keywords+author+generator+description:30<br />

will display <strong>the</strong> meta tag content for meta keywords, meta author, meta<br />

generator, and meta description tags for any records retrieved. Use a +<br />

between meta tag words. It seems that this "generic" meta tag approach<br />

excludes more complex meta tags like Dublin Core, which use a syntax like<br />

DC.Creator. The dot syntax will not work for <strong>the</strong> display command, although<br />

Gigablast does index some of <strong>the</strong> content of <strong>the</strong>se tags."S8<br />

Sample Output of Meta Tag Search<br />

Reload IG http://www.gigablast.com/search'klZ=134827&q=dublin+core&dt=kc: I ~<br />

add string to <strong>the</strong> end of<br />

resulting uri in address<br />

DC-dot<br />

..DC-dot now conforms with <strong>the</strong> Expressing Dublin Core in HTMUXHTML meta and I<br />

..Now you can click on <strong>the</strong> DC-dot button, wherever you are, to create [lublin COle m<br />

about. ..This sel'lice will retrieve a Web page and automatically generate [lublinCOI!r-----------------'<br />

metadata, ei<strong>the</strong>r as ..<br />

D~"cnt:'ti-Jr,. Give DC-dot a URL and see <strong>the</strong> Dublin Core it generates.<br />

keywol ,Is: DUblin Core; DC; generator; editor; VVar,'/ickFramework; SOIF; TEl; USMARC; XML; OILS; ROADS; RDF; IMS<br />

genelatnr: HTML Tidy, see IN,.'""v.w3 org<br />

;iegmv. Reference: Lihralles: I IhralY and Information Science: Technicai Selvices: CatalogulnQ: r,.'letadata· Dul1lin COle<br />

vWoNfl.lkoln.ac ufJmei"dataiLil'cloU - 8.8k -Iarchived copyl-~ - [older copies]- indexed: Oct 05 2005 - modified: Dec 11 2001<br />

Dublin Core Metadata Template<br />

..When <strong>the</strong> list of Qualifiers for Dublin Core elements Is finally decided upon, this template<br />

will. .. .You may include my name and email-address in a list of those using Dublin Core.<br />

Additional DC....Dublin Core Metadata Template.. This service is provided by <strong>the</strong> "Nordic<br />

Metadata Projsct' in..<br />

D8Se.r1r;tiOT from <strong>the</strong> Nordic Metadata Project<br />

Cal8qlj"l Reference' Libraries' Librari and Information Science: Technical Services: CataloguinQ: MetafJat,,: Dublin COle<br />

'NoNoNlub.lu.selcgi-bininmdc.pl- 40.5k- (archived copy] - [stripped]- [older copies]- indexed: Oct 05 2005<br />

Dublin CorefMARC/GILS Crosswalk<br />

..For conversion of MARC 21 into Dublin Core, many fields may be mapped into a single<br />

Dublin Core ... In <strong>the</strong> Dublin COle to MARC mapping, two mappings are provided,<br />

one for unqualified Dnblin COle... .The following is a crosswaik between <strong>the</strong> fifteen elements<br />

in <strong>the</strong> Dublin Core Element Set and MARC..<br />

Di'2,uif;t,or,. Librarj of Congress<br />

keywOl lis: MARC Dublin Core GILS crosswaik<br />

author: LiiJraly of Congress ['Ietwork: ueveropment ano MARC Standards Office<br />

,lescliIHioll: Crosswalk from Dublin Core<br />

(,aleg.Y'I. Reference: Libralles: LibralY and Information Science: TecYlllical Selvices: CataloquinQ: Metada!a: Crosswalks<br />

ic<strong>web</strong>iot'gov;malcidccross.lltml- 18.6k - ["cllived copy] - [strippedj- [oldel copiesl- indexed: Oct 062005 - modified: Dec 31 2002<br />

~ clearly displays date <strong>web</strong>page was indexed and, in some cases, modified<br />

~ search query spellchecker (Did you mean? option)<br />

58 Greg R. Notess, "Review of Gigablast," Searchengineshowdown, 17 September 2006,<br />

http://www.searchengineshowdown.com/features/qigablasUreview.html> (14 November 2006).<br />

UNCLASSIFIEDHFOR OFFICIAL tJSE ONLY 143

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!