02.11.2014 Views

untangling_the_web

untangling_the_web

untangling_the_web

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

DID: 4046925<br />

UNCLASSIFIEOMj;QR Qj;j;IGIAL l:JSE or.LY<br />

pages may be any number of things, including pages with robots.txt command or<br />

!.§.g. Unindexed pages are identifiable by what <strong>the</strong>y lack: no summary, no page size,<br />

and no cached copy.<br />

WI/WII. statvcuedufrobots.Dd<br />

§i!:r.'L§!J~~9!!~<br />

javangeli st.sni psnap orgfspacefSnipSnapfconfi gfrob...<br />

.~jLr.~U:§}:...r:..9..9.?.9.<br />

WI/WII.atmoswashi ngton .edufrobots .Dd<br />

~jJ.t.!.:.§.r .. E.~{t.~.~<br />

Google Orphans-no cached<br />

copy, no summary, no page size<br />

fichier indiquant auxrobots les endroltsinterdits # # voir http 'n - I T:~nsl"te this 03q8 ]<br />

fichier indiquant aux robots les endroits interdits # # voir<br />

http://info. <strong>web</strong>crawler. com/mak/projectslrobots/norobots.html User-agent: • Disallow: n.<br />

www.ann.jussieu.fr/robots.txt - 1k - Ci

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!