16.01.2013 Views

Microsoft Sharepoint Products and Technologies Resource Kit eBook

Microsoft Sharepoint Products and Technologies Resource Kit eBook

Microsoft Sharepoint Products and Technologies Resource Kit eBook

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 22: Managing External Content in <strong>Microsoft</strong> Office SharePoint Portal Server 2003 615<br />

2. Perform one of the following steps to confirm or specify updates <strong>and</strong> rules for<br />

Web content as a content source:<br />

■ Click OK.<br />

■ Click Advanced. Specify rules to include or exclude content, specify<br />

scheduled updates, or start an update on the content_source_type content<br />

source page.<br />

Managing Rules for Including or Excluding Content<br />

You can create rules that include or exclude content from the content index. These<br />

rules are called site restrictions <strong>and</strong> site path rules. A site restriction rule is the main<br />

rule for a site. You can show or hide the other rules for a site by clicking the plus<br />

sign (+) or minus sign (-) next to the site restriction. The other rules for a site are<br />

called site path rules. The site restriction defines the overall rules for a site, <strong>and</strong> the<br />

site path rules are rules for specific parts of the site. The Site Path rules are nested<br />

inside the Site Restrictions rule.<br />

Site Path Rules are evaluated in the order they appear in the list. If there is a<br />

site rule match, all the path rules are evaluated <strong>and</strong> applied in the order they appear.<br />

You can use site restrictions <strong>and</strong> site path rules to accomplish the following<br />

tasks:<br />

■ Override the settings for the default content access account when crawling a<br />

specific site or path<br />

■ Specify the granularity for crawling lists<br />

■ Allow crawling of sites where addresses pass parameters—for example, the<br />

address includes a question mark (?)<br />

■ Allow sites to be traversed for links without content being added to the index<br />

■ Exclude an area from the index completely<br />

Including or excluding content is a best practice for tuning SharePoint Portal<br />

Server search capabilities, <strong>and</strong> it’s a good tool to offer the best possible search<br />

results that meet your business requirements.<br />

Rules can use general expressions <strong>and</strong> wild cards, as shown in the following<br />

examples:<br />

■ “http://woodgrovebank/folder*” applies to all Web resources that have a URL<br />

that starts with “http://woodgrovebank/folder”<br />

■ “http://server?web*” applies to resources such as “http://serveraweb2/file.htm”<br />

<strong>and</strong> “http://serverbweb3/file.htm”<br />

■ “*/*.doc” applies to every <strong>Microsoft</strong> Office Word document encountered

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!