16.01.2013 Views

Microsoft Sharepoint Products and Technologies Resource Kit eBook

Microsoft Sharepoint Products and Technologies Resource Kit eBook

Microsoft Sharepoint Products and Technologies Resource Kit eBook

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 22: Managing External Content in <strong>Microsoft</strong> Office SharePoint Portal Server 2003 617<br />

click Allow alerts on individual SharePoint list items, alerts for the list<br />

will be sent if any item in the list is changed.<br />

■ To crawl each SharePoint list item individually, click Index SharePoint<br />

list items individually.<br />

5. In the Specify Authentication section, do one of the following:<br />

■ To use the default content access account, click Use default crawling<br />

account to log in. If the default crawling account cannot access this<br />

path, enter a user name <strong>and</strong> password combination that can crawl the<br />

content source.<br />

■ To prevent Basic authentication from being used, select the Prohibit the<br />

server from passing plain text passwords check box. The server will<br />

attempt to use NTLM authentication.<br />

6. Finally, to use a client certificate for authentication, click Specify client certificate,<br />

<strong>and</strong> then select a certificate from the list.<br />

Working with Content Indexes<br />

A content index is a full-text index of content stored on a SharePoint Portal Server<br />

2003 portal site. These content indexes do not include the indexes from the<br />

<strong>Microsoft</strong> SQL Server Full Text Index engine that can be used in each Windows<br />

SharePoint Services site. The content indexes are populated by the Indexing Service<br />

as it receives crawled content from the Gatherer Service.<br />

Content indexes do not have a hard-coded limit. Successful testing has reached<br />

5 million documents indexed to a single index file.<br />

Creating a Content Index<br />

The reasons for creating additional index files are a bit complex. You’ll need to bear<br />

in mind a couple of competing issues when you decide to create a new index file.<br />

First, bear in mind that if a user’s search spans multiple index files, this will<br />

increase the load on your server when the query is executed <strong>and</strong> it will also take<br />

more time to search each index file <strong>and</strong> then compile a single result set.<br />

Second, ranking is performed within each index file but is not performed again<br />

in the aggregate result set. So, if a search query pulls records from multiple index<br />

files, there will be no overall ranking of the result set. Therefore, the best practice<br />

here is to try, if you can, to limit the number of index files that you create. However,<br />

if you find yourself creating a plethora of index files, group the content sources<br />

together in such a manner so as to limit the number of times most search queries<br />

will need to traverse multiple index files.<br />

Third, if an index file should become corrupt (which can occur if your server<br />

suddenly loses power or if there is a read/write failure in your hardware), bear in

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!