13.07.2015 Views

Microsoft SharePoint. Building Office 2007 Solutions in VB 2005 ...

Microsoft SharePoint. Building Office 2007 Solutions in VB 2005 ...

Microsoft SharePoint. Building Office 2007 Solutions in VB 2005 ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

106CHAPTER 4 ■ SHAREPOINT SHARED SERVICESthe account must have access to all of the content to be <strong>in</strong>dexed. If you set up the developmentenvironment <strong>in</strong> Chapter 2, you should use the DOMAIN\SPCrawlAcct account youcreated dur<strong>in</strong>g the MOSS <strong>in</strong>stallation.After the content access account is set, you’ll want to def<strong>in</strong>e the content sources to be<strong>in</strong>dexed. By default, MOSS creates a content source for <strong>SharePo<strong>in</strong>t</strong> sites. If you only <strong>in</strong>tend tosearch <strong>SharePo<strong>in</strong>t</strong> content, this source may be all you need. However, you can def<strong>in</strong>e othercontent sources <strong>in</strong>clud<strong>in</strong>g web sites, file shares, Exchange public folders, and bus<strong>in</strong>ess datafrom the BDC.Along with the content source, you can also def<strong>in</strong>e crawl rules. Crawl rules allow you tospecify what content is <strong>in</strong>cluded or excluded from a source. You can also set a special accountto use when crawl<strong>in</strong>g the source if the default account does not have access for some reason.If the content you crawl has a different address than you want to appear <strong>in</strong> the searchresults, you can create a server name mapp<strong>in</strong>g. This is useful if, for example, you crawl contentus<strong>in</strong>g an address <strong>in</strong>side the firewall but want to make the results available outside the firewall.Simply enter the address of the crawled content and the mapped address for the search results.After you have configured the content sources, you need to specify a schedule for thecrawl. You won’t get any results back from the Search Service until a full crawl of the contentsources has been completed. Because a full crawl is resource <strong>in</strong>tensive, you should try toschedule it for off-hours.Follow these steps to set up a crawl schedule:1. From the Configure Search sett<strong>in</strong>gs page, click Content Sources and Crawl Schedules.2. On the Manage Content Sources page, hover over the Local <strong>Office</strong> <strong>SharePo<strong>in</strong>t</strong> ServerSites content source and select Edit from the drop-down menu.3. On the Edit Content Source page, click the Create Schedule under the Full CrawlSchedule list.4. In the Manage Schedules dialog, accept the default sett<strong>in</strong>gs by simply click<strong>in</strong>g the OKbutton.5. Check the box labeled Start Full Crawl of This Content Source.6. Click the OK button.Includ<strong>in</strong>g File TypesWhen the Search Service creates a content <strong>in</strong>dex, it does not <strong>in</strong>clude all of the file types itencounters. This is because many file types might not make any sense <strong>in</strong> the search results.For example, EXE files are not <strong>in</strong>cluded <strong>in</strong> the content <strong>in</strong>dex because they are not documentsand might even conta<strong>in</strong> a virus. While the out-of-the-box sett<strong>in</strong>gs <strong>in</strong>dex most of the commonlyused file types, not every file type you may want is <strong>in</strong>cluded. In particular, most organizationswant to <strong>in</strong>clude Adobe Acrobat portable document format (PDF) files <strong>in</strong> the searchresults, but they are not <strong>in</strong>cluded by default.In order to <strong>in</strong>clude other file types, you must first <strong>in</strong>stall the appropriate IFilter. IFiltersare used to build <strong>in</strong>dexes for specific document formats. Each file type that you want to<strong>in</strong>clude must have an <strong>in</strong>stalled IFilter before it can be part of the <strong>in</strong>dex. After you <strong>in</strong>stall theIFilter, you must then provide an image to represent the file with<strong>in</strong> <strong>SharePo<strong>in</strong>t</strong> and specificallytell the <strong>in</strong>dex to <strong>in</strong>clude the file type.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!