Thursday, July 31, 2008

Maximum File Size for Crawling

ERROR:
The file reached the maximum download limit. Check that the full text of the document can be meaningfully crawled

Reason:
By default, Sharepoint Search Services can crawl and filter a file with a size of up to 16 megabytes (MB). It will always crawl the first 16MB of a file. After this limit is reached, SharePoint Portal Server enters a warning in the gatherer log “The file reached the maximum download limit. Check that the full text of the document can be meaningfully crawled.”

Resolution:
Increase the default limit from 16 MB, you must add in the registry new entry MaxDownloadSize.

Steps:
1. Start Registry Editor (Regedit.exe).

2. Locate the following key in the registry:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Global\Gathering Manager

3. Open Edit - New - DWORD Value. Name it MaxDownloadSize.

4. Double-click, change the value to Decimal, and type the maximum size (in MB) for files that the gatherer downloads.

5. Restart the server.

6. Start Full Crawl.

0 comments: