Error Fetching URL when Indexing PDF Files (188957)
The information in this article applies to:
- Microsoft Search, when used with:
- Microsoft Site Server 3.0
This article was previously published under Q188957
This problem occurs when you use Microsoft Site Server version 3.0 with the
Adobe PDF IFilter version 1.1 (beta) or earlier.
The third-party products discussed here are manufactured by vendors
independent of Microsoft; we make no warranty, implied or otherwise,
regarding these products' performance or reliability.
REFERENCES
Microsoft Site Server Search documentation.
SYMPTOMS
When you are indexing .pdf files with Site Server Search, the gatherer log
may report the following error message for one or more files:
Error fetching URL.
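To see which files are affected, the log can be scanned for this message. The following Python fragment is a minimal sketch, not a documented tool. It assumes a plain-text log in which the URL and the message appear on the same line. The helper name failed_pdf_urls, the sample server name, and the tab-separated sample lines are hypothetical, so adjust the pattern to the actual layout of your gatherer log.

```python
import re

# Assumption: the URL and the error text share one log line.
ERROR_TEXT = "error fetching url"
URL_PATTERN = re.compile(r"https?://\S+?\.pdf\b", re.IGNORECASE)


def failed_pdf_urls(log_lines):
    """Return PDF URLs from lines that mention the fetch error, in order, without duplicates."""
    found = []
    for line in log_lines:
        if ERROR_TEXT not in line.lower():
            continue  # not the error this article describes
        for url in URL_PATTERN.findall(line):
            if url not in found:
                found.append(url)
    return found


# Hypothetical sample lines; a real gatherer log may be laid out differently.
sample = [
    "http://server/pdfs/a.pdf\tError fetching URL.",
    "http://server/pdfs/index.htm\tSuccess",
    "http://server/pdfs/b.PDF\tError fetching URL.",
]
print(failed_pdf_urls(sample))
```

If only some of the PDF files appear in the result, that is consistent with an intermittent concurrency problem rather than damaged files.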
CAUSE
The IFilter provided by Adobe can be accessed by only one thread at a time.
By default, Site Server Search requests and processes multiple documents
simultaneously. This can result in errors if the installed Portable Document
Format (PDF) filter is version 1.1 (beta) or earlier.
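The effect of concurrent requests on a single-threaded component can be illustrated with a small simulation. The Python sketch below does not call the Adobe IFilter or any Site Server API. SingleThreadedFilter and crawl are hypothetical stand-ins that only count how often two threads are inside the filter at the same moment.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class SingleThreadedFilter:
    """Stand-in for a component that must not be entered by two threads at once."""

    def __init__(self):
        self._guard = threading.Lock()  # protects the counters only
        self._active = 0
        self.overlaps = 0

    def extract(self, name):
        with self._guard:
            self._active += 1
            if self._active > 1:
                self.overlaps += 1  # another thread is already inside
        time.sleep(0.05)  # simulate the time spent filtering one document
        with self._guard:
            self._active -= 1
        return name


def crawl(documents, workers):
    """Filter all documents with the given number of worker threads; return the overlap count."""
    flt = SingleThreadedFilter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        list(pool.map(flt.extract, documents))
    return flt.overlaps


docs = ["doc%d.pdf" % i for i in range(8)]
print("overlaps with 4 workers:", crawl(docs, 4))
print("overlaps with 1 worker:", crawl(docs, 1))
```

With one worker, the overlap count is always zero because each document is finished before the next one starts. With several workers, overlaps occur, and each overlap is a point at which a filter that is not thread-safe could fail.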
WORKAROUND
To work around this problem, configure Site Server Search to request a
single document at a time. This slows down the indexing of the site, but it
limits access to the IFilter to a single thread at a time. To do this,
follow these steps:
- Create a virtual directory and place your PDF files in it. Be sure to
allow directory browsing on this directory.
- In the properties for the Catalog Builder, select the Timing tab and
limit the site that contains the virtual directory to one document at a
time.
- Create a Search project that will do a Web crawl of the PDF virtual
directory. Set the crawl to one (1) page hop allowed.
- Build the catalog.
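Directory browsing presumably matters because the crawl starts at the directory listing page and reaches each PDF file by following one link from it, which is the single page hop allowed. The following Python sketch shows that relationship under stated assumptions: the server name, the listing HTML, and the names PdfLinkCollector and pdfs_one_hop_away are all hypothetical, and the listing your Web server produces may be formatted differently.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PdfLinkCollector(HTMLParser):
    """Collect absolute URLs of links that end in .pdf."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.pdf_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href and href.lower().endswith(".pdf"):
            # Resolve relative links against the listing page URL.
            self.pdf_urls.append(urljoin(self.base_url, href))


def pdfs_one_hop_away(base_url, listing_html):
    """Return the PDF URLs linked directly from a directory listing page."""
    collector = PdfLinkCollector(base_url)
    collector.feed(listing_html)
    return collector.pdf_urls


# Hypothetical directory listing for a virtual directory named "pdfs".
listing = (
    '<html><body>'
    '<a href="/pdfs/guide.pdf">guide.pdf</a><br>'
    '<a href="/pdfs/notes.txt">notes.txt</a><br>'
    '<a href="spec.PDF">spec.PDF</a>'
    '</body></html>'
)
print(pdfs_one_hop_away("http://server/pdfs/", listing))
```

If directory browsing is turned off, there is no listing page to link to the files, so a crawl that starts at the directory URL has nothing to follow.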
Modification Type: Major
Last Reviewed: 7/15/1999
Keywords: kbprb KB188957