locked
.pdf iFilter does not appear to be crawling .pdf file content RRS feed

  • Question

  • I have installed the .pdf iFilter according to the instructions found http:www.sharepointsharon.com/2010/03/sharepoint-2010-and-adobe-pdf and in the book entitled "Pro SharePoint 2010 Search", and on completion I have found that

    1. No icons are displayed in my search (none not even Word docs)
    2. The content of pdf files does not appear to have been crawled.

    Anyone have any ideas?

    Thanks.

     

    Tuesday, October 11, 2011 9:36 AM

Answers

All replies

  • Have you checked the crawl schedules, to make sure that one ran?
    Steven Andrews | SharePoint Professional | http://www.twitter.com/backpackerd00d | https://baron72.wordpress.com/
    Tuesday, October 11, 2011 9:39 AM
    Answerer
  • I have the same problem :(

    The pdf icon appear in the list where are the pdf document but when I search a pdf document, in the search result the icon doesn't appear and also I cannot make searchings using words or tittles which appears in the pdf, it cannot find anything.


    Maybe, the problem comes from the step 6:

    6.- Perform a full crawl of your index. Note: An incremental crawl is not sufficient when you have added a new file type. SharePoint only indexes file names with the extensions listed under File Types and ignores everything else. When you add a new file type, you then have to perform a full crawl to forcibly identify all files with the now relevant file extension.

    I don't get it...how can I do that??How did you do that, TomyJones2??

    Thank you :)

    Tuesday, October 11, 2011 11:23 AM
  • Thanks,

    Yes, I have performed several Full crawls since installing the .pdf iFilter by going to the Search Service Application page and selecting "Content Sources" under Crawling in the Quick Menu and selecting the "Local SharePoint sites" link and from within that page at the bottom selecting the "Select full crawl of this content source" and clicking OK.

    Does anyone know the whereabouts of any log file so that I can investigate further?

    Many Thanks.

     

    Tuesday, October 11, 2011 12:55 PM
  • Logs are kept in the Hive/14/Logs directory.  Each log will have the days date in the title so you'll be able to check.  I'll see if I can replicate these issues on my test farm and will advise accordingly.
    Steven Andrews | SharePoint Professional | http://www.twitter.com/backpackerd00d | https://baron72.wordpress.com/
    Tuesday, October 11, 2011 1:12 PM
    Answerer
    • Proposed as answer by pitregallego Wednesday, October 12, 2011 7:40 AM
    • Marked as answer by TonyJones2 Thursday, October 13, 2011 8:25 AM
    Tuesday, October 11, 2011 1:51 PM
  • Following 1 above I noticed that had not set up the registry completely I was missing:

    \\HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\14.0\Search\Setup\Filters

    •  Right-click the Filters  folder and select New Key. Enter “.pdf” for the key value.
    • Add the following values to this key:

    <REG_SZ> Default = <value not set>
    <REG_SZ> Extension = pdf
    <REG_DWORD> FileTypeBucket = 1
    <REG_SZ> MimeTypes = application/pdf

    Now it appears to be working, well it is picking up some of the content of the pdf files. No doubt there will be a little more research on the ability of the Adobe pdf iFilter, but that a different question.

    Many Thanks.

    Thursday, October 13, 2011 8:35 AM