Crawl content of RTF file in file share not working
-
Wednesday, June 20, 2012 2:52 PM
Environment
- Moss 2007 Enterprise SP2, Oct 2010 CUM
- Win server 2003 R2 enterprise SP2
Configuration on server
- added rtffilt.dll (from Microsoft download site) to windows\system32 directory and successfully registered
- changed HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\.rtf to the correct DLL CLSID: {e2403e98-663b-4df6-b234-687789db8560}
- HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Applications\<guid>\Gather\Search\Extensions\ExtensionList add string with next number in sequence with value of rtf
- added rtf to search file types in ssp admin
- reboot server
- load rtf files that have at least one unique word in content, one in doc library, one in file share
- full crawl library and file share
Results
- sharepoint search on title returns docs from both file share and library
- windows search on unique word works in windows explorer
- sharepoint search on unique word returns doc in library, but not doc in file share
Am I missing something?
Thanks
Fox4
All Replies
-
Thursday, June 21, 2012 8:55 AM
Further to the above
Results
- sharepoint search on unique word returns doc in library, but not doc in file share
I set up Advanced Search with two scopes
- All Sites (#1, default)
- test (#2 - scoped to the file share)
sharepoint search on title with both scopes selected returns docs from both file share and library
Search unique word with All Sites selected returns rtf file from Document library.
Search unique word with test scope selected returns nothing.
Search unique word with both scopes selected returns nothing.
I see no obvious difference comparing the SharePoint logs for the All Sites query and the query using both scopes.
Why does the query on both scopes not return the rtf file from the Document library?
Am I doing something wrong, or is search really that difficult?
Fox4
- Edited by Fox4 Thursday, June 21, 2012 8:58 AM typo
-
Thursday, June 21, 2012 9:52 AM
I do not know if this becomes a separate issue or if it is related to the above. I ran the scope return test again using a docx file instead of an rtf file to see if the multiple scopes return problem was in the scopes or in the file type.
Same setup as above, but with docx file. Core results web part set to allow duplicate results.
- sharepoint search on title with both scopes selected returns docx from both file share and library
- Search unique word with All Sites selected returns docx from both file share and library
- Search unique word with test scope selected returns docx from file share. - so far so good...
- Search unique word with both scopes selected returns docx from file share only???
Between the inability to crawl rtf content in a file share, and inconsistent behaviour on results, I am losing confidence in search as a useful tool.
Fox4
-
Wednesday, July 18, 2012 7:04 AM
Greetings ,
Thank you for your post. I am able to reproduce the behavior. While crawling the rtf file from file share, the crawl log report message below
"The filtering process could not load the item. This is possibly caused by an unrecognized item format or item corruption" .
We have confirmed that the behavior you have reported is reproducible at our end and requires in depth troubleshooting. This has been reported however
If you wish to pursue this, please open up a Support Incident with Microsoft for continued troubleshooting on this issue. Alternatively, you could visit the below link to see the various support options that are available to better meet your needs: http://support.microsoft.com/default.aspx?id=fh;en-us;offerprophone;
If you are a MSDN / TechNet subscriber, you can also contact our support by using your free support incidents.Thanks!!
Regards,
Manas Biswas
Microsoft Online Community Support
Please remember to click 'Mark as Answer' on the post that helps you or click 'Unmark as Answer' if a marked post does not actually answer your question. This can be beneficial to other community members reading the thread. -
Monday, July 23, 2012 7:48 AM
Hi Manas
Thanks for your reply. I do not get any crawl errors, the file share simply does not appear in results.
I will mark your reply as the answer and write this off to another case where SharePoint gets you to about 80% of a useful solution.
Fox4
-
Wednesday, August 29, 2012 7:57 AM
Update 29 Aug 2012. I used one of my support calls with Microsoft.
I have narrowed the issue with searching a file share to the following
- file converted from plain text to .doc using a pre 2007 version of Word - content not crawled
- file with .rtf extension - content not crawled
Microsoft support has reproduced my findings and is investigating. I will update with their results.
A work around is to resave the file as .doc or .docx using Word 2007-2010. There is a tool available to do bulk conversions - http://technet.microsoft.com/en-us/library/cc179019.aspx
Fox4

