Answered by:
Indexing Of Files

Question
-
I am in a scenario where I will eventually have 500 million documents to search.
1) I believe Windows Search, used with the SDK on Windows Server 2008 is nowhere near reliable or scalable enough. Please verify.
2) If I use Sharepoint, do the files on the server all have to be checked into Sharepoint, or can it index the files on the server, on shared UNC paths, etc? Or do the files have to be checked into Sharepoint, and are they stored in sql server. I need a .net programming interface to give me a list of files.
Tuesday, December 27, 2011 5:04 PM
Answers
-
I am not sure if windows search is reliable or not in your scenario. But I can give details about your second question.
The files need NOT be checked in or uploaded to any SharePoint site. You can crawl a shared path, by creating a content source for that.
After indexing is done, you can use the OOB webparts to show results in a SharePoint site. Or use the Search API to get the results and show them on any interface you need.
Ram Prasad Meenavalli | MCTS SharePoint 2010 | MCPD SharePoint 2010- Edited by Ram Prasad Meenavalli Tuesday, December 27, 2011 5:26 PM
- Marked as answer by Derek Michael Hart Wednesday, December 28, 2011 1:17 AM
Tuesday, December 27, 2011 5:26 PM
All replies
-
I am not sure if windows search is reliable or not in your scenario. But I can give details about your second question.
The files need NOT be checked in or uploaded to any SharePoint site. You can crawl a shared path, by creating a content source for that.
After indexing is done, you can use the OOB webparts to show results in a SharePoint site. Or use the Search API to get the results and show them on any interface you need.
Ram Prasad Meenavalli | MCTS SharePoint 2010 | MCPD SharePoint 2010- Edited by Ram Prasad Meenavalli Tuesday, December 27, 2011 5:26 PM
- Marked as answer by Derek Michael Hart Wednesday, December 28, 2011 1:17 AM
Tuesday, December 27, 2011 5:26 PM -
A search scope can be created to index a fileshare, External web site, and/or external data source.
The following link is very light on technical content, but will get your head into the game:
http://office.microsoft.com/en-us/sharepoint-server-help/define-scopes-for-searches-HA010241119.aspx
Steve Clark, MCTS | Twin-Soft.comTuesday, December 27, 2011 5:30 PM -
You ask about "SharePoint". As this is a pre-SharePoint 2010 forum that means you are asking about one of the 2007 (or earlier) SharePoint products.
In the 2007 range there were two main SharePoint products WSS 3.0 and MOSS 2007. The answers you have received so far about question 2 (clearly q1 about Windows Search is off-topic here) only apply to MOSS 2007 as WSS 3.0 does not index anything else than files stored in itself.
However the main point that seems to have been ignored so far is whether SharePoint is suitable for searching 500 Million documents even if it could. I would instead look at the FAST products for that - note that there are specific forums for the different versions of them, just as there are different forums for SharePoint 2010 if you want to ask about the current SharePoint range.
SP 2010 "FAQ" (mainly useful links): http://wssv4faq.mindsharp.com/default.aspx
WSS3/MOSS FAQ (FAQ and Links) http://wssv3faq.mindsharp.com/default.aspx
Both also have links to extensive book lists and to (free) on-line chapters- Edited by Mike Walsh FIN Tuesday, December 27, 2011 6:15 PM a for the
- Proposed as answer by JD Wade, MCTS, MCITP Tuesday, December 27, 2011 7:28 PM
- Unproposed as answer by Derek Michael Hart Wednesday, December 28, 2011 1:17 AM
Tuesday, December 27, 2011 6:14 PM -
Steve,
Would it be possible to consult with you for a couple hours?
Derek Hart
415-747-8925Thursday, December 29, 2011 11:19 PM