Fast Search Query SSA Crawl and Content SSA Crawl and index reset

  • Question

  • Hello. I have a lot of questions about the FAST Search process and components, since we do not have anyone knowledgeable about FAST Search in house. Thank you beforehand, because I am cramming a lot of cases into this one inquiry.

    1. FAST Search Query SSA. From my reading, this should be used for queries and for what is termed 'people search'. The content sources currently added to this Query SSA are listed below.
    http://appserver:portnumber
    http://intranet.site.com
    http://mysite.site.com
    sps3://intranet.site.com
    Question: Should I remove all sources other than sps3? What should I do next: an index reset, or a FAST Search server restart? Our farm has multiple servers with load-balanced WFEs.

    2. Fast Search Content SSA. Content sources are as listed below.

    http://intranet.site.com

    http://mysite.site.com

    Question: When we run a full crawl on the Content SSA, it takes a very long time to finish. Is the result of the full crawl under the Content SSA used as a "source" in the Query SSA crawl? Bear in mind that I have no training in FAST Search, so my question may sound very dumb.

    3. Our FAST Search server specs are at the bare minimum, I would say. At times the full crawls for the Query SSA (scheduled Saturday) and the Content SSA (scheduled Sunday) overlap and cause further delay because of resource limitations. Which one should I stop to allow the other to finish? Which is more important to finish?

    4. Recently we had to do an index reset on the Query SSA. However, I mistakenly did the index reset on the Content SSA. I saw that a further step after an index reset on the Content SSA is to manually remove the content from the Content SSA crawl db. What is the impact if I do not follow through with the content removal from the crawl db?
    Or what should I do next? Remove the content from the crawl db, restart the server, and start a full crawl? In which order?

    Wednesday, January 22, 2014 7:21 AM

All replies

  • Answer 1: Keep only http://mysite.site.com under the Query SSA. Because the Query SSA handles query result processing, don't put much load on it. Move the rest of the sites to the Content SSA. Don't perform an index reset unless it is really required.

    Answer 2: A full crawl takes time; how long depends on the size of the content. Increase the number of crawl components to speed up the process.

    Answer 3: You have to decide based on the number of documents in each content source. I would suggest running the smallest content source on Sunday night, because it takes less time and on Monday the index will have fresh data to offer better search.

    Answer 4: FAST stores all indexed documents in the file system. The crawl db holds only information about the crawled documents; the indexed documents themselves exist only in the file system where the indexer is running. You can run a full crawl after the index reset.
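
    The scheduling idea in Answer 3 can be sketched roughly as below. This is only an illustration in Python, not anything that FAST Search or SharePoint runs: the `ContentSource` class, the `assign_slots` helper, the slot names, and the document counts are all made up for the example. It gives the largest crawl the earliest slot so that the smallest one runs last and finishes before Monday.

```python
from dataclasses import dataclass


@dataclass
class ContentSource:
    name: str       # label for the content source (hypothetical)
    ssa: str        # SSA that owns the crawl
    doc_count: int  # estimated number of documents (made-up figure)


def assign_slots(sources, slots):
    """Pair crawl slots with sources, largest source first.

    The largest crawl gets the earliest slot; the smallest gets the last.
    zip() stops at the shorter list, so extra sources or slots are ignored.
    """
    ordered = sorted(sources, key=lambda s: s.doc_count, reverse=True)
    return list(zip(slots, ordered))


sources = [
    ContentSource("people", "Query SSA", 5_000),
    ContentSource("intranet and mysite", "Content SSA", 400_000),
]

schedule = assign_slots(sources, ["Saturday", "Sunday night"])
for slot, source in schedule:
    print(f"{slot}: {source.ssa} ({source.name}, ~{source.doc_count} docs)")
```

    With real numbers you would take the document counts from the crawl logs of each content source instead of guessing them.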

    Thursday, January 23, 2014 7:05 AM
  • Asir,

    Thank you for your feedback. I have further questions. I hope you can bear with my questions since I just started to delve into FS for SP.

    1. I read that for the Query SSA you should only have sps3 in the start addresses, but your feedback says to put in only mysite. Can you confirm that only the mysite start address needs to be set on the Query SSA, and that I should remove the sps3 address?

    2. From my previous question: is the result of the full crawl under the Content SSA used as a "source" in the Query SSA crawl?

    From your feedback: increase the crawl component. May I know where? Or do you mean creating another Content SSA with fewer start addresses?

    4. From my previous question: I mistakenly did the index reset on the Content SSA. I saw that a further step after an index reset on the Content SSA is to manually remove the content from the Content SSA crawl db. However, I did not follow through with the content removal from the crawl db. What should I do next? Remove the content from the crawl db and then start a full crawl?

    Thursday, January 23, 2014 7:48 AM
  • Answer 1: Keep ONLY the people search in the Query SSA. Move everything else to the Content SSA.

    Ref: http://social.technet.microsoft.com/Forums/en-US/bf47c0b6-1b7e-4ad2-9010-2bd3ee2fe6c3/fast-search-query-content-source-vs-fast-content-ssa-content-source?forum=fastsharepoint

    Answer 2: The Content SSA crawls the content and sends it to the FAST server specified in its configuration, and the Query SSA fetches results from the FAST server specified in its configuration.

    So if you point the Content SSA and the Query SSA to the same FAST farm, the content crawled by the Content SSA is accessible through the Query SSA.
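
    As a rough illustration of the split described in Answer 1, the Python sketch below sorts the start addresses from the original question into two lists. It assumes that "people search" means the sps3 (or sps3s, the SSL form) start addresses and that everything else goes to the Content SSA; where the My Site host belongs is not settled in this thread, so treat that part as an assumption. The `split_start_addresses` helper is hypothetical and only sorts strings; it does not change any SharePoint configuration.

```python
# Schemes treated as people-search crawls (assumption for this sketch).
PEOPLE_SCHEMES = ("sps3", "sps3s")


def split_start_addresses(addresses):
    """Group start addresses by the SSA they would be assigned to."""
    plan = {"Query SSA": [], "Content SSA": []}
    for address in addresses:
        # Everything before "://" is the scheme, e.g. "http" or "sps3".
        scheme = address.partition("://")[0].lower()
        target = "Query SSA" if scheme in PEOPLE_SCHEMES else "Content SSA"
        plan[target].append(address)
    return plan


# Start addresses as listed in the original question.
current = [
    "http://appserver:portnumber",
    "http://intranet.site.com",
    "http://mysite.site.com",
    "sps3://intranet.site.com",
]

plan = split_start_addresses(current)
for ssa, addresses in plan.items():
    print(ssa)
    for address in addresses:
        print("   ", address)
```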

    Friday, January 24, 2014 11:34 AM