get word count in a Document - Fast search server for sharepoint 2010

  • Question

  • My question to FAST Search Server 2010 experts is simple: is there a way to get the keyword count for a document in the index? If not, does it make sense to extend the pipeline to do this? We would like to display keyword counts in the search results, e.g. sharepoint(20), fast(20). Please keep in mind that I am talking about document keywords, not refinement counts.

    Best Regards,


    I would appreciate any help anyone can give.

    Saturday, April 16, 2011 10:37 AM

All replies

  • Not sure what you mean by "keyword", but if it's a column containing words you want to check for in the body of the document, then you would have to write a custom pipeline stage and store the count in a separate crawled property, which you then map to a managed property.

    You would write a module that takes in both the keywords and the extracted text (body) and counts the occurrences, for example by matching with regular expressions. I have a blog post about how to debug and log in a pipeline stage for FAST for SharePoint. It includes downloadable code that counts all the words in a document, which might help you get started.
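
    As a rough illustration of the counting step only, here is a minimal Python sketch. It is not the code from the blog post; the function names `count_keywords` and `format_counts` are made up for this example, and it assumes the keywords and the body text have already been read from the stage's input.

```python
import re

def count_keywords(body, keywords):
    """Count whole-word, case-insensitive occurrences of each keyword in body."""
    counts = {}
    for keyword in keywords:
        # \b restricts matches to whole words; re.escape guards against
        # regex metacharacters inside a keyword. Keywords that start or end
        # with a non-word character (e.g. "c++") would need different anchors.
        pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
        counts[keyword] = len(pattern.findall(body))
    return counts

def format_counts(counts):
    """Render counts in the 'sharepoint(20), fast(20)' style from the question."""
    return ", ".join(f"{word}({n})" for word, n in counts.items())

body = "FAST Search Server for SharePoint indexes SharePoint content fast."
counts = count_keywords(body, ["sharepoint", "fast"])
print(format_counts(counts))  # prints: sharepoint(2), fast(2)
```

    The formatted string (or the individual counts) is what the stage would write to the new crawled property.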

    Mikael Svenson 

    Search Enthusiast - MCTS SharePoint/WCF4/ASP.Net4
    Saturday, April 16, 2011 6:39 PM
  • Hello Mikael,

    I would like to get the document body as a managed property, so that I can include that property in my search results.

    Could you suggest some solution?

    Monday, September 26, 2011 7:11 AM
  • Hi Felix,

    You could do something similar to my blog post on prototyping extensibility stages in PowerShell.

    You would copy the crawled property 11280615-f653-448f-8ed8-2915008789f2:body:31 over to a crawled property of your choosing which you map to a new managed property.

    This would basically be a property copy, but it is needed because the "body" crawled property is not available outside of the pipeline as a normal crawled property.
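
    A property copy along those lines could be sketched in Python as below. This is an assumption-heavy sketch rather than the PowerShell from the blog post: it assumes the stage is called with an input and an output file path, and that both files use a `<Document>` root with `<CrawledProperty propertySet="..." varType="..." propertyName="...">` children. The target property set GUID and the name `bodycopy` are placeholders for a crawled property you create yourself.

```python
import sys
import xml.etree.ElementTree as ET

# Source crawled property named above: property set GUID and property name.
SOURCE_SET = "11280615-f653-448f-8ed8-2915008789f2"
SOURCE_NAME = "body"
# Hypothetical target crawled property; replace with one you have created.
TARGET_SET = "00000000-0000-0000-0000-000000000000"
TARGET_NAME = "bodycopy"
TARGET_VARTYPE = "31"  # same variant type as the source body property

def copy_body(input_root):
    """Build an output <Document> holding a copy of the body property."""
    output_root = ET.Element("Document")
    for prop in input_root.iter("CrawledProperty"):
        # Compare GUIDs case-insensitively and tolerate surrounding braces.
        prop_set = prop.get("propertySet", "").strip("{}").lower()
        if prop_set == SOURCE_SET and prop.get("propertyName") == SOURCE_NAME:
            target = ET.SubElement(output_root, "CrawledProperty", {
                "propertySet": TARGET_SET,
                "varType": TARGET_VARTYPE,
                "propertyName": TARGET_NAME,
            })
            target.text = prop.text
    return output_root

# Assumed invocation: python copy_body.py <input file> <output file>
if __name__ == "__main__" and len(sys.argv) == 3:
    result = copy_body(ET.parse(sys.argv[1]).getroot())
    ET.ElementTree(result).write(sys.argv[2], encoding="utf-8",
                                 xml_declaration=True)
```

    Registering the stage with the pipeline and mapping the target crawled property to a managed property are separate steps not shown here.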

    Mikael Svenson 

    Search Enthusiast - SharePoint MVP/WCF4/ASP.Net4
    Monday, September 26, 2011 7:57 AM