Attachment size crawl issue

    Question

  • Guys,

    Is there any limit on attachment size for search crawling? We are having trouble crawling a PDF attachment larger than 11 MB.

    Nitin

    Monday, November 17, 2014 8:48 AM

Answers

All replies

  • Here is how you can increase the limit (MaxDownloadSize is given in MB):

    http://www.toddklindt.com/blog/Lists/Posts/Post.aspx?ID=215

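    # Get the Search service application (assumes there is only one in the farm)
    $s = Get-SPEnterpriseSearchServiceApplication

    # Show the current limit, in MB
    $s.GetProperty("MaxDownloadSize")

    # Raise the limit to 25 MB and save the change
    $s.SetProperty("MaxDownloadSize", 25)
    $s.Update()

    # Restart the search service so the change takes effect
    # (osearch14 is the SharePoint 2010 service name; on SharePoint 2013 it is OSearch15)
    Restart-Service osearch14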

      



    Monday, November 17, 2014 8:55 AM
  • Inderjeet,

    It's already been increased, but that had no effect.

    Nitin

    Monday, November 17, 2014 10:24 AM
  • I believe there are three separate limits:

    MaxDownloadSize: the crawler limit, i.e. how large a document can be for the crawler to download it.

    Document parsing limit (number of characters): applies as the document moves through internal content processing. It is currently not changeable, and it is undocumented how many bytes those 2000000 characters translate to. The corresponding crawl log message looks like: "Document 'http://.../xyz.pdf' was partially processed. Document produces an output text that exceeds the maximum limit (2000000 characters)."

    Stored-in-index limit: controlled by MP.MaxCharactersInPropertyStoreIndex, which can now be raised to 2MB. The corresponding crawl log message looks like: "The item has been truncated in the index because it exceeds the maximum size".
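
    On the characters-versus-bytes question, here is a rough sketch of why one character count can map to very different byte sizes. It assumes the 2000000 figure counts characters of extracted text (as the crawl log message suggests) rather than the size of the file on disk; the function name Get-EncodedSize and the sample characters are made up for illustration.

```powershell
# Sketch only: compares character count with byte size under two encodings.
# Assumption: the limit counts characters of extracted text, not file bytes.
$limit = 2000000

function Get-EncodedSize {
    param([string]$Text)
    [pscustomobject]@{
        Characters = $Text.Length
        Utf8Bytes  = [System.Text.Encoding]::UTF8.GetByteCount($Text)
        Utf16Bytes = [System.Text.Encoding]::Unicode.GetByteCount($Text)
    }
}

# Illustrative samples, each exactly $limit characters long
$ascii = 'a' * $limit                      # 1 byte per character in UTF-8
$cjk   = ([string][char]0x65E5) * $limit   # 3 bytes per character in UTF-8

Get-EncodedSize -Text $ascii
Get-EncodedSize -Text $cjk
```

    So the same 2000000 characters could be anywhere from roughly 2 MB to 6 MB of text depending on content and encoding, and an 11 MB PDF may land on either side of the limit depending on how much text it actually contains.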

    Reference: http://technet.microsoft.com/en-us/library/cc262787%28v=office.15%29.aspx#Search

     - Document size crawl component can download

     - Parsed Content Size

    As of the April 2013 CU, you can raise the index storage limit to 2MB per managed property via MP.MaxCharactersInPropertyStoreIndex:

    http://www.dotnetmafia.com/blogs/dotnettipoftheday/archive/2013/06/21/increasing-the-maximum-number-of-characters-indexed-by-search-in-sharepoint-2013.aspx

    Keep in mind that raising MP.MaxCharactersInPropertyStoreIndex to 2MB doesn't mean you'll always avoid the 2000000-character parsing limit, since that is a separate limit applied earlier in content processing.
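
    For completeness, a minimal sketch of the managed property change, to be run in the SharePoint 2013 Management Shell. It assumes a single Search service application, a farm at the April 2013 CU or later, that "body" is the managed property being truncated, and that the 2MB ceiling corresponds to 2097152; check all of these against your own farm and the linked post before using it.

```powershell
# Sketch only; requires the SharePoint 2013 snap-in on a farm server.
Add-PSSnapin Microsoft.SharePoint.PowerShell -ErrorAction SilentlyContinue

# Assumes exactly one Search service application in the farm
$ssa = Get-SPEnterpriseSearchServiceApplication

# "body" is an assumed target; substitute the managed property that is truncated
$mp = Get-SPEnterpriseSearchMetadataManagedProperty -SearchApplication $ssa -Identity "body"

# Show the current value before changing it
$mp.MaxCharactersInPropertyStoreIndex

# 2097152 is an assumed reading of the "2MB" ceiling mentioned in this thread
$mp.MaxCharactersInPropertyStoreIndex = 2097152
$mp.Update()
```

    Items that are already in the index would presumably need to be recrawled before the new value has any visible effect.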

    Wednesday, November 26, 2014 6:09 PM