O365 PDF Search

  • Question

  • I have a large set of PDF documents that are not text searchable. They are essentially scanned images of text documents (think contracts).

    Is there a recommended way to process these PDFs in bulk so that they:

    • Have proper naming conventions (SharePoint libraries seem to block certain special characters in file names)
    • Are indexed, so that once they are uploaded they can be searched by keyword
    Friday, July 29, 2016 3:32 AM

Answers

  • Add metadata columns (fields) to the document library to hold the keywords. Use Quick Edit mode (think spreadsheet view) to set the metadata for the existing files, or automate it with PowerShell or C#. Once the metadata is set, have a crawl initiated (ask your SharePoint admin for help). When uploading new files, make sure to fill out the metadata for each one. This makes the files easier to find through search.

    ---
    Rajesh
    rjesh.com| @rjesh
    You don't need to buy me a beer, if helpful just smile, vote, and mark it as answer.

    Friday, July 29, 2016 4:20 AM
  • Hi,

    The file name restriction is by design in SharePoint. As a workaround, you can create a new column in the library and set the corresponding value for each file. Searching on that column's value can then help you find files quickly.
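
    If you would rather clean the names before uploading, here is a minimal Python sketch of a rename helper. The blocked character set below is an assumption based on the restrictions commonly listed for SharePoint around 2016; check the current Microsoft documentation for your tenant. The function name `sanitize_filename` and the underscore replacement are illustrative choices, not anything SharePoint prescribes.

```python
import re

# Assumption: characters SharePoint rejected in file names circa 2016.
# Verify this set against current Microsoft documentation before relying on it.
BLOCKED_CHARS = '~"#%&*:<>?/\\{|}'


def sanitize_filename(name: str, replacement: str = "_") -> str:
    """Return `name` with blocked characters swapped for `replacement`."""
    # Build a character class that matches any single blocked character.
    pattern = "[" + re.escape(BLOCKED_CHARS) + "]"
    cleaned = re.sub(pattern, replacement, name)
    # Collapse consecutive replacements so "a??b" becomes "a_b", not "a__b".
    cleaned = re.sub("(?:" + re.escape(replacement) + "){2,}", replacement, cleaned)
    # Drop leading/trailing spaces and periods left over from the original name.
    return cleaned.strip(" .")


if __name__ == "__main__":
    print(sanitize_filename("Smith & Jones: Lease #12.pdf"))
```

    You could run this over a local folder (for example with `pathlib.Path.rename`) before the bulk upload, and keep the original name in a library column so it remains searchable.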

    In SharePoint Online, you cannot start an incremental crawl manually. After you upload a PDF file, you may need to wait 4-6 hours for the incremental crawl to complete before the file shows up in search.

    Microsoft Guidance on Search Crawls in SharePoint Online for your reference:

    http://blogs.catapultsystems.com/eskaggs/archive/2015/05/20/microsoft-guidance-on-search-crawls-in-sharepoint-online/

    Thanks,

    Dean Wang


    Monday, August 1, 2016 10:42 AM
    Moderator
