How can I create a local copy of a Sharepoint wiki

  • Question

  • We've put an immense amount of information in our Sharepoint wiki.

    I'd like to have a local copy of the wiki pages on my hard drive so it's available when I'm offline. The links and editor don't need to work, I just need to be able to feed it to Windows Search so I can find the wiki material.

    I've tried a couple of offline browser products to see if I could harvest the pages, but so far the results have been mediocre. I also tried accessing the wiki list using Access 2007 to see if I could get at the files (documents) attached in the list. No luck.

    I'm a SP administrator, but I have no server access.

    Is there any way to get a local copy of the wiki?
    Tuesday, June 30, 2009 1:58 PM

All replies

  • Your best bet might be to come up with a process that extracts the rendered HTML of each page rather than the underlying, unformatted data.
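
    As a rough illustration of that idea, here is a minimal Python sketch that reduces an already-downloaded rendered page to plain text that a desktop indexer could pick up. Fetching the page (and authenticating against the server) is deliberately left out, and the names TextExtractor and rendered_html_to_text are made up for this example.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text from a page, skipping script and style blocks."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if self._skip_depth == 0 and text:
            self.parts.append(text)


def rendered_html_to_text(html):
    """Return the visible text of a rendered page, one fragment per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.parts)
```

    Saving the original HTML next to the extracted text would keep the formatting available for viewing, while the text file stays easy to search.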

    SharePoint Developer | Administrator | Evangelist -- Twitter -- Blog - http://nextconnect.blogspot.com
    Tuesday, June 30, 2009 2:21 PM
  • Which "offline browser products" have you tried?

    Did you try Colligo, for instance?
    WSS FAQ sites: http://wssv2faq.mindsharp.com and http://wssv3faq.mindsharp.com
    Total list of WSS 3.0 / MOSS 2007 Books (including foreign language) http://wssv3faq.mindsharp.com/Lists/v3%20WSS%20FAQ/V%20Books.aspx
    Tuesday, June 30, 2009 2:38 PM
  • I've tried Teleport Pro (couldn't authenticate) and WebCopier (couldn't get data properly).

    Do you have experience using Colligo for this purpose?

    The full list of the web robots I'm testing is here:

    http://tech.kateva.org/2009/07/offline-browsers-web-robot-options-to.html

    Update: The Colligo reference is very interesting. I'll definitely pursue it: http://www.colligo.com/go/webads/reader_G.asp?gclid=CJbOrK38t5sCFRHxDAodyQiFAg
    Thursday, July 2, 2009 7:00 PM
  • Mike, I just tested Colligo Reader. It's promising, but this is the area of functionality they struggle with.

    This is from my blog post: http://tech.kateva.org/2009/06/how-can-i-create-local-copy-of.html

    " ...The Colligo suite of products provides offline access to a lot of SP content. Unfortunately, they’re just starting to look at support for the wiki content. Their application architecture is not ideal for this purpose however.

    The Colligo application suite is designed to support offline editing and synchronization of SP content. That’s a very ambitious goal, and it’s easy to see why they need to use a proprietary database to store content and to hide the data from ‘backdoor’ manipulation in ‘…. Local Settings\Application Data\ColligoOfflineClient\Storage5…’.

    Sadly, this doesn’t fit that well with Microsoft’s oddball Wiki implementation. I’d like to be able to view the Wiki pages in IE, to have local links become ‘relative links’ (so they work against the local store), and expose the data to Windows Search. None of that works with the proprietary store. (I don’t care about offline editing as much as having an offline store, so the main value of the Colligo architecture doesn’t apply to me.)

    Lastly, their HTML viewer is just a placeholder for future work; it’s a keyhole view of the Wiki data. Of course they’ll improve this, but the other challenges are tougher.

    There aren’t a lot of alternatives, however. I’d go for it if Colligo were to add an export feature that would create a static HTML view of the wiki data on demand..."
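
    To make the "relative links" wish concrete, here is a small Python sketch of the rewriting step for pages that have already been saved locally. It assumes a flat wiki library where every page lives directly under one base URL and is saved locally as <page name>.html; the function name localize_links and the example.com URLs in the usage note are placeholders, not anything from a real product.

```python
import posixpath
import re
from urllib.parse import urlsplit

# Matches double-quoted href attributes only; a sketch, not a full HTML parser.
HREF_RE = re.compile(r'href="([^"]*)"', re.IGNORECASE)


def localize_links(html, wiki_base):
    """Point links that target pages under wiki_base at local .html files."""
    base = urlsplit(wiki_base)
    base_path = base.path.lower()

    def rewrite(match):
        parts = urlsplit(match.group(1))
        same_host = parts.netloc in ("", base.netloc)
        if same_host and parts.path.lower().startswith(base_path):
            leaf = posixpath.basename(parts.path)
            name, _ext = posixpath.splitext(leaf)
            if name:
                # Percent-encoding is kept so the link still resolves locally.
                return 'href="%s.html"' % name
        return match.group(0)  # leave every other link untouched

    return HREF_RE.sub(rewrite, html)
```

    For example, with wiki_base set to "http://sp.example.com/wiki/", a link to "http://sp.example.com/wiki/Home.aspx" would become "Home.html", while links to other hosts are left alone.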

    Monday, July 6, 2009 4:02 PM
  • I've been doing the same thing for archiving purposes since my wiki lives on the corporate server and I want to have a personal copy of all the information I've collected in case I'm laid off.  I created a view that showed all the important content that I wanted to archive (the wiki content is a column).  From there I can export the whole wiki to a spreadsheet.  Pretty barebones, but it does the job...
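
    If the exported spreadsheet is saved as CSV, a short Python script could turn each row back into a standalone HTML file that Windows Search can index. This is only a sketch: the column names "Name" and "Wiki Content" are assumptions and would need to match whatever the exported view actually uses, and export_pages is a made-up helper name.

```python
import csv
import html
import os


def export_pages(csv_path, out_dir, title_col="Name", content_col="Wiki Content"):
    """Write one HTML file per CSV row and return the paths written."""
    os.makedirs(out_dir, exist_ok=True)
    written = []
    # utf-8-sig also copes with the byte-order mark some exports add.
    with open(csv_path, newline="", encoding="utf-8-sig") as handle:
        for row in csv.DictReader(handle):
            title = (row.get(title_col) or "").strip()
            if not title:
                continue  # skip rows without a page name
            # Replace characters that are awkward in file names.
            safe = "".join(c if c.isalnum() or c in " -_" else "_" for c in title)
            path = os.path.join(out_dir, safe + ".html")
            with open(path, "w", encoding="utf-8") as out:
                out.write(
                    "<html><head><title>%s</title></head><body>%s</body></html>"
                    % (html.escape(title), row.get(content_col) or "")
                )
            written.append(path)
    return written
```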
    Thursday, August 20, 2009 9:31 PM
  • I'm sure this issue is closed for the original author - and this may not even help, but if you have access to the content database, the Wiki content is stored as raw HTML...
    select tp_DirName,tp_LeafName,ntext2 from AllUserData
    where tp_ContentType='Wiki Page'

    Will return the directory name, the ASPX page that the Wiki is embedded into, and the HTML that makes up the Wiki...

    If you have versioning enabled - you'll get every version of every page so you may have to filter for the latest version as well.
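
    One way to do that filtering after the fact, sketched in Python: this assumes each fetched row also carries a numeric version value alongside the three columns above. Which column holds that value is not covered here, so the version field is an assumption, and latest_versions is a made-up helper name.

```python
def latest_versions(rows):
    """Keep only the highest-version HTML for each (directory, page) pair.

    rows: iterable of (dir_name, leaf_name, version, html) tuples, where
    version is an assumed numeric field that grows with each edit.
    """
    latest = {}
    for dir_name, leaf_name, version, html in rows:
        key = (dir_name, leaf_name)
        if key not in latest or version > latest[key][0]:
            latest[key] = (version, html)
    return {key: html for key, (_version, html) in latest.items()}
```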
    • Edited by CarlSteffen Wednesday, September 30, 2009 7:14 PM added versioning comment
    Wednesday, September 30, 2009 5:54 PM