none
FAST/SharePoint 2010 can't crawl a Sp2007 web site (with additional authentication) RRS feed

  • Question

  • Hi,

    We have a problem we can't for the life of us figure out how to solve. We want to crawl a SharePoint 2007 site as SharePoint site (to get relevant metadata). We are successfully crawling this source as a web page, but then we only get the boring 'Web page' as a content type, and similiar low-valued meta. The web page is public, but in order to log in, we have to go through another system for authentication. We have a username and password and an account with full read permissions on the server, which we can log in with manually.  We have tried different crawl rules with/without form credentials, and with/without custom user authentication. No success.

    We get the following error when trying to crawl this content source:

    The start address http://blablabla.com cannot be crawled.

    Context: Application 'FastContent', Catalog 'Portal_Content'

    Details:

    Access is denied. Verify that either the Default Content Access Account has access to this repository, or add a crawl rule to crawl this repository. If the repository being crawled is a SharePoint repository, verify that the account you are using has "Full Read" permissions on the SharePoint Web Application being crawled. (0x80041205)

    We have googled A LOT for this error, and it shows up over several places. We have tried "all" the suggested fixes, most of which are mentioned in a relevant problem-thread here: http://www.go4answers.com/Example/sharepoint-2010-crawling-external-33489.aspx, including:

    • Ignoring robots in the FAST xml-file and loading this file
    • Disable strict name checking (in the registry)
    • Disable loopback check (in the registry

    Probably other fixes as well I can't recall now. How can we resolve this issue? Is this a bug?

    Thursday, December 2, 2010 12:50 PM

All replies

  • In order for this to work you have to do one of two things:

    1. On the MOSS 2007 side create a web application policy (done in Central Admin) for your SharePoint 2010 content access account. Give it full read access to your MOSS 2007 web applications.

    or

    2. Create a crawl rule on SharePoint 2010 and tell it to use the content access account you used to crawl MOSS 2007

    Hope this helps.

    Regards,
    Mikael Svenson


    Search Enthusiast - MCTS SharePoint/WCF4/ASP.Net4
    http://techmikael.blogspot.com/ - http://www.comperiosearch.com/
    Thursday, December 2, 2010 8:29 PM
  • Yeah, we have done both those things with no luck.

    -T

    Friday, December 3, 2010 8:07 AM
  • Check the IIS logs on your SharePoint 2007 to see if anything gets logged when you launch a crawl from SP2010 (to see that at least the user requests get there).

    "in order to log in, we have to go through another system for authentication." -> ? a web app zone that uses windows authentication ?

    The account you use to crawl 2007 content, is it a domain account?


    Florin DUCA
    MCSE 2003 +Sec,MCTS conf/dev WSS3/MOSS, MCITP/MCPD SP 2010, MCPD ASP.Net 3.5, MCTS ISA 2006
    Logica Business Consulting, France
    Friday, December 3, 2010 8:51 AM
  • The other system is a custom made system with email/password-login(that's all I know about it), not Windows authentication.

    The account is a domain account, yes.

    There were no relevant information in the SharePoint log. In the security event log, this showed up when trying to connect. Still, we get the same error on our Sharepoint2010-server:

    Event Type: Success Audit
    Event Source: Security
    Event Category: Logon/Logoff
    Event ID: 538
    Date:  07.12.2010
    Time:  08:51:46
    User:  EKSTERN\Fast-svc
    Computer: SRVR-E022
    Description:
    User Logoff:
      User Name: Fast-svc
      Domain:  EKSTERN
      Logon ID:  (0x0,0x150A3420)
      Logon Type: 7


    For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
    ****************************************************************************
    Event Type: Success Audit
    Event Source: Security
    Event Category: Logon/Logoff
    Event ID: 576
    Date:  07.12.2010
    Time:  08:51:46
    User:  EKSTERN\Fast-svc
    Computer: SRVR-E022
    Description:
    Special privileges assigned to new logon:
      User Name: Fast-svc
      Domain:  EKSTERN
      Logon ID:  (0x0,0x150A3420)
      Privileges: SeSecurityPrivilege
       SeBackupPrivilege
       SeRestorePrivilege
       SeTakeOwnershipPrivilege
       SeDebugPrivilege
       SeSystemEnvironmentPrivilege
       SeLoadDriverPrivilege
       SeImpersonatePrivilege

    For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
    ****************************************************************************
    Event Type: Success Audit
    Event Source: Security
    Event Category: Logon/Logoff
    Event ID: 528
    Date:  07.12.2010
    Time:  08:51:46
    User:  EKSTERN\Fast-svc
    Computer: SRVR-E022
    Description:
    Successful Logon:
      User Name: Fast-svc
      Domain:  EKSTERN
      Logon ID:  (0x0,0x150A3420)
      Logon Type: 7
      Logon Process: User32 
      Authentication Package: Negotiate
      Workstation Name: SRVR-E022
      Logon GUID: {fc17ca13-38a7-b281-8a05-c13bc93936ec}
      Caller User Name: SRVR-E022$
      Caller Domain: EKSTERN
      Caller Logon ID: (0x0,0x3E7)
      Caller Process ID: 2844
      Transited Services: -
      Source Network Address: XX.XX.XX.134
      Source Port: 65033


    For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.

    Tuesday, December 7, 2010 8:44 AM