Search keeps trying to crawl the internet, when the server has no internet access
-
April 18, 2012 21:21
We, like many of you, have a SharePoint server that has NO internet access. We have people who link to things like howtogeek.com or cnn.com, and when SharePoint crawls our site it tries to crawl those sites, even though the content source is set to "Crawl only the SharePoint Site of each start address". I then created a content source of type "web" rather than "sharepoint" and set it to "Only crawl within the server of each start address". I also configured it as "custom" with "0" server hops. It still goes out to the internet. Why? I've run a full crawl with verbose logging to find out where the links are, but the log never shows a lot of the sites that are showing up in our firewall log. However, I know that it's search doing this, because it doesn't happen until I run a full crawl.
I've disabled all federated locations, etc. I just can't figure it out.
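A minimal sketch of how the comparison described above could be made repeatable: it takes the hosts seen in the firewall log and the URLs listed in the crawl log, and reports the hosts that the crawl log does not account for. The function names `host_of` and `unexplained_hosts` are hypothetical, and the sketch assumes both logs have already been exported to plain lists of URLs or host names, which may not match the actual export formats.

```python
from urllib.parse import urlsplit


def host_of(entry):
    """Return the lowercase host name from a URL or a bare host string."""
    entry = entry.strip()
    if "//" not in entry:
        # Prefixing "//" lets urlsplit treat a bare host as the network location.
        entry = "//" + entry
    return (urlsplit(entry).hostname or "").lower()


def unexplained_hosts(firewall_entries, crawl_log_urls):
    """Hosts present in the firewall log but absent from the crawl log."""
    crawled = {host_of(url) for url in crawl_log_urls}
    seen = {host_of(entry) for entry in firewall_entries}
    return sorted(host for host in seen - crawled if host)


# Made-up sample data, for illustration only.
firewall = ["cnn.com", "http://www.howtogeek.com/some/page", "intranet"]
crawl_log = ["http://intranet/sites/team", "http://www.howtogeek.com/some/page"]
print(unexplained_hosts(firewall, crawl_log))
```

Any host this prints was contacted according to the firewall but never appears in the crawl log, which narrows down what to look for in the verbose logs.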
Bryan
All replies
-
April 25, 2012 7:41 Moderator
Hi Bryan,
Thank you for your question.
I am trying to involve someone familiar with this topic to further look at this issue.
Thanks,
Lhan Han
-
April 25, 2012 14:38
Appreciate it. Would love to get to the bottom of this problem.
-
April 27, 2012 15:19
Hello Bryan,
Do you have a proxy server set up?
Thanks!
Regards,
Shruti
-
April 27, 2012 15:31
No, we have a firewall, but none of our servers are set up to be allowed through it. There is a client that is installed on the desktops, but per our normal procedures the servers are not even given the client.
Bryan
-
May 8, 2012 14:51
Hello Bryan,
A good way to check is to look through the IIS logs and see whether the content access account is accessing sites over the internet.
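As a rough sketch of that check, the script below reads IIS log lines in the W3C extended format and counts the paths logged for one account. It assumes the log includes the `cs-username` and `cs-uri-stem` fields (both are optional in IIS logging settings), and the account name `CONTOSO\svc_crawl`, the function name `requests_by_account`, and the sample lines are all made up for illustration.

```python
from collections import Counter


def requests_by_account(log_lines, account):
    """Count the URL paths logged for one account in W3C-format IIS log lines."""
    fields = []
    hits = Counter()
    wanted = account.lower()
    for line in log_lines:
        line = line.strip()
        if line.startswith("#Fields:"):
            # The #Fields directive names the columns of the entries that follow.
            fields = line[len("#Fields:"):].split()
            continue
        if not line or line.startswith("#") or not fields:
            continue  # skip other directives, blank lines, and headerless data
        row = dict(zip(fields, line.split()))
        if row.get("cs-username", "-").lower() == wanted:
            hits[row.get("cs-uri-stem", "-")] += 1
    return hits


# Made-up sample lines; the account name is hypothetical.
sample = [
    "#Software: Microsoft Internet Information Services 7.5",
    "#Fields: date time cs-method cs-uri-stem cs-username c-ip sc-status",
    "2012-05-08 14:51:02 GET /sites/team/default.aspx CONTOSO\\svc_crawl 10.0.0.5 200",
    "2012-05-08 14:51:03 GET /sites/team/default.aspx CONTOSO\\jdoe 10.0.0.9 200",
    "2012-05-08 14:51:04 GET /_layouts/images/logo.png CONTOSO\\svc_crawl 10.0.0.5 200",
]
for path, count in requests_by_account(sample, "CONTOSO\\svc_crawl").most_common():
    print(count, path)
```

To run it against a real log, pass an open file object instead of the sample list, since the function only needs an iterable of lines.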
Thanks!
Regards,
Shruti
- Marked as answer by Shruti-MSFT May 14, 2012 15:04
- Unmarked as answer by Bryan - COE May 14, 2012 16:12
-
May 14, 2012 16:13
ZERO entries. I thought I had updated this post, but I must not have. There are absolutely zero entries going to any of the sites listed in my firewall logs. BUT there are no firewall entries unless search is in the process of crawling.
- Marked as answer by Shruti-MSFT May 14, 2012 21:43
- Unmarked as answer by Shruti-MSFT May 14, 2012 21:43
-
May 14, 2012 21:54
Hello Bryan,
Strange that there are no entries in the IIS logs for the content access account. It looks like a network issue. An analysis of the network trace would be helpful here. I would suggest opening up a ticket with MS support for further research into the issue.
Thanks!
Regards,
Shruti
-
May 15, 2012 13:56
Hello Bryan,
I posted a reply too, but it looks like it did not get saved. It is strange that the IIS logs do not have any entries for the content access account accessing the sites over the internet. I would suggest collecting a network trace and analysing it.
Regards,
Shruti