none
Exchange2010 无法向DAG添加第二个成员(无杀毒软件且防火墙已关) RRS feed

  • 问题

  • 虽然查出和我出现同样问题的帖子,但基本都是说因为杀毒软件防火墙导致的。不过我服务器装的杀毒软件已卸载,Windows内置防火墙已关闭,问题依旧。通过Baidu Google Bing等各个搜索引擎的搜索也未找出有效的解决方法。下面是我的错误日志:

    add-databaseavailabiltygroupserver started on machine EX01.
    [2013-11-03T03:36:31] add-dagserver started
    [2013-11-03T03:36:31] commandline:         $scriptCmd = {& $wrappedCmd @PSBoundParameters }
    [2013-11-03T03:36:31] Option 'Identity' = 'DAG01'.
    [2013-11-03T03:36:31] Option 'MailboxServer' = 'MB02'.
    [2013-11-03T03:36:31] Option 'DatabaseAvailabilityGroupIpAddresses' = ''.
    [2013-11-03T03:36:31] Option 'WhatIf' = ''.
    [2013-11-03T03:36:31] Process: w3wp w3wp.exe:6120.
    [2013-11-03T03:36:31] User context = 'NT AUTHORITY\SYSTEM'.
    [2013-11-03T03:36:31]   Member of group 'Everyone'.
    [2013-11-03T03:36:31]   Member of group 'BUILTIN\Users'.
    [2013-11-03T03:36:31]   Member of group 'NT AUTHORITY\SERVICE'.
    [2013-11-03T03:36:31]   Member of group '控制台登录'.
    [2013-11-03T03:36:31]   Member of group 'NT AUTHORITY\Authenticated Users'.
    [2013-11-03T03:36:31]   Member of group 'NT AUTHORITY\This Organization'.
    [2013-11-03T03:36:31]   Member of group 'BUILTIN\IIS_IUSRS'.
    [2013-11-03T03:36:31]   Member of group 'LOCAL'.
    [2013-11-03T03:36:31]   Member of group 'IIS APPPOOL\MSExchangePowerShellAppPool'.
    [2013-11-03T03:36:31]   Member of group 'BUILTIN\Administrators'.
    [2013-11-03T03:36:31] Updated Progress '正在验证参数。' 2%.
    [2013-11-03T03:36:31] 正在运行
    [2013-11-03T03:36:31] Mailbox server: value passed in = MB02, mailboxServer.Name = MB02, mailboxServer.Fqdn = MB02.office.abc
    [2013-11-03T03:36:31] LogClussvcState: clussvc is Stopped on MB02.office.abc. Exception (if any) = none
    [2013-11-03T03:36:31] The IP addresses for the DAG are (blank means DHCP): 172.16.0.10
    [2013-11-03T03:36:31] Looking up IP addresses for DAG01.
    [2013-11-03T03:36:31]   DAG01 = [ 172.16.0.10 ].
    [2013-11-03T03:36:31] Looking up IP addresses for mb02.
    [2013-11-03T03:36:31]   mb02 = [ 172.16.0.5 ].
    [2013-11-03T03:36:31] Looking up IP addresses for MB02.office.abc.
    [2013-11-03T03:36:31]   MB02.office.abc = [ 172.16.0.5 ].
    [2013-11-03T03:36:31] DAG DAG01 has 1 servers:
    [2013-11-03T03:36:31] DAG DAG01 contains server EX02.
    [2013-11-03T03:36:31] Updated Progress '检查邮箱服务器'MB02'是否在数据库可用性组中。' 4%.
    [2013-11-03T03:36:31] 正在运行
    [2013-11-03T03:36:31] GetRemoteCluster() for the mailbox server failed with exception = 活动管理器操作失败。错误 群集操作失败。错误: 群集 API'"OpenCluster(MB02.office.abc) 失败,出现 0x6d9。错误: 终结点映射器中没有更多的终结点可用。"'失败。。. This is OK.
    [2013-11-03T03:36:31] Ignoring previous error, as it is acceptable if the cluster does not exist yet.
    [2013-11-03T03:36:31] DumpClusterTopology: Opening remote cluster DAG01.
    [2013-11-03T03:36:31] Dumping the cluster by connecting to: DAG01.
    [2013-11-03T03:36:31] The cluster's name is: DAG01.
    [2013-11-03T03:36:31] Groups
    [2013-11-03T03:36:31]     group: 可用存储 [not a CMS]
    [2013-11-03T03:36:31]         OwnerNode: EX02.office.abc
    [2013-11-03T03:36:31]         State: Offline
    [2013-11-03T03:36:31]     group: 群集组 [Cluster Main Group]
    [2013-11-03T03:36:31]         OwnerNode: EX02.office.abc
    [2013-11-03T03:36:31]         State: Online
    [2013-11-03T03:36:31]             Resource: 群集名称 [Online, type = Network Name, PossibleOwners = EX02 ]
    [2013-11-03T03:36:31]                 NetName = [DAG01]
    [2013-11-03T03:36:31]             Resource: 群集 IP 地址 [Online, type = IP Address, PossibleOwners = EX02 ]
    [2013-11-03T03:36:31]                 Address = [172.16.0.10]
    [2013-11-03T03:36:31]                     EnableDhcp = [0]
    [2013-11-03T03:36:31]                     Network = [群集网络 1]
    [2013-11-03T03:36:31] Nodes
    [2013-11-03T03:36:31]     node: EX02.office.abc [ state = Up ]
    [2013-11-03T03:36:31] Subnets
    [2013-11-03T03:36:31]     Name(群集网络 1), Mask(172.16.0.0/23), Role(ClusterNetworkRoleInternalAndClient)
    [2013-11-03T03:36:31]         NIC 172.16.0.4 on Node EX02 in State=Up
    [2013-11-03T03:36:31] Opening the cluster on nodes [ex02].
    [2013-11-03T03:36:31] Other mailbox servers in the DAG are already members of cluster 'DAG01'
    [2013-11-03T03:36:31] The server MB02 does not belong to a cluster, and the other servers belong to DAG01.
    [2013-11-03T03:36:31] Successfully resolved the servers based on the stopped servers list.
    [2013-11-03T03:36:31] The following servers are in the StartedServers list (The list is the StartedServers property of the DAG in AD):
    [2013-11-03T03:36:31] The following servers are in the StoppedServers list:
    [2013-11-03T03:36:31] 正在检查数据库可用性组'DAG01'的成员是否也是群集的成员。
    [2013-11-03T03:36:31] 正在检查群集'DAG01'的成员是否也是数据库可用性组的成员。
    [2013-11-03T03:36:31] According to GetNodeClusterState(), the server MB02 is NotConfigured.
    [2013-11-03T03:36:32] The CNO is currently Online.
    [2013-11-03T03:36:32] InternalValidate() done.
    [2013-11-03T03:36:32] Updated Progress '正在向数据库可用性组'DAG01'添加服务器'MB02'。' 6%.
    [2013-11-03T03:36:32] 正在运行
    [2013-11-03T03:36:32] Updated Progress '正在将服务器'MB02'添加到群集中。' 8%.
    [2013-11-03T03:36:32] 正在运行
    [2013-11-03T03:42:50] 以下日志项来自计算机'EX02.office.abc'上运行的其他进程。BEGIN
    [2013-11-03T03:42:50] [2013-11-03T03:36:31] Opening a local AmCluster handle.
    [2013-11-03T03:36:31] Updated Progress '正在向数据库可用性组'DAG01'添加服务器'mb02'。' 2%.
    [2013-11-03T03:36:31] 正在运行
    [2013-11-03T03:36:31] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseValidateNodeState, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 12, szObjectName = MB02, dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseValidateNodeState, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 12, szObjectName = , dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseGettingCurrentMembership, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 25, szObjectName = DAG01, dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseGettingCurrentMembership, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 25, szObjectName = , dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseAddNodeToCluster, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 37, szObjectName = MB02, dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseAddNodeToCluster, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 37, szObjectName = , dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseValidateNetft, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 50, szObjectName = MB02, dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseValidateNetft, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 50, szObjectName = , dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseValidateClusDisk, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 62, szObjectName = MB02, dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseValidateClusDisk, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 62, szObjectName = , dwStatus = 0x0 )
    [2013-11-03T03:36:32] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseConfigureClusSvc, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 75, szObjectName = MB02, dwStatus = 0x0 )
    [2013-11-03T03:36:34] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseConfigureClusSvc, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 75, szObjectName = , dwStatus = 0x0 )
    [2013-11-03T03:36:34] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseStartingClusSvc, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 87, szObjectName = MB02.office.abc, dwStatus = 0x0 )
    [2013-11-03T03:36:34] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseStartingClusSvc, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 87, szObjectName = , dwStatus = 0x0 )
    [2013-11-03T03:36:34] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseNodeUp, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 100, szObjectName = MB02, dwStatus = 0x0 )
    [2013-11-03T03:39:34] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseNodeUp, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseFatal, dwPercentComplete = 100, szObjectName = MB02, dwStatus = 0x5b4 )
    [2013-11-03T03:39:34] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseFailureCleanup, ePhaseType = ClusterSetupPhaseStart, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 100, szObjectName = MB02, dwStatus = 0x0 )
    [2013-11-03T03:42:50] ClusterSetupProgressCallback( eSetupPhase = ClusterSetupPhaseFailureCleanup, ePhaseType = ClusterSetupPhaseEnd, ePhaseSeverity = ClusterSetupPhaseInformational, dwPercentComplete = 100, szObjectName = , dwStatus = 0x0 )
    
    [2013-11-03T03:42:50] 前一个日志项来自计算机'EX02.office.abc'上运行的另一个进程。END
    [2013-11-03T03:42:50] 因为遇到错误,操作未成功。可在日志文件"C:\ExchangeSetupLogs\DagTasks\dagtask_2013-11-03_03-36-31.568_add-databaseavailabiltygroupserver.log"中找到更多详细信息。
    [2013-11-03T03:42:50] WriteError! Exception = Microsoft.Exchange.Cluster.Replay.DagTaskOperationFailedException: 服务器端数据库可用性组管理操作失败。错误: 操作失败。CreateCluster 错误可能是静态地址配置错误所致。错误: 群集操作失败。错误: 群集 API'"AddClusterNode() (MaxPercentage=100) 失败,出现 0x5b4。错误: 由于超时时间已过,该操作返回。"'失败。 ---> Microsoft.Exchange.Cluster.Replay.AmClusterApiException: 活动管理器操作失败。错误 群集操作失败。错误: 群集 API'"AddClusterNode() (MaxPercentage=100) 失败,出现 0x5b4。错误: 由于超时时间已过,该操作返回。"'失败。。 ---> System.ComponentModel.Win32Exception: 由于超时时间已过,该操作返回。
       --- 内部异常堆栈跟踪的结尾 ---
       在 Microsoft.Exchange.Cluster.ClusApi.AmCluster.AddNodeToCluster(AmServerName nodeName, IClusterSetupProgress setupProgress, IntPtr context, Exception& errorException, Boolean throwExceptionOnFailure)
       在 Microsoft.Exchange.Cluster.Replay.DagHelper.AddDagClusterNode(AmServerName mailboxServerName, String& verboseLog)
       --- End of inner exception stack trace (Microsoft.Exchange.Cluster.Replay.AmClusterApiException) ---
       在 Microsoft.Exchange.Cluster.Replay.DagHelper.ThrowDagTaskOperationWrapper(Exception exception)
       在 Microsoft.Exchange.Cluster.Replay.DagHelper.AddDagClusterNode(AmServerName mailboxServerName, String& verboseLog)
       在 Microsoft.Exchange.Cluster.ReplayService.ReplayRpcServer.<>c__DisplayClass34.<RpcsAddNodeToCluster>b__33()
       在 Microsoft.Exchange.Data.Storage.Cluster.HaRpcExceptionWrapperBase`2.RunRpcServerOperation(String databaseName, RpcServerOperation rpcOperation)
       --- End of stack trace on server (EX02.office.abc) ---
       在 Microsoft.Exchange.Data.Storage.Cluster.HaRpcExceptionWrapperBase`2.ClientRethrowIfFailed(String databaseName, String serverName, RpcErrorExceptionInfo errorInfo)
       在 Microsoft.Exchange.Cluster.Replay.ReplayRpcClientWrapper.RunRpcOperationDbName(AmServerName serverName, String databaseName, Int32 timeoutMs, IHaRpcExceptionWrapper rpcExceptionWrapperInstance, InternalRpcOperation rpcOperation)
       在 Microsoft.Exchange.Cluster.Replay.ReplayRpcClientWrapper.RunRpcOperation(AmServerName serverName, Nullable`1 dbGuid, Int32 timeoutMs, IHaRpcExceptionWrapper rpcExceptionWrapperInstance, InternalRpcOperation rpcOperation)
       在 Microsoft.Exchange.Cluster.Replay.ReplayRpcClientWrapper.RunAddNodeToCluster(AmServerName serverName, AmServerName newNode, String& verboseLog)
       在 Microsoft.Exchange.Management.SystemConfigurationTasks.AddDatabaseAvailabilityGroupServer.JoinNodeToCluster()
    [2013-11-03T03:42:50] Updated Progress '完成!' 100%.
    [2013-11-03T03:42:50] 已完成
    add-databaseavailabiltygroupserver explicitly called CloseTempLogFile().

    2013年11月3日 7:50

全部回复

  • 您好!

    1.请检查群集服务以及所有需要的Exchange服务是开启的。

    2.请参考以下文章disable TCP Chimney Offload。

    http://support.microsoft.com/kb/951037

    3.请确认你的server之间可以互相ping通。


    微软一站式示例脚本库: http://blogs.technet.com/b/onescript

    2013年11月4日 5:21
  • 首先非常感谢您耐心的解答,这个问题确实让我很困扰,现在服务器正在使用也没办法重装,只能一点点排错进行检查。针对您所说的3个步骤,我的检测结果如下:

    1. 分别在两个服务器执行Test-ReplicationHealth和Test-ServiceHealth,检测结果如下:

    注:Ex02为第一台MBX且已经添加到DAG中,MB02是第二台MBX即添加到DAG报错的那台。

    Ex02执行 Test-ReplicationHealth命令显示:

    Server          Check                      Result     Error
    ------          -----                      ------     -----
    EX02            ReplayService              已通过
    EX02            ActiveManager              已通过
    EX02            TasksRpcListener           已通过

    Ex02执行Test-ServiceHealth命令显示:

    Role                    : 邮箱服务器角色
    RequiredServicesRunning : True
    ServicesRunning : {IISAdmin, MSExchangeADTopology, MSExchangeIS,

                                  MSExchangeMailboxAssistants, MSExchangeMailSub mission,

                                  MSExchangeRepl, MSExchangeRPC, MSExchangeSA, MSExchangeSearch,

                                  MSExchangeServiceHost, MSExchangeThrottling,

                                  MSExchangeTransportLogSearch, W3Svc, WinRM}
    ServicesNotRunning : {}

    MB02执行结果与上完全一样,这里就不贴出了。

    2. 已执行netsh int tcp set global chimney=disabled禁用了 TCP 烟囱卸载功能。

    3. Ex02和MB02之间无论是ping还是nslookup正反解析均正常。


    最后删除之前添加的DAG记录,并到域控服务器中删除了DAG计算机清楚了相应的DNS记录,重新添加DAG问题依旧。


    • 已编辑 John Loh1 2013年11月4日 16:35 格式问题
    2013年11月4日 16:33
  • 您好!

    我推荐你安装最新的clussvc.exe试试。

    http://support.microsoft.com/kb/2616514


    微软一站式示例脚本库: http://blogs.technet.com/b/onescript

    2013年11月5日 2:07