vmclusres.dll either crashed or deadlocked / WSFC Resource Deadlock

    Question

  • I am trying to investigate a WSFC Resource Deadlock / RHS failure. Here is what I found so far:

     

    000017a8.00002588::2011/12/29-22:24:32.196 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Control(UNKNOWN (0x1600004) obj: .MU. flags:2097156 code:0 access:<<insert {4} refers an argument that is not supplied. Only 4 argument(s) provided>>) to resource 'SCVMM coe7vlab028dv2 Configuration' timed out.'
    000017a8.00002588::2011/12/29-22:24:32.196 WARN  [RCM] ResourceControl(UNKNOWN (0x1600004) obj: .MU. flags:2097156 code:0 access:<<insert {4} refers an argument that is not supplied. Only 4 argument(s) provided>>) to SCVMM coe7vlab028dv2 Configuration returned 5910.
    0000061c.00001484::2011/12/29-22:24:33.007 ERR   [RHS] RhsCall::DeadlockMonitor: Call RESOURCECONTROL timed out for resource 'SCVMM coe7vlab028dv2 Configuration'.
    0000061c.00001484::2011/12/29-22:24:33.007 INFO  [RHS] Enabling RHS termination watchdog with timeout 1200000 and recovery action 3.
    0000061c.00001484::2011/12/29-22:24:33.007 ERR   [RHS] Resource SCVMM coe7vlab028dv2 Configuration handling deadlock. Cleaning current operation and terminating RHS process.
    0000061c.00001484::2011/12/29-22:24:33.007 ERR   [RHS] About to send WER report.
    000017a8.00002588::2011/12/29-22:24:33.007 WARN  [RCM] HandleMonitorReply: FAILURENOTIFICATION for 'SCVMM coe7vlab028dv2 Configuration', gen(0) result 4.
    000017a8.00002588::2011/12/29-22:24:33.007 INFO  [RCM] rcm::RcmResource::HandleMonitorReply: Resource 'SCVMM coe7vlab028dv2 Configuration' consecutive failure count 1.
    0000061c.00001484::2011/12/29-22:24:34.146 ERR   [RHS] WER report is submitted. Result : WerReportQueued.
    000017a8.00002588::2011/12/29-22:24:34.177 ERR   [RCM] rcm::RcmMonitor::RecoverProcess: Recovering monitor process 1564 / 0x61c
    000017a8.00002588::2011/12/29-22:24:34.177 INFO  [RCM] Created monitor process 16556 / 0x40ac

     

    Any suggestions? I guess I will need to use the Debug Diagnostic Tool to look at what happened with 000017a8.00002588.

    Correct?
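
    To check which process each line comes from, the hex prefix on every log line can be decoded. This is only a minimal sketch, assuming the layout seen in the excerpt above (hex process ID, hex thread ID, timestamp, level, component in brackets). The function name and the regular expression are hypothetical and not part of any cluster tooling.

```python
import re

# Layout assumed from the excerpt above (not an official format description):
#   <hex pid>.<hex tid>::<date>-<time> <LEVEL> [<COMPONENT>] <message>
LINE_RE = re.compile(
    r"^(?P<pid>[0-9a-fA-F]{8})\.(?P<tid>[0-9a-fA-F]{8})::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<level>[A-Z]+)\s+\[(?P<component>[A-Z]+)\]\s+(?P<msg>.*)$"
)


def parse_cluster_log_line(line):
    """Hypothetical helper: split one log line into fields, or return None."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    return {
        "pid": int(m.group("pid"), 16),  # hex process ID as decimal
        "tid": int(m.group("tid"), 16),  # hex thread ID as decimal
        "timestamp": m.group("ts"),
        "level": m.group("level"),
        "component": m.group("component"),
        "message": m.group("msg"),
    }


if __name__ == "__main__":
    # Two lines copied from the excerpt above.
    sample = [
        "0000061c.00001484::2011/12/29-22:24:33.007 ERR   [RHS] About to send WER report.",
        "000017a8.00002588::2011/12/29-22:24:34.177 ERR   [RCM] rcm::RcmMonitor::RecoverProcess: "
        "Recovering monitor process 1564 / 0x61c",
    ]
    for raw in sample:
        rec = parse_cluster_log_line(raw)
        print(rec["component"], "pid", rec["pid"], hex(rec["pid"]))
```

    Decoded this way, the [RHS] lines carry process ID 0x61c (1564), the same monitor process that the RecoverProcess line says is being recovered, while 000017a8 is the process writing the [RCM] lines. That suggests the timed-out call sat in process 1564 rather than in 000017a8, though the log alone does not prove it.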

    Friday, December 30, 2011 1:49 AM

Answers

  • This post is older than 30 days or we have not heard back from you. Did this issue get resolved? If so, please share with the community how you resolved it. Otherwise, re-activate the post if you still require assistance. Forums are best-effort support, so if you require more detailed assistance I recommend opening a case with Microsoft Support.


    Mohamed Fawzi | http://fawzi.wordpress.com

    Saturday, October 06, 2012 1:26 AM
    Moderator

All replies

  • A few more pieces of information. This is a cluster with 3 nodes for now (more to come online).

    We are using this Hyper-V cluster to support Windows 7 virtual desktops under Citrix XenDesktop.

    The nodes are 2U Supermicro systems with 4 x AMD 6274 CPUs (Interlagos / Bulldozer) and 256 GB of memory. Each node is equipped with a QLogic 8242 HBA (2 x 10Gb) connected to two Cisco Nexus switches. We are doing FCoE and connecting to a NetApp 3210 for storage. We are running the latest version of the NetApp MPIO and SnapDrive stack on each system.
    Friday, December 30, 2011 2:07 PM
  • Any idea or suggestion?
    Monday, January 16, 2012 12:56 AM
  • Did this question get resolved? We are experiencing identical issues
    Tuesday, July 24, 2012 8:49 AM