none
Backup succeeds but restore fails to host, "computer is unreachable" RRS feed

  • Question

  • We recently stood up a new SAN, which is currently hosting some virtual machines that are on their own subnet.  We have a single DPM 2010 server on our existing subnet.  All machines are on the same domain and there is no firewall between them.  All machines are Server 2008 R2.  DPM is able to install agents on and backup data from virtuals on the new subnet, but it cannot restore data to them.  Any restore job of any data with those machines as its target fails after 7 minutes.  The error on DPM is

    DPM failed to communicate with DPM.DOMAIN.com because the computer is unreachable. (ID 41 Details: No such host is known(0x80072AF9))

    Checking the status of the relevant agents from the Management tab returns "OK."  I am confused by the fact that the DPM server lists itself as the machine that is unreachable, and I don't know how to interpret the errlog files any further.

    DPMRACurr.errlog from the target system

    Idle Timer created with timeout = 390000
    CDsmReceiverSubTaskBase: constructor [0000000002A03A80] openovl[0000000002A03BE8] msgovl[0000000002A03CD0] closeovl[0000000002A03C30]
    CRestoreSubTask: constructor [0000000002A091F0]
    CFileWriterIterator:Init(overwrite:2, setarchive:0, setchangetime:1, skiptargetmetadatarestore:1, restoresec:0, recreaterp:0, throttle:0) called
    DsmReceiver:AddAlternateLocationMapping(filepath:\\?\Volume{9a6999a6-b818-11de-9c8f-00155d0b1e0b}\, filespec:*, destination:\\?\Volume{0d1741bb-a9ee-11e1-839b-806e6f6e6963}\test\) called
    CFileWriterIterator::AddAlternateLocationMapping(filepath:\\?\Volume{9a6999a6-b818-11de-9c8f-00155d0b1e0b}\, filespec:*, destination:\\?\Volume{0d1741bb-a9ee-11e1-839b-806e6f6e6963}\test\) called
    CRestoreSubTask::TriggerDone [0000000002A091F0]
    Failed: Hr: = [0x00000000] IsCancelEvent: completion: 0xa006, signature: 0xaabbcc00
    CRestoreSubTask::CancelDataMove [0000000002A091F0]
    CDsmReceiver::Cancel, hCancelReason: 0x80990a51
    CDsmReceiverSubTaskBase::Cancel, hCancelReason: 0x80990a51, subtask state: 1
    Hr: = [0x80990a51] CDsmSubTaskBase::ErrorCleanup: subtask state: 1
    CRestoreSubTask::CleanUp [0000000002A091F0]
    CDLSRuntime::CloseSession: Removing not completed ACCEPT session for subTask: 0000000000000000, 000000000071C630
    CRestoreSubTask::GetFinalStatus [0000000002A091F0]
    <?xml version="1.0"?>
    <Status xmlns="http://schemas.microsoft.com/2003/dls/StatusMessages.xsd" StatusCode="-2137453999" Reason="Error" CommandID="RAWriteDataset" CommandInstanceID="2defcc36-8ab4-4b49-b7e1-dc67bd366840" GuidWorkItem="5858981f-8b6f-47cb-9c64-b007bca6f136" TETaskInstanceID="6cc64b96-7783-48c0-8099-626b06d4e5a8"><ErrorInfo xmlns="http://schemas.microsoft.com/2003/dls/GenericAgentStatus.xsd" ErrorCode="998" DetailedCode="-2137453999" DetailedSource="2"/><RAStatus><RAWriteDataset xmlns="http://schemas.microsoft.com/2003/dls/ArchiveAge

    MSDPMCurr.errlog from the DPM server

    GenerateWorkplan(job=b87ee4f9-3fe0-45fd-8504-790d2fabbaf7) - Entering
    GenerateWorkplan(job=b87ee4f9-3fe0-45fd-8504-790d2fabbaf7) - Finished (error=False), executing GenerateWorkplanCallback
    Setting up Fsm: verb[ef6b8adb-5995-45ac-d881-469b0a62ce48]
    DatamovePollingCmdTimeoutInMin = 30
    ExecuteTask(job=b87ee4f9-3fe0-45fd-8504-790d2fabbaf7) - Starting task instance
    Publishing event from TaskInstance.cs(407): TaskStart, [TaskID=6cc64b96-7783-48c0-8099-626b06d4e5a8]
    Task: Initial transition enqueued, startState='Restore.StartState'
    Restore.SetSnapshotContext.PT : <-- Exited FSM block with SUCCESS
    Restore.DataTransferLoop.DatasetDataTransferLoop : ==> Entrering FSM block
    Restore.DataTransferLoop.CommonLoop : RAReadDataset, StatusReason = Error (StatusCode = -2147013895, ErrorCode = HostUnreachable, workitem = 8b9adfcd-9c16-4f70-9edc-8d7f871b1bb0)
    Response: <?xml version="1.0"?>
    <Status xmlns="http://schemas.microsoft.com/2003/dls/StatusMessages.xsd" StatusCode="-2147013895" Reason="Error" CommandID="RAReadDataset" CommandInstanceID="c00acb5c-ba7e-4361-bd05-14d16a8f81e1" GuidWorkItem="8b9adfcd-9c16-4f70-9edc-8d7f871b1bb0" TETaskInstanceID="6cc64b96-7783-48c0-8099-626b06d4e5a8"><ErrorInfo xmlns="http://schemas.microsoft.com/2003/dls/GenericAgentStatus.xsd" ErrorCode="2014" DetailedCode="-2147013895" DetailedSource="2"><Parameter Name="AgentTargetServer" Value="DPM.DOMAIN.com"/></ErrorInfo><RAStatus><RAReadDataset xmlns="http://schemas.microsoft.com/2003/dls/ArchiveAgent/StatusMessages.xsd" BytesTransferred="0" NumberOfFilesTransferred="0" NumberOfFilesFailed="0" DataCorruptionDetected="false"/></RAStatus></Status>
    CommonLoop: Started tracking the receiver end timeout for final response
    DataTransferLoop [RAReadDataset, RAWriteDataset]: FAILURE -- RAReadDatasetFailure
    AgentStatus[RAForRead] - (CommandID=RAReadDataset, StatusReason=Error) failed with HRESULT 0x80072AF9, error -2147013895.
    error RmHostUnreachable has been already supplied - do not override it with HostUnreachable
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    CCoreServices::CheckForPendingReboot() entered
    CEngineServices::CheckForPendingReboot entered
    Failed: Hr: = [0x80070002] : F: lVal : (UINT)RegOpenKeyExW((( HKEY ) (ULONG_PTR)((LONG)0x80000002) ), (LPCWSTR)rebootregkey, 0, ((((0x00020000L)) | (0x0001) | (0x0008) | (0x0010)) & (~(0x00100000L))), &key)
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    CCoreServices::CheckForPendingReboot() entered
    CEngineServices::CheckForPendingReboot entered
    Failed: Hr: = [0x80070002] : F: lVal : (UINT)RegOpenKeyExW((( HKEY ) (ULONG_PTR)((LONG)0x80000002) ), (LPCWSTR)rebootregkey, 0, ((((0x00020000L)) | (0x0001) | (0x0008) | (0x0010)) & (~(0x00100000L))), &key)
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    RARequestor: No subtask exists for RAReadDataset in GetWorkItemInfo response, cancelling the polling
    CommonLoop: Posting LoopComplete (Failure) message
    DataTransferLoop [RAReadDataset, RAWriteDataset]: FAILURE -- DataTransferLoop
    Sending Command RACancelAllSubTasks to RAForRead on server DPM.DOMAIN.com for workitem 8b9adfcd-9c16-4f70-9edc-8d7f871b1bb0
    Sending Command RACancelAllSubTasks to RAForWrite on server TARGET.DOMAIN.com for workitem 5858981f-8b6f-47cb-9c64-b007bca6f136
    RestoreMachine : FAILURE - RESTORE, errorCode=RmHostUnreachable
    Publishing event from ServerAlert.cs(650): AlertStateChange, [ObjectId=aac638f6-8f3c-46af-8b6c-e5c851d2c9df]
    StopAutoHealActionTimer: Timer for alertId=[98ccaa24-d5ca-47d8-9310-c176358095aa] not found
    Publishing event from ServerAlert.cs(556): AlertStateChange, [ObjectId=aac638f6-8f3c-46af-8b6c-e5c851d2c9df]
    RestoreMachine: Skipping Unlock()
    Task Diagnostic Information - <?xml version="1.0" encoding="utf-16"?>
    <TaskExecutionContext>
      <PrmDatasourceId>fe299e4d-daf1-46ef-8d2c-b6a61dfb8d20</PrmDatasourceId>
      <PrmWriterId>00000000-0000-0000-0000-000000000000</PrmWriterId>
      <PrmActiveNode>TARGET.DOMAIN.com</PrmActiveNode>
      <PrmDatasetId>3d9453fe-e05c-4100-90ee-31dc6abb400c</PrmDatasetId>
      <PrmLogicalReplicaId>72ecfb60-cc30-4809-a352-25688763574c</PrmLogicalReplicaId>
      <TEVerb>RestoreAsFilesFromSC</TEVerb>
      <TEErrorState>Restore.DataTransferLoop.DataTransferLoopWaitState</TEErrorState>
      <TEErrorDetails>&lt;?xml version="1.0" encoding="utf-16"?&gt;
    &lt;q1:ErrorInfo ErrorCode="41" DetailedCode="-2147013895" DetailedSource="2" ExceptionDetails="" xmlns:q1="http://schemas.microsoft.com/2003/dls/GenericAgentStatus.xsd"&gt;
      &lt;q1:Parameter Name="agenttargetserver" Value="DPM.DOMAIN.com" /&gt;
      &lt;q1:Parameter Name="servername" Value="DPM.DOMAIN.com" /&gt;
    &lt;/q1:ErrorInfo&gt;</TEErrorDetails>
    </TaskExecutionContext>
    Publishing event from TaskInstance.cs(825): TaskStop, [TaskID=6cc64b96-7783-48c0-8099-626b06d4e5a8]
    Task stopped (state=Failed, error=RmHostUnreachable; -2147013895; WindowsHResult), search "Task Diagnostic Information" for details.
    RetireWorkplan(job=b87ee4f9-3fe0-45fd-8504-790d2fabbaf7) - Calling derived RetireWorkplan
    Retiring task for (job=b87ee4f9-3fe0-45fd-8504-790d2fabbaf7)
    RetireWorkplan(job=b87ee4f9-3fe0-45fd-8504-790d2fabbaf7) - Retiring task: Failed
    CCoreServices::UpdateAlertVisibilityIfInvisible() entered
    CEngineServices::UpdateAlertVisibilityIfInvisible entered. alertId={B1DF0839-D8E2-4EC7-933D-868D0FE77327}, newVisibility=0, invisibilityTimeout=0




    • Edited by JeremySD Wednesday, June 6, 2012 7:21 PM
    Wednesday, June 6, 2012 7:16 PM

All replies

  • This is old. Did you ever resolve this? I had a strange issue somewhat similar to this before. Can you communicate both ways from client to DPM and from DPM to client? I know you said no firewall but I would double check this. Can you telnet to client and from client to DPM on port 445? Does DNS resolve both ways?

    My Blog | www.buchatech.com | www.systemcenterportal.com
    If you found this post helpful, please give it a "Helpful" vote. If it answered your question, remember to mark it as an "Answer". This posting is provided "AS IS" with no warranties and confers no rights! Always test ANY suggestion in a test environment before implementing!

    Saturday, January 12, 2013 6:27 AM
    Moderator