Windows Server TechCenter > Windows Server Forums > Clustering > createing a 2 node cluster server 2008
Ask a questionAsk a question
 

Answercreateing a 2 node cluster server 2008

  • Wednesday, October 21, 2009 7:15 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    Hi,

    This has been giving me issues for 2 days.  i have 2 HP Proliant DL380 G5 servers and an HP MSA2012sa for shared storage connected with MINI SAS cables.  for the life of me i cant get the cluster created!

    i run the validation and everything validates fine accept for that i dont have rights to the default computers folders on our domain, no problem though because i have created the comptuer name for the cluster in an OU that i have full rights to and have disabled it. 

     when i start to create the cluster it starts to run fine untill it gets to "forming cluster"  then it says "the opteraion is taking longer than expeced" and after 2 minutes it bombs out. 

    after forming cluster it says unable to successfuly cleanup. 
    to trouble shoot cluster creation problems run the validate a confguration wizard on the servers you wan to cluster. 
    an error occurred while creating the cluster. an error occurred creating cluster 'clustername' this operation returned because of the timeout period expired.

    i have created the cluster using only one node and tried to add the 2nd node later and it will also not work. The server 'servername' could not be added to the cluster. An error occurred while adding node servername to cluster servername. The cluster node is not reachable.

    i can ping both servers from both servers i have no idea why it would say its unreachable. 

    i have read a book on how to create clusters and the book made this sound so simple and i would have no problems, this could not be farther from the truth.  can anyone tell me what im doing wrong?!?!?!

    thanks

    Eric

Answers

  • Tuesday, October 27, 2009 2:41 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     Answer
    We finally figured it out.  we had to disable symantec endpoint protection. 

    thanks for coming! shoot me now please.

    • Marked As Answer bygodellian Tuesday, October 27, 2009 2:41 PM
    •  

All Replies

  • Wednesday, October 21, 2009 7:47 PMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     

    Hi Eric,

    to verify that I have understood everything correctly can you answer the following questions:

    Did the validation test run without any errors ?
    Are you able to create a cluster with one node and cluster service up and running. Can you also please verify if a computer account was created for the cluster name
    Is the network enabled for cluster communication, if it is possible please describe your network infrastructure



    ThorstenWujek
  • Wednesday, October 21, 2009 8:01 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    The validation test ran with only one warning -
    "The user running validate, does not have permissions to create computer objects in the 'nd.gov' domain."

    When createing a 1 node cluster the cluster service is up and running.

    a computer account is created for the cluster name and when creating the cluster it is disabled and the account creating the cluster has full rights to that OU.

    im not sure what you mean by asking "is the network enabled for cluster communications" both servers are on a switch and can communicate on the switch, they are also connected to each other via crossover cable.

    thanks for the reply

    eric
  • Wednesday, October 21, 2009 8:24 PMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     

    can you create a cluster log and send it ?

    I am sorry but I have to leave and will try to work on your problem tomorrow.

    Bye


    ThorstenWujek
  • Wednesday, October 21, 2009 9:10 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     

    this is my report when trying to create the cluster and it fails.

    Create Cluster

     


    Cluster:  agoclstr
    Node:  AGOCLSTR1.nd.gov
    Node:  agoclstr2.bci.ago.nd.gov
    IP Address:  10.8.74.122
    Started 10/21/2009 4:04:25 PM
    Completed 10/21/2009 4:07:35 PM

    Beginning to configure the cluster agoclstr.
    Initializing Cluster agoclstr.
    Validating cluster state on node AGOCLSTR1.nd.gov.
    Searching the domain for computer object 'agoclstr'.
    Verifying computer object 'agoclstr' in the domain.
    Configuring computer object 'agoclstr' as cluster name object.
    Validating installation of the Network FT Driver on node AGOCLSTR1.nd.gov.
    Validating installation of the Cluster Disk Driver on node AGOCLSTR1.nd.gov.
    Configuring Cluster Service on node AGOCLSTR1.nd.gov.
    Validating installation of the Network FT Driver on node agoclstr2.bci.ago.nd.gov.
    Validating installation of the Cluster Disk Driver on node agoclstr2.bci.ago.nd.gov.
    Configuring Cluster Service on node agoclstr2.bci.ago.nd.gov.
    Waiting for notification that Cluster service on node AGOCLSTR1.nd.gov has started.
    Forming cluster 'agoclstr'.
    Unable to successfully cleanup.
    To troubleshoot cluster creation problems, run the Validate a Configuration wizard on the servers you want to cluster.
    An error occurred while creating the cluster. An error occurred creating cluster 'agoclstr'. This operation returned because the timeout period expired.

    i know the nodes names do not match i dont know if thats a big deal but when i try to put in the correct name for agoclstr1 it says "an error occurred while determining if you have adminitrator privileges on 'agoclstr1.bci.ago.nd.gov'. Attempted to perform and unauthorized operation."  i an logged on as an adminitrator so if you can tell me why i get that, i will also be less confused.

    eric

  • Thursday, October 22, 2009 12:43 PMChuck Timon [MSFT]MSFT, ModeratorUsers MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    OK, we need to get both nodes in the same domain and the person exectuing the process must have admin rights to all nodes in the cluster and must be able to create COs in the domain or the process documented here - http://technet.microsoft.com/en-us/library/cc731002(WS.10).aspx must be followed.

    Thanks.
    Chuck Timon Senior, Support Escalation Engineer (SEE) Microsoft Corporation
  • Thursday, October 22, 2009 12:48 PMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    Hi eric,

    that is not the cluster log. You have to create it with the CLUSTER LOG command. for example: CLUSTER LOG /GENERATE /COPY:"path"

    BTW: Can you please provide more information about your domain structure (what it looks like/ where you want to install/ what rights you have), how many nics you are using

    Bye
    ThorstenWujek
  • Thursday, October 22, 2009 1:46 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    ok this cluster log i generated is huge, 2 MB, where would you like me to send it?  or do you want me to look for something specific and post it here?

    both servers are on the same domain both logged in with admin rights to all nodes, and have the rights required in active directory according to the process on the link mentioned by chuck.
  • Thursday, October 22, 2009 2:29 PMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     

    can you snip the parts where it says error or failed in the log and post it in this thread ?


    ThorstenWujek
  • Thursday, October 22, 2009 2:47 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    ok so i can only create the log if i have a cluster already created, so this is the log from my single node cluster that i cannot and the 2nd node to.  here is one message im getting over and over

    [RES] Network Name <Cluster Name>: Trying to obtain the VSToken for Core Cluster Name resource
    2009/10/22-03:43:15.259 INFO  [RES] Network Name <Cluster Name>: GetCoreNetnameObject_VSToken returning status 0
    :2009/10/22-03:43:15.368 WARN  [RES] Network Name <Cluster Name>: Testing agoclstr.nd.gov for dynamic updates failed over adapter 'Local Area Connection', status 9017.
    2009/10/22-03:43:15.368 INFO  [RES] Network Name <Cluster Name>: Detected DNS registration PROBLEM. will retry in  900 seconds
    2009/10/22-03:43:15.368 INFO  [RES] Network Name <Cluster Name>: TimerQueueTimer rescheduled to fire after 900 secs

    I was getting a DNS error in the event log, saying bad DNS key, im not sure if that is related to why i cant add the 2nd node. here is some more errors.

    00000d98.00000df8::2009/10/21-21:47:44.037 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqclus.dll
    00000bf4.00000d34::2009/10/21-21:47:44.037 WARN  [RCM] Failed to load restype MSMQ: error 126.

    im getting a lot more errors than this, but again, this is only after i have created a single node cluster, when i try and create a 2 node cluster from the start, it never gets past "forming cluster"

  • Friday, October 23, 2009 8:05 AMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     

    It seems that there are no sufficiant rights for the dns zone.
    Can you please provide the complete Error in the eventlog (eventid,eventsource, description) ?


    Just for testing purpose:

    try to switch the dns zone from secure to unsecure dynamic updates

    Bye


    ThorstenWujek
  • Friday, October 23, 2009 8:22 AMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    Can you also provide the output of : Cluster RES Clustername /priv

    Are you using domain suffixes on your nodes ?

    if YES - Are the suffixes host based or connection based?

    What are the suffixes, are they equal or different ?
    ThorstenWujek
  • Friday, October 23, 2009 8:54 AMJesper Arnecke Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    Hiya,

    Try to create the DNS record Host A record for your cluster manually granting the Cluster Computer object the following rights:
    Write All Properties
    Read Permissions
    All Validated Writes
  • Friday, October 23, 2009 2:19 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     

    i tossed cluster RES 'myclustername' /priv into the command line and it gave me a "system error 5007 has occurred (0x0000138f) the cluster resorce could not be found."

    we do not have rights to edit our DNS, we have to send requests to get it changed.  i have unchecked "register this connection's address in DNS"  and i sent in the request manually.

    your questions of if we use domain suffixes on my nodes, im not sure what you mean because i havent had to deal with any DNS stuff since we dont have the rights to do that.  i would have to ask.  i also have no idea what the difference between connection and host bases suffixes are.

    ive been talking with someone else on my network that has set up a 2008 server cluster and he never had any of the problems im having.  he has the same rights i have, and he used 2 comptuers that didnt even have matching hardware and got his to create just fine. 

    again my issue is when creating a 2 node cluster, i get stuck on "forming cluster" and then it fails.  i can create a single node cluster and when i try and add the 2nd node it gets to the point where it says " dererming if the node is a fully fucntioning member of the cluster"  and it fails that step.  i can actually see it on the cluster, but in offline status.  the cluster service on the new node is running, but it never gets on. 

    im now on my 4th day of this and im no closer no then where i started, after mulitple DNS change requests and complete reloads of the operating systems. 

  • Friday, October 23, 2009 2:26 PMJesper Arnecke Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    Hiya,

    If possible try, for testing sake, with another server?
    If you try and rebuild the cluster, however starting on the other server the previous do you get the same error?
  • Friday, October 23, 2009 4:14 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    ive tried building the cluster starting from the 2nd node first, same exact results.
  • Friday, October 23, 2009 4:39 PMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    Hi Eric, I really can understand your frustration, but you can imagine that it is hard to determin the root cause of this behaviour with not all the right within the infrastructure.

    What seems likely is that it has to do with DNS, because your servers are using different suffixes bci.ago.nd.gov and nd.gov.

    Read http://blogs.msdn.com/clustering/archive/2009/07/17/9836756.aspx

    Maybe you can check regarding the article.


    ThorstenWujek
  • Friday, October 23, 2009 4:45 PMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     

    do you have rights to check the cluster nodes computeraccounts spns ? If yes can you provide them ?

    Thank you


    ThorstenWujek
  • Friday, October 23, 2009 6:52 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    i have changed the name in our DNS servers from bci.ago.nd.gov to just nd.gov, so now it actually matches the comptuer name in DNS, i still get the same results i did before making any DNS changes. 

    i probably have rights to check the cluster nodes comptueraccounts spns, but im not sure what that is exactly, could you tell me how to do that?

    thanks for not abandoning the issue im having just yet!

    eric
  • Friday, October 23, 2009 7:18 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    ok i have a new error log.  i have created a single node cluster and tried to connect another node to it.  this is the only error we get in the log file, ill copy paste a few lines above and below the error.

    00000f44.000000e4::2009/10/23-18:49:23.183 DBG   [NETFTAPI] Signaled NetftLocalConnect  event for 169.254.2.44
    00000f44.000000e4::2009/10/23-18:49:23.183 DBG   [NETFTAPI] Signaled NetftLocalConnect  event for 169.254.2.44
    00000f44.000009d4::2009/10/23-18:49:23.183 INFO  [IM] got event: Local endpoint 169.254.2.44:~0~ connected
    00000f44.00000c24::2009/10/23-18:49:28.583 ERR   [QUORUM] Node 2: Fail to form/join a cluster in 6:15.000
    00000f44.00000c24::2009/10/23-18:49:28.583 INFO  [CORE] Node quorum state is 'Not yet formed or joined a cluster'. Form/join status with other nodes is as follows:
    00000f44.00000c24::2009/10/23-18:49:28.583 INFO  [NODE] Node 2: Farthest reported progress joining with node AGOCLSTR1 (id 1) is: Wait for NetFT Duplicate Address Detection at time 2009/10/23-18:49:21.053: status 0
    00000f44.00000c24::2009/10/23-18:49:28.583 ERR   join/form timeout (status = 258)
    00000f44.00000c24::2009/10/23-18:49:28.583 ERR   FatalError is Calling Exit Process.     00000000:00a5c31b( ERROR_MOD_NOT_FOUND(126) )
    00000f44.00000c24::2009/10/23-18:49:28.583 ERR       00000000:00a422a4( ERROR_MOD_NOT_FOUND(126) )
    00000f44.00000c24::2009/10/23-18:49:28.583 ERR       00000000:00a2a345( ERROR_MOD_NOT_FOUND(126) )
    00000f44.00000c24::2009/10/23-18:49:28.583 ERR       00000000:00a02c11( ERROR_MOD_NOT_FOUND(126) )
    00000f44.00000c24::2009/10/23-18:49:28.583 ERR       00000000:00adb2c0( ERROR_MOD_NOT_FOUND(126) )

    i cant seem to google anyone on the line i have bolded.  can anyone get anything out of this error message?
  • Saturday, October 24, 2009 9:15 AMThorsten Wujek Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    hi,

    you can determin spns through adsiedit or setspn. I think adsiedit is easier. So in adsiedit connect to default context. Then find your node computeraccount. Open the properties for that object and search for serviceprinciplename attribute.
    Please tell me what is listed here.

    The submitted Error log is not significant enough. If you are interested you can send your log to Thorsten.wujek@t-online.de. And I will go through it.

    Bye 
    ThorstenWujek
  • Tuesday, October 27, 2009 2:41 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     Answer
    We finally figured it out.  we had to disable symantec endpoint protection. 

    thanks for coming! shoot me now please.

    • Marked As Answer bygodellian Tuesday, October 27, 2009 2:41 PM
    •  
  • Tuesday, November 10, 2009 2:20 PMdanielategan Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     
    Hi Eric

    Hope you can help me. I'm also setting up a cluster for the 1st time, but I have an issue at the moment.
    We have 2 Windows 2008 servers and an HP MSA2012sa device for storage.

    I've created a raid 5 array on the MSA unit and created only 1 volume and mapped the 2 servers to that volume.
    I then formatted the drive on the one server and it's now showing up as drive D on both servers.
    The problem I have is that when I create a folder on Server 1 it does not show up on Server 2. I need to restart the servers before I can see the created folders from the other server.
    Can you please let me know how you created the shared volume between your servers.

    Regards

    Danie
  • Tuesday, November 10, 2009 9:05 PMgodellian Users MedalsUsers MedalsUsers MedalsUsers MedalsUsers Medals
     

    have you already created the cluster?  if you have then one of your nodes probably owns the volume and the 2nd node is not going to see the folder.  the reason why resetting works is because it shows the node is offline so it moved the resource to the 2nd node and then you can finally see it. 

    i created 1 vitural disk, 3 volumes.  one for the whitness drive, one for the DTC and one for the SQL storage.  after i created the volumes and mapped them, i checked the disks management of the disks and took them online, formatted them and gave them drive letters that matched.  i then created the cluster and installed SQL. 

    let me know what else you have tried.