Quantcast
Channel: High Availability (Clustering) forum
Viewing all 2783 articles
Browse latest View live

Network Load Balancing - "access denied" when loading configuration information from host2

$
0
0

We have 2 Windows 2012 R2 servers, both are running on workgroup.

We set up NLB cluster.  When we open NLB Manager on the server2, then message shows "loading configuration information. Access denied. Error connecting to server1". There is no issue doing this on server1, NLB Manager is able to connect to both servers. We login using default administrator account, both account name and password are the same for 2 servers.

When we check security event log on server1, there is this strange Audit Failure log using account "test_nlb" from server2 which related to "Access denied" error. Please let us know how to resolve this. Thanks in advance.

      Event ID: 4776

      The computer attempted to validate the credentials for an account.

      Authentication Package:   MICROSOFT_AUTHENTICATION_PACKAGE_V1_0

      Logon Account:   test_nlb

      Source Workstation:   WPAAP2

      Error Code:   0xc0000064                                  

      An account failed to log on.

     

Event ID: 4625

Subject:

    Security ID:       S-1-0-0

    Account Name:       -

    Account Domain:       -

    Logon ID:       0x0

Logon Type:           3

Account For Which Logon Failed:

    Security ID:       S-1-0-0

    Account Name:       test_nlb

   Account Domain:       WPAAP2

Failure Information:

    Failure Reason:       Unknown user name or bad password.

    Status:           0xc000006d

    Sub Status:       0xc0000064

 

Process Information:

    Caller Process ID:   0x0

    Caller Process Name:   -

 

Network Information:

    Workstation Name:   WPAAP2

    Source Network Address:   192.168.70.45

    Source Port:       55136

 

Detailed Authentication Information:

    Logon Process:       NtLmSsp

    Authentication Package:   NTLM

    Transited Services:   -

    Package Name (NTLM only):   -

    Key Length:       0

 

This event is generated when a logon request fails. It is generated on the computer where access was attempted.

 

The Subject fields indicate the account on the local system which requested the logon. This is most commonly a service such as the Server service, or a local process such as Winlogon.exe or Services.exe.

 

The Logon Type field indicates the kind of logon that was requested. The most common types are 2 (interactive) and 3 (network).

 

The Process Information fields indicate which account and process on the system requested the logon.

 

The Network Information fields indicate where a remote logon request originated. Workstation name is not always available and may be left blank in some cases.

 

The authentication information fields provide detailed information about this specific logon request.

    - Transited services indicate which intermediate services have participated in this logon request.

    - Package name indicates which sub-protocol was used among the NTLM protocols.

    - Key length indicates the length of the generated session key. This will be 0 if no session key was requested.

 

 

 

 

 

 

 



High Availability Clustering & Hyper-V Server 2012 Adapter Bindings

$
0
0

Hi thanks for reading. I have a  2 Server 2012R2 lab setup with Hyper-V and clustering.  I have setup this before but using 5 single physical network adapters but now I have teams so a bit confused as to the adapter bindings and if the teams need to be added or just the vEthernet Nics?.

Nic 1,2            HostVMSwitch

Nic 3,4,5         HostMgmtSwitchTeam

The adapter Binding settings are:

HostMgmtSwitchTeam

V-Curric

Nic 3

Nic 4

Nic 5

V-Livemigration

HostVMSwitch

Nic 1

Nic 2

V-iSCSI

V-HeartBeat


Paul Edwards

Failover cluster installation in remote site - DMZ with only read-only domain controller

$
0
0

Hi,

I have a question regarding the new setup of a two node failover cluster with W2K8R2 nodes in a kind of DMZ, which means the site is separated from the LAN/AD by a firewall. Rules are set on the firewall that allow replication only to a read only Domain Controller that is located in the site.

Installation of the cluster fails, even after pre-staging the cluster nodes and on the LAN side of the firewall.

Is this a supported configuration at all, or do the firewall admins have to open ports/apply additional rules to allow the two future cluster nodes communication with a writable DC in the LAN (behind the firewall) as I suggest?

Placing a writabe DC in the site is not an option. Creating a separate AD/forest with trust to the main forest is also not an option.

Any other recommendations / procedures ? Which ports/protocol must be opened on the firewall for the cluster nodes IPs ?

 

Thanks in advance!

Cheers!


Live Migration only using one live migration adapter

$
0
0

Hi,

I have a 2 node cluster spanning two buildings, 4 switches (2 in each building), 2 live migration NICs in each server. I have NIC1 going through switch 1 and NIC2 going through switch 2, same in the other building.

everything talks fine to each other. I decided to team the Live Migration networks so I can get more transferring at once (getting 2Gbps instead of 1, this seemed like a good idea at the time but when I ran a test scenario of failing one of the switches the live migration was unable to complete (despite the switches being stacked together, maybe I have a config issue I dont know).

so I destroyed the team and thought I'd have two seperate live mig channels, i select these two live mig channels in as live migration networks in failover cluster manager. when i try to live migrate a bunch of VM's it is only using the one adapter, no matter how many VM's are going down there it will deal with 2 at a time (as per hyper-v settings). 

the question is, why is not using my other live migration channel even though that adapter is configured correctly, I have even moved it up in the list and then it will use it so i know it works, but it won't use both together...?

thanks

Steve

WSUS Email Stopped

$
0
0

Server is 2008 R2 running WSUS 3.2

I get emails about downloaded updates.  But about 4 months ago, I stopped getting emails about servers no longer reporting.

I cant seem to find the options to email reports to troubleshoot this.  Everything was working, and now it is not emailing anymore.  Anyone know where to start looking?


BlankMonkey

Question on Cluster network configuration

$
0
0

Hi Guys,

Just wanted to know the best practices for configuring network for clusters.
As far as I know, recommended is to have a separate network for Public/client communications and separate network for private heartbeat communication.

Does this change for Win 2008 & Win 2012 Clusters? I say this because I heard that there is no more need to have 2 dedicated networks and only 1 network is enough.
Is this true? If it is true , does it not a single point of failure ? What is the best practices for configuring sql clusters?

Thanks in advance.

How to create cluster file shares on 2008 R2 usingt he command line

$
0
0

Hi,

I know I can use the cluster command to created file shares which are cluster aware on 2003 R2, but this doesn't work as well on 2008 R2 (https://support.microsoft.com/kb/284838?wa=wsignin1.0) - in addition, the properties mapping won't work on 2008 R2 such as "cluster . res "ClusterFS" /AddDep:"Disk X:".

Is there a supported way to create a cluster file share via the command line on 2008 R2? I've tested this using "net share" and feeding my input parameters in, as long as I do it on the active node and use a clustered disk, the share is visible on the cluster node and it moves over with my file server resource when I move services to a second node.

Net share file share creation works fine, but I'm wondering whether it's supported for clusters and if not, what's the recommended Ms way? I striggled to do this in PowerShell, which is odd as I thought creating a cluster file share would be bread and butter for PS.

Thanks

How to compare Windows Updates applied for 2 nodes within cluster and make them equal?

$
0
0

Hello, is there a more efficient or better way than eyeballing both nodes Windows Update patches that got applied in seeing if necessary updates need to be applied/removed in making both nodes the same?

Thanks in advance.


Server 2008 failover cluster "interact with desktop" is no longer supported... How to make it work!

$
0
0
 So how do you have applications run on the console that need management or need to run all the time and need to be visible...???
It makes no sense to have an Application resource if it can't interact with the desktop... Not many apps don't interact with the desktop.

Is there a work around.

TIA

LB

Clustering -- Quorum -- How do you configure a file share witness on another domain?

$
0
0

Hi There,

I am setting up Clustering in a Windows 2012 (R1) environment.

I have a domain [myDomain], two servers on two sites, they belong to the myDomain. They have two services clustered and this is working.

I now wish to configure the Quorum File Share Witness option. Due to too many complications to explain here I need the File share to be on another Domain (outside my control ... a long story) so lets say the domain is called remoteDomain.

When i am configuring the Quorum option in the cluster wizard and I enter this share i get a "is not valid share path" NOW for info using normal windows explorer i can get to this other share be it i have to enter credentials in the usual windows pop-up box but it does work.

Am i to assume that the file share witness must be on the same domain as the cluster? If not what do you do to configure this as at a lit of a loss.

Sorry if i have just asked the daftest question here ;)

Thanks,

Steve


How to assign SMB storage to CSV in HV failover cluster?

$
0
0

I have a Hyper-V Cluster that looks like this: Clustered-Hyper-V-Diagram

  • 2012 R2 Failover Cluster
  • 2 Hyper-V nodes
  • iSCSI Disk Witness on isolated "Cluster Only" Network
  • "Cluster and Client" Network with nic-team connectivity to 2012 R2 File Server
  • Share configured using: server manager > file and storage services > shares > tasks > new share > SMB Share - Applications > my RAID 1 volume.

My question is this: how do I configure a Clustered Shared Volume?  How do I present the Shared Folder to the cluster?

I can create/add VMs from Cluster Manager > Roles > Virtual Machines using \\SMB\Share for the location of the vhd...  but how do I use a CSV with this config?  Am I missing something?













How to ensure two roles run on same host server

$
0
0
I have a two node cluster running SQL Always On and a File Server role. How can I ensure both roles always run on the same host server such that if SQL Server fails over to Node B the file server will also fail over?

Can't bring Network Name on line

$
0
0
Can't bring Network Name on line

Cannot cannect to configuration database error was shown when trying to access site.

Checking I found Suddenly clustering isn't function. When I look at the dependency report, the Network Name is
offline but its IP address is online and pingable.

The event viewer shows several of the following:

==============
Event ID: 1205; The Cluster service failed to bring clustered service or
application 'Cluster Group' completely online or offline. One or more
resources may be in a failed state. This may impact the availability of the
clustered service or application.

==================
Event ID: 1069; Cluster resource 'Cluster Name' in clustered service or
application 'Cluster Group' failed.

==================
Event ID: 1207; Cluster network name resource 'printserver' cannot be
brought online. The computer object associated with the resource could not be
updated in domain 'domainname.com' for the following reason:
Unable to obtain the Primary Cluster Name Identity token.

The text for the associated error code is: An attempt has been made to
operate on an impersonation token by a thread that is not currently
impersonating a client.


The cluster identity 'SPSERVERCLUS$' may lack permissions required to
update the object. Please work with your domain administrator to ensure that
the cluster identity can update computer objects in the domain.

==========
Event ID: 1205; The Cluster service failed to bring clustered service or
application 'Cluster Group' completely online or offline. One or more
resources may be in a failed state. This may impact the availability of the
clustered service or application.

Environment: Windows SQL Server 2008 R2.

Did any updates created and issue?

Checked DNS name is mapped to correct IP address.

AD has no expired credentials.

Surprisingly when turned 1 DB OFF, we can connect to site but still this error persist.

I've been googling this for a while, but haven't found a solution.

Please recommend the appropriate forum for this.I'd
appreciate any ideas. Thanks.


Thanks and Regards, Parth




Thanks and Regards, Parth

Cluster Shared Volume disappeared after taking the volume offline for Validation Tests.

$
0
0

Hi,

After an unknown issue with one of our Hyper-V 4 Node cluster running on Server 2008 R2 SP1 with fibre channel NEC D3-10 SAN Storage all our cluster shared volumes were in redirecting mode and I was unable to get them back online. Only after rebooting all the nodes one by one the disks came back online. Eventlog messages indicated that I had to test my cluster validation. After shutting down all the virtual machines I set all the cluster shared volumes offline and started the complete validation test. The following warnings/errors appeared during the test.

An error occurred while executing the test.
An error occurred retrieving the
disk information for the resource 'VSC2_DATA_H'.
Element not found (Validate Volume Consistency Test)

Cluster disk 4 is a Microsoft MPIO based disk
Cluster disk 4 from node has 4 usable path(s) to storage target
Cluster disk 4 from node has 4 usable path(s) to storage target
Cluster disk 4 is not managed by Microsoft MPIO from node
Cluster disk 4 is not managed by Microsoft MPIO from node (Validate Microsoft MPIO-based disks test)

SCSI page 83h VPD descriptors for cluster disk 4 and 5 match (Validate SCSI device Vital Product Data (VPD) test)

After the test the cluster shared volume was disappeared (the resource is online).

Cluster events that are logged

Cluster physical disk resource 'DATA_H' cannot be brought online because the associated disk could not be found. The expected signature of the disk was '{d6e6a1e0-161e-4fe2-9ca0-998dc89a6f25}'. If the disk was replaced or restored, in the Failover Cluster Manager snap-in, you can use the Repair function (in the properties sheet for the disk) to repair the new or restored disk. If the disk will not be replaced, delete the associated disk resource. (Event 1034)

Cluster disk resource found the disk identifier to be stale. This may be expected if a restore operation was just performed or if this cluster uses replicated storage. The DiskSignature or DiskUniqueIds property for the disk resource has been corrected. (Event 1568)

In disk management the disk is unallocated, unknown, Reserved. When the resource is on one node and i open disk management i get the warning that i have to initialize the disk. I did not do this yet.

Reading from other posts i think that the partition table got corrupted but i have no idea how to get it back. I found the following information but it's not enough for me to go ahead with: Using a tool like TestDisk to rewrite the partition table. then rewriting the uniqueID to the disk brought everything back. But still no explaination as to why we had our "High Availability" Fail Over cluster down for nearly 2 Days. This happened to us twice within the past week.

Anybody that an idea how to solve this? I think my data is still intact.

Thanx for taking the time to read this.

DJITS.

get cluster disk name passing Disk ID

$
0
0

Hi experts,

can you share me powershell command to get cluster disk name like "Cluster Disk 2" on passing argument as "D"   where D: is drive ID.


sid


Setting the SeparateMonitor Property

$
0
0

Hi

I am trying to set the SeparateMonitor value for a cluster resource but my code seems to not be working. I have tried the following

Get-ClusterResource CAUHyper2dt | Set-ClusterParameter SeparateMonitor:$False

Get-ClusterResource | Where-Object {$_.name -eq "CAUHyper2dt"} | % {$_.SeparateMonitor='False'}

but neither of them work, I am sure my code is just a little wrong, but I cant think how to correct it. can someone help me please?

thanks

Steve

Seeing event 1564 causing failure in two node cluster, with file witness on same subnet (at same, low latency, high bandwidth site)

$
0
0

Hello all,


Yesterday, we faced an issue early in the day that our cluster removed all active nodes (exchange 2010 DAG). and the root cause was because we lost contact with the alternate cluster node (over a WAN link), and concurrently lost connectivity to the file witness share (event id 1564 was recorded).

The odd thing is that the cluster should be tolerant to having the second node be unreachable, as long as the file witness share is reachable by the node.  Being that the file witness share is located at the same physical site (possibly inside the same hypervisor host), the idea that the share is ever unreachable is very interesting!

The system engineering team has adjusted the CrossSubnetDelay and CrossSubnetThreshold values, but I do not think this will solve the problem of the file witness server being inaccessible by the node. 

[PS] C:\>cluster /prop | Select-String "Subnet"

D  CONTOSO-DAG               CrossSubnetDelay               4000 (0xfa0) # not default
D  CONTOSO-DAG               CrossSubnetThreshold           10 (0xa) # not default
D  CONTOSO-DAG               PlumbAllCrossSubnetRoutes      0 (0x0)
D  CONTOSO-DAG               SameSubnetDelay                1000 (0x3e8) # default
D  CONTOSO-DAG               SameSubnetThreshold            5 (0x5) # default

What can I further investigate in relation to the file witness share being inaccessible?  Are there any settings that we can adjust to make sure the node is more tolerant to file witness share availability?  There may be a variety of things occurring on the file witness share server, like VSS snapshots, etc.  But none of these things are expected to be interferring with the operations of (1) the OS, or (2) the availability and accessibility of any share.

Thanks,

Matt

MPIO: faster broken link detection?

$
0
0

Hi,

We're using multiple fibre channel cards in our servers (both 2008 R2 and 2012R2, but MPIO looks quite similar), and our SAN also has multiple controllers and multiple ports, so we're using MPIO, the Round Robin With Subset to be specific. We are using the Microsoft DSM for the MPIO to our storage device, and left everything at default to start with:
* Path Verify Period set to 30, but Path Verify not enabled so it should have no effect.
* Retry Count: 3, Retry Interval 1
* PDO Remove Period: 20
* As described in http://blogs.msdn.com/b/san/archive/2011/09/01/the-windows-disk-timeout-value-understanding-why-this-should-be-set-to-a-small-value.aspx, the disk timeout is on the default 60 seconds.

Now, we do a test: we let a benchmark run to generate disk-I/O and we unplug one of the fibre channel cables on the server-side to simulate a broken link. We can see that for 30 seconds there is no disk-I/O, and after this 30 seconds, I/O is resumed, over the other fibre channel port.

We want to lower this time-out so we can achieve faster failover to the other (still working, still connected) connection, for instance after 10 seconds, and thereby reducing the chance of timeouts. But this doesn't seem to work. I enabled the Path Verify-option and modified the values through the GUI, all without effect. I modified these through the registry (HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\mpio\Parameters), but that has no effect either. Even stranger: the values in the registry don't match the values in the GUI. Checking with Get-MPIOSetting I get the values from the GUI (differing from the registry values), but all these have no effect: still about 30 seconds without I/O before it's resumed.

So... what's the right way to change this behaviour? Or is it impossible to modify and will I/O always wait for 30 seconds?

Many thanks in advance!

Johan

 

Help network infrastructure with hyper and failover

$
0
0

Dear All 

I would like some advise for my network infrastructure.

i need to install 2 physical host with hyper core server 2012

then i have some virtual machine on each.

what i am trying to do is to set up a failover what i would like to achieve is in case of failure if i switch my host 1 for example automatically my vm should be working on host 2.

i did use hyper- v manager but my network configuration doest help me doing that i always thought that i had to start the failover replication manually.

so from scratch what are a best practive network configuration that i should use in order to setup a redondancy plan.

Thanks to help and advise

Why is a cluster volume marked to be run in a separate monitor?

$
0
0

Hi.

I have a customer that has created a new Hyper-V cluster with 4 volumes in their SAN. For some reason volumes 1 and 3 are marked to be run in a separate monitor though. From the cluster validation test:

Validating cluster resource Volume1.

This resource is configured to run in a separate monitor. By default, resources are configured to run in a shared monitor. This setting can be changed manually to keep it from affecting or being affected by other resources. It can also be set automatically by the failover cluster. If a resource fails it will be restarted in a separate monitor to try to reduce the impact on other resources if it fails again. This value can be changed by opening the resource properties and selecting the 'Advanced Policies' tab. There is a check-box 'run this resource in a separate Resource Monitor'.

Validating cluster resource Volume2.

Validating cluster resource Volume3.

This resource is configured to run in a separate monitor. By default, resources are configured to run in a shared monitor. This setting can be changed manually to keep it from affecting or being affected by other resources. It can also be set automatically by the failover cluster. If a resource fails it will be restarted in a separate monitor to try to reduce the impact on other resources if it fails again. This value can be changed by opening the resource properties and selecting the 'Advanced Policies' tab. There is a check-box 'run this resource in a separate Resource Monitor'.

Validating cluster resource Volume4.

The cluster can mark resources to be run in a separate monitor under certain circumstances but as far as I know the cluster has not malfunctioned or crashed so I can't figure out why it would do this on it's own.

Is there any reason why we shouldn't reset these resources to be run in a shared monitor?

Thanks!

Viewing all 2783 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>