Microsoft failover clustering feature

November 21, 2013, 10:20 am

Hello,

I have a question regarding Microsoft's failover clustering feature. I have 3 physical servers (Windows Server 2008 R2) that I want to add to a cluster so that the virtual machines can be highly available. The 3 servers are connected to a DAS. I have created only 1 LUN on that DAS.

The problem is that the LUN needs to be associated with one owner (one of the three servers), and if that owner fails, all the VMs go down (on all 3 servers). Is there any way to correct this with only one LUN, or do I have to create 3 LUNs and associate each server with a LUN?

At first I created only one LUN because I had read that one is enough, but now I consider it a flaw.

Any suggestions?

↧

Hyper-V 2012 ADMIN$ share on Cluster IP address

November 6, 2013, 8:25 am

Hi,

We have a new 2012 cluster with 2 nodes. We're trying to get a backup solution (Asigra) working to back up the cluster but it requires that the ADMIN$ share on the cluster IP address is reachable across the network - which it is not. The ADMIN$ share on each physical node is reachable.

So for example -

cluster IP = 10.0.0.1 - ADMIN$ not reachable
node 1 = 10.0.0.2 - ADMIN$ reachable
node 2 = 10.0.0.3 - ADMIN$ reachable

We have a 2008 cluster with 3 nodes, where the ADMIN$ share is reachable (I can map to it fine in Windows Explorer) - but someone else set this up and they've since left the organisation.

Anyone able to suggest how we might get this working please?

thanks, Will

↧

How to correctly change the IP address of MSCS Cluster 2008 with no service downtime?

November 25, 2013, 4:27 am

Hi Folks,

Can anyone please suggest how I can safely change the IP addresses of a 2-node Windows Server 2008 cluster set up as follows:

Server A - production Node Active,

Server B - recovery Node Passive.

I will be changing the following IP addresses on the production node (Server A) after I fail over the mailbox to the DR node (Server B):

NIC 1 public IP

NIC 1 CMS virtual IP

NIC 1 MSCS virtual IP

NIC 2 cluster private network IP 1 (which receives the mailbox replication data from the DR mailbox node)

NIC 2 cluster private network IP 2 (which sends the mailbox replication data to the DR mailbox node)

Please let me know your thoughts on whether this is possible without causing any downtime apart from the failover from one node to the other.

Thanks

/* Server Support Specialist */

↧

2008R2 cluster, 1 node states: DNS server failure

November 26, 2013, 3:24 am

Hello,

I have a 2 node cluster (for SQL services).

One of the 2 nodes keeps giving an error in the error log every 15 minutes.
Actually there is an error and a warning:

ERROR eventid (1196):
Cluster network name resource 'Cluster Name' failed registration of one or more associated DNS name(s) for the following reason:
DNS server failure.

WARNING eventid (1578):
Error retrieving the event description.

This error is only thrown by node2 of the cluster and has done this right from the creation of the cluster, but the cluster seems to work fine.

It mentions a DNS server failure, but as far as I can see there is nothing wrong with the DNS server, and node1 does not throw this error.

Can someone point me in the right direction to solve this?
And where does the name "Cluster Name" come from?

The only difference between the nodes is a WINS entry in the NIC settings; I have now added the WINS entry to node2.
So far I don't see any changes...

[edit] WINS did NOT solve the problem, 2 new log entries are created with the same error

↧

CSV running out of space

July 27, 2013, 4:43 pm

We have a 2012 Hyper-V cluster with 4 nodes and 2 CSVs. Volume1 ran out of space last night and all the VMs on that volume went into a paused state. I was sure I had plenty of space, but I extended the SAN volume by 200 GB anyway to get the VMs working again, and tonight I'm back down to only 150 GB free. When I go to the properties of C:\ClusterStorage\Volume1, it says 2.25 TB of 2.4 TB total space is used. When I check the properties of the only folder on that volume, which holds all my VMs, it says only 1.83 TB is used. Several hundred GB of storage space is being used by something that I can't see.

We started backing up our Hyper-V cluster with DPM 2012 SP1 last week, and it seems this may have something to do with it, as this never happened before. Volume2 has plenty of free space and shows the same amount of used space in both places. Why is this happening? Where is this "mystery" data?
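As a quick sanity check on the figures quoted above, the gap can be worked out directly. This is only a minimal Python sketch using the numbers from the post, treating 1 TB as 1000 GB for simplicity; it says nothing about what is actually consuming the space.

```python
# Figures quoted in the post, in TB.
total_tb = 2.4        # total size reported for C:\ClusterStorage\Volume1
used_tb = 2.25        # used space reported on the volume
vm_folder_tb = 1.83   # size reported for the folder that holds the VMs

# Convert to GB (1 TB taken as 1000 GB) and round to whole numbers.
free_gb = round((total_tb - used_tb) * 1000)
unaccounted_gb = round((used_tb - vm_folder_tb) * 1000)

print(f"Free on the volume: about {free_gb} GB")
print(f"Used but not visible in the VM folder: about {unaccounted_gb} GB")
```

The free-space figure matches the 150 GB mentioned in the post, and the unaccounted portion comes to roughly 420 GB.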

↧

How to activate shadow copy services for a file server resource in a 2008 R2 failover cluster?

November 30, 2013, 11:59 pm

Hello,

I have a small question. I cannot find any Microsoft documentation on activating the shadow copy service for a file server resource in a cluster, only for failover clusters on Windows Server 2003.

Now the question: I have a Windows Server 2008 R2 failover cluster with 2 nodes and one file server resource (as a cluster resource). How should I activate the shadow copy service for this cluster resource?

1. Through Computer Management, as in a Windows Server 2003 cluster?

2. Or directly on the disk in Failover Cluster Manager?

I have tested option 1: the wizard creates a shadow copy and creates a task in the local task scheduler, but it does not create a cluster resource for the task. Is that normal? When I open the properties of the disk in Failover Cluster Manager, I see the shadow copy services option again (option 2), and there it is deactivated.

In short: how do I activate shadow copy services for a file server cluster resource in WS2008R2?

↧

Delete Corrupt VHD from CSV

December 1, 2013, 5:55 am

I have a corrupt VHD (caused by job failure within SCVMM) and I am now unable to delete it. I get this error:

How can I remove this file? I tried the solution from this thread (http://social.technet.microsoft.com/Forums/windowsserver/en-US/b1a8ea27-b5e1-401c-a83c-4669989d70d2/deleting-orphaned-vhd-from-clusterstorage) by stopping the VMM Agent service and attempting to delete the file, but that didn't work.

I can't see the file open when I look in Open Files.

The VM is currently off and the disk isn't attached to it.

Cluster Node is running Windows Server 2008 R2 SP1

↧

2012 R2 failover cluster requirements

November 25, 2013, 1:19 pm

Are there different requirements for failover clustering between Windows Server 2012 and 2012 R2?

It appears that requirements are the same according to TechNet link below.

http://technet.microsoft.com/en-us/library/hh831579.aspx

Thanks,

Joseph

Joseph Kejr

↧

Clustering on Dell EqualLogic SAN

December 2, 2013, 5:19 am

Hi,

I'd be grateful for some help. I've just built a Server 2012 (not R2) cluster on our EqualLogic SAN. It all went well at the start; on the hosts I had a LAN team (2-port), CSV team (2-port), live migration team (2-port) and 2 x individual iSCSI ports, both on the same VLAN and subnet. This worked fine apart from doing the cluster validation, which complained that both host NICs were on the same subnet. I did some digging around and found that actually, Microsoft do recommend 2 different subnets if possible. The hosts were also running the Dell Host Integration Tools for MPIO.

I reconfigured the hosts, EqualLogic units and switches so I had 2 different VLANs with different IP subnets. Clustering was now happy, but the EqualLogics really weren't: they lost all of their MPIO and complained that the SAN group couldn't ping some hosts (which is rubbish, because I tested this from the CLI and the group could ping both subnets).

Anyway... I've just put everything back to how it was (i.e. a single VLAN with a single IP subnet) and the EqualLogics are happy again: each host NIC has one connection to one of the EqualLogic iSCSI ports, making a total of 4 connections for each LUN. As expected, though, the clustering is now complaining that the host NICs are back on the same subnet. This is the only real warning brought up by the Cluster Validation test; the others were all to do with insufficient MPIO paths and similar.
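The condition that the validation warning describes (two host NICs sharing one IP subnet) can be illustrated with a short Python sketch. The addresses and the helper name same_subnet are hypothetical and not taken from the post, and this is not a description of how Cluster Validation performs its check internally.

```python
import ipaddress

def same_subnet(iface_a: str, iface_b: str) -> bool:
    """Return True if two interface addresses in CIDR notation share a network."""
    net_a = ipaddress.ip_interface(iface_a).network
    net_b = ipaddress.ip_interface(iface_b).network
    return net_a == net_b

# Hypothetical iSCSI NIC addresses for one host.
single_subnet = ("192.168.10.11/24", "192.168.10.12/24")  # one VLAN, one subnet
split_subnets = ("192.168.10.11/24", "192.168.20.11/24")  # two VLANs, two subnets

print(same_subnet(*single_subnet))
print(same_subnet(*split_subnets))
```

The first pair corresponds to the single-subnet layout that triggers the warning; the second corresponds to the two-subnet layout that validation accepted.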

Can anyone shed any light on this? Given that everything seems to work fine with a single iSCSI subnet, can I just ignore that cluster warning? Does the MS best practice only apply when using Microsoft MPIO, so that using a third-party tool negates the need for it?

Many thanks!

↧

error code 0x80070001 incorrect function

November 27, 2013, 2:40 pm

Hi,

I have 2 Windows Server 2012 Datacenter VMs in a failover cluster. Each VM resides on a VMware ESXi 5.1 U1 server. Every ESXi host has an HP H221 SAS HBA in pass-through mode, and every VM is connected to this SAS controller.

I have 2 virtual disks connected to both VMs. Both disks are located on a DAS. I can add both disks to the cluster, but when I bring them online I receive this error message:

error code 0x80070001 incorrect function

Error ID 1069

If I disconnect both disks from the SAS connection and connect them via iSCSI, everything is OK.

I need to use them over the SAS connection. How can I fix this problem?

Thanks!

↧

Live Migrate fails with event 21502

September 10, 2012, 9:21 am

Live migrate is failing with event ID 21502 on both source and destination nodes (and 22026, 21111, 21100 on the source node and 21114, 21107 and 21100 on the destination node).

This is the 21502 event message on the destination node:

'Virtual Machine xyz' failed to start. Live migration of 'xyz' did
not succeed. (Virtual machine ID F659E6C4-40D7-4EE6-ABF1-0FBB97C52E4B)
'xyz' Microsoft Emulated IDE Controller (Instance ID
{83F8638B-8DCA-4152-9EDA-2CA8B33039B4}): Failed to restore with Error 'A device
attached to the system is not functioning.' (0x8007001F). (Virtual machine ID
F659E6C4-40D7-4EE6-ABF1-0FBB97C52E4B) 'xyz': Failed to open attachment
'C:\ClusterStorage\Volume2\Virtual Machines\xyz\Virtual Hard
Disks\Server2008R2.vhd'. Error: 'A device attached to the system is not
functioning.' (0x8007001F). (Virtual machine ID
F659E6C4-40D7-4EE6-ABF1-0FBB97C52E4B) 'xyz': Failed to open attachment
'C:\ClusterStorage\Volume2\Virtual Machines\xyz\Virtual Hard
Disks\Server2008R2.vhd'. Error: 'A device attached to the system is not
functioning.' (0x8007001F). (Virtual machine ID
F659E6C4-40D7-4EE6-ABF1-0FBB97C52E4B)

I have a 2-node cluster. Each node has 4 NICs on different subnets (1 each for host management, user traffic, live migrate and heartbeat), named the same on both nodes. Live Migrate has worked in the past. Both nodes can access the VHD stored on the CSV. Both nodes are talking to each other (ping etc). No DNS issues that I can tell. I've refreshed the virtual machine config. The CSV is not in Redirected Access mode.

Not sure what could be the problem.

Thanks, Russ

↧

Live Migration failed - failed to delete configuration: The request is not supported. (0x80070032). Event ID 21502

February 6, 2013, 7:44 pm

We have a 3-node cluster attached to a SAN. All nodes are running Server 2012. We have 2 virtual machines that will no longer live migrate or quick migrate. When we try, we get the following error message.

Event ID: 21502

Live migration of 'Virtual Machine Library' failed.

Virtual machine migration operation for 'SRV-XXX' failed at migration source 'NODE1'. (Virtual machine ID 8CC600A0-5491-45B1-896E-E99BB85AA856)

'SRV-XXX' failed to delete configuration: The request is not supported. (0x80070032). (Virtual machine ID 8CC600A0-5491-45B1-896E-E99BB85AA856)

We are not having this issue with any of our other 15 virtual machines. I have searched the forums and have not found any articles with the same situation.

↧

Disappearing iSCSI disks on failover cluster

March 22, 2013, 3:18 pm

Trying to set up a test Hyper-V Failover Cluster on a pair of Windows Server 2012 Standard Edition computers.

The computers are cabled to my home router/switch, and also have a redundant connection over my home WiFi network. The cabled connection and the WiFi connection are using different subnets.

I've configured both as DNS/DCs for the same domain. One of them has the Microsoft iSCSI Target Server service on it, and I've configured two iSCSI disks: a 1GB quorum disk and a 300GB disk for VM storage. I've launched the iSCSI Initiator on both computers and connected them to the target. The two iSCSI disks show up in Disk Management. On one computer, I initialized both disks, brought them online, and formatted them with NTFS.

Next step was to create the failover cluster, but I can't make the disks appear in the cluster. I've tried it with the disks online, with the disks offline, with no drive letters on the disks - no dice.

After the cluster is created, the disks don't appear in the Failover Cluster Manager Storage -> Disks area, and if I click the Add Disk link, I get the dreaded "No suitable disks for cluster disks were found" message. Yet, the disks that were configured in Disk Management are not visible from either node. They also aren't visible in the Server Manager -> File and Storage Services -> iSCSI area. They are entirely missing in action. No good recreating them in that tool either; it says, "No eligible servers are available".

What am I not doing that is preventing the iSCSI disks from being available to the Failover Cluster?

↧

Getting started with Cluster - Design advice

November 27, 2013, 8:56 am

We have a new Dell VRTX server with 2 blades and shared storage.

It's my first cluster, so I'm trying to get to grips with the design and terminology.

What I want to achieve (goals):

Hyper-V cluster to ensure resilience and no single point of failure
Virtualise a number of older physical servers, such as print servers, consoles for managing systems, door access system etc
Virtualised new file server with DFS for user access to data

Any advice on how best to configure this would be appreciated, not necessarily step by step, but on questions such as whether to use a high availability file server, whether to use SMB, how best to partition the storage, and how best to provide storage for the file server.

I will also be wanting to make snapshots of the Hyper-V servers onto another physical server off-site just in case of a disaster.

↧

Maximum LUNs in Hyper-V 2012 Cluster

September 4, 2013, 10:47 am

Hi,

In Hyper-V 2008 R2, I was told that "Windows does not have a great track record managing many LUNs. From experience we see scale issues anywhere at 150 LUNs and above."

I was also told "MSFT is currently fully engaged in the next revision of the platform that all this is built on....."

I am wondering whether 2012 can support more LUNs.

Does anyone have an idea how many LUNs will work well on a Hyper-V 2012 cluster?

Please don't answer this question unless you have solid knowledge about this matter.

(sorry I asked this recently but wasn't given a direct answer, so I am asking again more directly).

Thanks

Daniel

↧

VM in Cluster stuck at Stopping

December 1, 2013, 11:37 pm

Hello,

I currently have a problem with our in-house cluster: one VM is stuck in the Stopping state. (see Image)

It seems that the cluster tries to shut down the VM but cannot contact it, as no CPU usage or other data is displayed in the details section.

If I try to remove it I get the error: "A null context handle was passed from the client to the host during a remote procedure call." (0x800706ef)

We have now started the VM on another Hyper-V host, as the machine is used in our working environment.

Hope someone can help me resolve this issue.

Thanks Paul

↧

Multiple Event IDs 4738 and 4724 on new Windows 2012 Hyper-V cluster

November 13, 2013, 3:26 pm

We just set up a two-node Windows 2012 Hyper-V cluster. Everything is working correctly, but in the Security log on both nodes we're getting Event IDs 4738 and 4724 every 3 minutes for the built-in CLIUSR (Failover Cluster Local Identity) account that gets created with the cluster (there's a local CLIUSR account on each node). Because we have an application that parses the security logs and emails us about user account changes, we're getting spammed because of this. Does anyone know if this is normal behavior for that CLIUSR account? Is there any way to suppress this? On the account properties, "User cannot change password" and "Password never expires" are both checked. I appreciate any insights people might have. Thanks
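One possible way to cut the noise on the alerting side, rather than in the cluster itself, is to filter these events before the email step. The sketch below is in Python and rests on assumptions: the post does not describe the parsing application, so the record shape (the keys event_id and target_account) and the function name should_alert are hypothetical.

```python
# Hypothetical record shape: the post does not describe the parser's data model.
SUPPRESSED_EVENT_IDS = {4738, 4724}
SUPPRESSED_ACCOUNT = "CLIUSR"

def should_alert(event: dict) -> bool:
    """Return False for the recurring CLIUSR account-change events, True otherwise."""
    is_cliusr = event.get("target_account", "").upper() == SUPPRESSED_ACCOUNT
    return not (is_cliusr and event.get("event_id") in SUPPRESSED_EVENT_IDS)

# Sample records standing in for parsed Security log entries.
events = [
    {"event_id": 4738, "target_account": "CLIUSR"},
    {"event_id": 4724, "target_account": "CLIUSR"},
    {"event_id": 4738, "target_account": "jsmith"},
]

alerts = [e for e in events if should_alert(e)]
print(alerts)
```

A filter like this also hides any genuine change to the CLIUSR account, so it trades completeness for quieter alerts.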

↧

MA DSM mismatch between hosts

November 28, 2013, 3:41 pm

All our servers are fresh Server 2012 installs running the HP DSM\MPIO. But when I ran the MS Failover Cluster validation check, it failed with the following: "For the device-specific module (DSM) named Microsoft DSM, versions do not match between node XXX.XX.COM and node XXXX.XXX.COM". The configuration of MPIO\DSM is consistent and correct. I have also reinstalled the HP DSM\MPIO software.

↧

Clustered disk only showing on one node

February 16, 2010, 8:35 am

I have set up a cluster using Hyper-V 2008 R2. It passed all the validation tests, but the clustered disks (both the quorum disk and the one I intend for the virtual machines) only load on one node at a time: the one which boots last. If I reboot one node, it loads the disks and removes them from the other node. On the server without the disks, they can be seen in Disk Management but are not online.

I get this error:
ID 1034
Cluster physical disk resource 'Cluster Disk 1' cannot be brought online because the associated disk could not be found. The expected signature of the disk was '{e6c8ddd8-70de-479a-8e5a-b91a4a9910b5}'. If the disk was replaced or restored, in the Failover Cluster Management snap-in, you can use the Repair function (in the properties sheet for the disk) to repair the new or restored disk. If the disk will not be replaced, delete the associated disk resource.

↧

Volume Guid and Powershell

December 4, 2013, 12:26 am

Hi,

As we are using Hyper-V clustering and rapid SAN provisioning it makes sense to attach VHDs using volume GUIDs such as

"\\?\Volume{3e199f9f-5c93-11e3-8130-d4ae5201ad09}\\Vm1Disk1.vhd"

I can attach the disk using PowerShell and WMI, but not using the new "New-VHD" method.

In fact I am unable to use volume GUIDs at all in any file-related functions. I think it doesn't like the "?" character, which is technically an invalid character in a filename.

Is there any secret way of escaping a volume GUID so that PowerShell can accept it?
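For reference, the shape of a volume GUID path can be sketched as follows. Python is used here purely for illustration; the helper name volume_guid_path is hypothetical, the sketch places a single backslash before the file name, and it does not show how New-VHD or other PowerShell cmdlets treat the "?" character.

```python
import uuid

PREFIX = "\\\\?\\Volume"  # the four characters \\?\ followed by "Volume"

def volume_guid_path(volume_guid: str, relative_path: str) -> str:
    # uuid.UUID raises ValueError if the GUID text is malformed.
    guid = uuid.UUID(volume_guid)
    # Exactly one backslash separates the volume name from the file path.
    return PREFIX + "{" + str(guid) + "}\\" + relative_path.lstrip("\\")

# GUID and file name taken from the example in the post.
path = volume_guid_path("3e199f9f-5c93-11e3-8130-d4ae5201ad09", "Vm1Disk1.vhd")
print(path)
```

Validating the GUID up front means a mistyped volume identifier fails early instead of producing a path that silently points nowhere.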

Thanks

Daniel

↧