Channel: High Availability (Clustering) forum
Viewing all 2783 articles

Cluster Service terminated by GUM Task

I've had an issue where one of my Windows 2012 R2 Hyper-V hosts just decided to keel over and die on me.  The event which I'm seeing is as follows:

Log Name:      System
Source:        Microsoft-Windows-FailoverClustering
Date:          20.07.2016 20:39:19
Event ID:      5377
Task Category: Global Update Mgr
Level:         Error
Keywords:
User:          SYSTEM
Computer:      mgmt45.mgmt.local
Description:
An internal Cluster service operation exceeded the defined threshold of '110' seconds. The Cluster service has been terminated to recover. Service Control Manager will restart the Cluster service and the node will rejoin the cluster.
Event Xml:<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event"><System><Provider Name="Microsoft-Windows-FailoverClustering" Guid="{BAF908EA-3421-4CA9-9B84-6689B8C6F85F}" /><EventID>5377</EventID><Version>0</Version><Level>2</Level><Task>6</Task><Opcode>0</Opcode><Keywords>0x8000000000000000</Keywords><TimeCreated SystemTime="2016-07-20T18:39:19.244464800Z" /><EventRecordID>347017</EventRecordID><Correlation /><Execution ProcessID="4596" ThreadID="9184" /><Channel>System</Channel><Computer>mgmt45.mgmt.local</Computer><Security UserID="S-1-5-18" /></System><EventData><Data Name="OperationName">SynchronizeState</Data><Data Name="ThresholdTimeInSec">110</Data></EventData></Event>
I'm finding extremely little information regarding event 5377 on the Internet.  Apart from doing the standard checking for latest windows updates, and rebooting - how can I prevent this from happening again in the future?  This crash took down 64 virtual machines.
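For anyone triaging a batch of these events, the useful fields sit in the EventData block of the XML. Below is a minimal Python sketch that pulls them out; the XML is a trimmed copy of the event above, and `summarize_event` is a hypothetical helper name, not part of any Microsoft tooling.

```python
import xml.etree.ElementTree as ET

# Namespace used by the event XML, as it appears in the event above.
NS = {"e": "http://schemas.microsoft.com/win/2004/08/events/event"}

# Trimmed copy of the event XML from the post (only the fields read below).
EVENT_XML = (
    '<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">'
    "<System><EventID>5377</EventID>"
    '<TimeCreated SystemTime="2016-07-20T18:39:19.244464800Z" />'
    "<Computer>mgmt45.mgmt.local</Computer></System>"
    "<EventData>"
    '<Data Name="OperationName">SynchronizeState</Data>'
    '<Data Name="ThresholdTimeInSec">110</Data>'
    "</EventData></Event>"
)


def summarize_event(xml_text):
    """Return the event ID, UTC timestamp, computer, and named EventData fields."""
    root = ET.fromstring(xml_text)
    summary = {
        "EventID": root.findtext("e:System/e:EventID", namespaces=NS),
        "TimeCreatedUtc": root.find("e:System/e:TimeCreated", NS).get("SystemTime"),
        "Computer": root.findtext("e:System/e:Computer", namespaces=NS),
    }
    # Each <Data Name="..."> element becomes one key in the summary.
    for data in root.iterfind("e:EventData/e:Data", NS):
        summary[data.get("Name")] = data.text
    return summary


if __name__ == "__main__":
    for key, value in summarize_event(EVENT_XML).items():
        print(f"{key}: {value}")
```

Here the operation that ran past the 110-second threshold is reported as SynchronizeState. The sketch only extracts what the event already says; it does not explain why the operation stalled.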


SQL service terminated unexpectedly SQL 2012 Failover Cluster EventID 7034

Hi All,

I have noticed that one of our clustered SQL servers has terminated unexpectedly twice this week. I've checked the SQL Server logs and there are no entries from the time of the crash.

I have also checked the event logs, and the events below were logged:

evt1

evt2

evt3

evt4

evt5

Are there any other logs I can check for this? I can't find the cause of the issue.

Thanks for your help

Robert 

Win2k8R2 cluster: one node goes into a paused state after replacing a NIC

I have a 10-node cluster running Hyper-V workloads.

Within this cluster I have a management network, a CSV network, a live migration network, a virtual machine VLAN trunk, and two iSCSI networks that attach to my SAN, a pair of EqualLogic storage arrays (PS 6110XV & E).

The issue is that the logs for the affected node report that the CSV network is unable to find the other nodes, and this sends the affected node into a paused state.

The NICs are the same chipset but are now rebadged as QLogic.

Cluster validation tests for the network all come back good, apart from a warning about the iSCSI NICs being on the same subnet (Dell's design). I'm at a loss.

I've evicted the node, cleared the configuration via PowerShell, and re-added it, to no avail.

Any ideas are extremely welcome.

Admin privileges issue for creating windows cluster

Hi,

We want to install a SQL Server 2012 Active/Passive cluster on Windows Server 2012.
Node1 and Node2 are in the domain, with SAN storage.

While validating/creating the Windows cluster, logged on to Node1:

1. Validate / Create Cluster
2. Browse and search for the 2 nodes
3. Next (only the current node is displayed)

The following message is displayed:

'You do not have administrative privileges on Node2'

The same happens the other way round on Node2:

'You do not have administrative privileges on Node1'

The objects were created manually in AD because the domain ID 'clusadmin' does not have administrative rights.

The ID 'clusadmin' is a domain account. Due to policy, it does not have domain administrative rights. We have added this ID to the local Administrators group.

Are there any specific delegations or permissions that we can give to 'clusadmin' for creating the cluster, instead of giving it administrative privileges?

Regards,

Nikhil Desai




I don't get it - Cluster Creation

I have 2 Windows 2012 R2 VMs on esxi6 hosts that are configured per MS and VMware docs. These nodes pass the cluster validation tests completely except for one MPIO warning (MPIO is handled at the physical level by the vmware hosts, so no worries there), but when I attempt to create the cluster directly after, it fails. I'm an admin with full rights to the OU with the nodes. The cluster object gets created, but then is automatically removed when the failed attempt finishes.

Is this a common problem? What is the point of a validation process with several hundred individual checks if it's going to then flat out fail?

Question on Cluster startup behavior when DC is a VM

So I was watching Matt talk about how 2012+ cluster can startup without AD (also article). That's awesome! However, I'm left to wonder if there are still other requirements for successful cluster startup now that AD isn't a problem anymore - like DNS. Are you still gonna require some other physical server to answer DNS (or something else)? AKA is the cluster gonna have to perform DNS lookups (or need some other service on a different, non-cluster server) as part of its startup process still? That kinda seems like a gotcha...it's great that you can have your only DC's be VM's on the cluster now, but if you still have to have another physical server to do <any other service required for successful cluster start> that seems a little disappointing. Could someone fill me in?


born to learn!

Live Migration fails

I cannot migrate a VM on Hyper-V. It fails with event ID 21502: Live migration of 'Virtual Machine SRBLINK2' failed.

The VM has 2 .vhdx files and I do not see any configuration files (XML) for it. Both .vhdx files are stored on the same volume. The most common cause I have seen mentioned is the .vhdx and XML files being stored on separate CSVs. Where can I find the virtual machine configuration files?

Multiple Hyper-V clusters on one Scale Out File Server

Are multiple clusters on the same Scale Out File server supported?  If so, do I use the same File Share Witness for Cluster2 that I used for Cluster1?

I've been searching for documentation on this kind of setup and haven't found much at all.

Thanks,

Michael


DCOM error on server manager connecting to failover cluster

hi

We have 2 Server 2012 R2 Hyper-V host servers. One of them cannot connect to the cluster in Server Manager and gives the following errors.

DCOM was unable to communicate with the computer CherwellCluster using any of the configured protocols; requested by PID      918 (C:\Program Files\CA\arcserve Unified Data Protection\Engine\TOMCAT\bin\tomcat7.exe).

DCOM was unable to communicate with the computer CherwellCluster.cherwell.local using any of the configured protocols; requested by PID     2370 (C:\Windows\system32\ServerManager.exe).

I have checked the DCOM TCP/IP protocols on both Virtual Host servers and they match up ok.

A Windows 10 remote Server Manager and the 2nd virtual host can connect perfectly well. Both of these also get a good reply from themselves, each other, the affected VH server, and the cluster to the PowerShell command get-wmiobject mscluster_resourcegroup -computer cherwellcluster -namespace "ROOT\MSCluster".

The affected server can run the same PowerShell command against the 2nd VH server and our 2012 R2 DCs, but not against cherwellcluster.

On the affected server the domain firewall is off, wmimgmt says the repository is consistent, the entry for the cluster in Server Manager has been removed and re-added, and the server has been rebooted a couple of times and brought up to date with Windows updates.

Whatever the underlying problem is, it also affected the vEthernet NIC, preventing it from seeing the domain suffix properly. I fixed that by adding the suffix on the NIC's IPv4 DNS tab.

I've exhausted searching on the internet, any ideas?

thanks in advance

Giles.


How to UNassign cluster core resource?

Hello.

I have assigned the cluster core resources (Name and IP address) to one of the cluster roles, specifically to a clustered SQL instance.

I think this is not good practice, so I want to unassign these cluster core resources from any role.

Thanks for any advice.

Cluster fails to come online after reboot due to permission loss on C:\ClusterStorage

We have built a couple of SQL Server 2014 clusters with Cluster Shared Volumes, and we've found that after a reboot the cluster service fails to come online. The error is "access is denied" on C:\ClusterStorage. We have CU1 installed for SQL Server.

In order to fix the issue we have to boot the node into safe mode and delete the C:\ClusterStorage folder. Once we boot back into the system normally the C:\ClusterStorage folder is recreated and everything is fine. Until the next reboot when the same thing happens.

These are pretty basic installs where we followed the "Step-by-Step to Deploying Microsoft SQL Server 2014 with Cluster Shared Volumes" guide.

Does anyone know why the access is being stripped on reboot?

Failover Cluster Validation fails Firewall Configuration for unknown reason

I have three nodes, all brand new fresh R2 installs with nothing but the Hyper-V role installed and the failover clustering feature enabled. I am trying to run the failover cluster validation tests and everything passes, but it fails at the Firewall Configuration section with a weird error (see below). The firewall hasn't been touched on any of the machines except by installers to enable iSCSI and iSCSI MPIO. If anyone can help I would greatly appreciate it. I haven't a clue in the world what this means. Thanks!

An error occurred while executing the test.
There was an error verifying the firewall configuration.
An item with the same key has already been added.

How to make a software highly available?

Hello All,

I have a small piece of software called Windows Active Directory Adapter, which connects to Active Directory and helps in managing the accounts in AD.

I want to make this AD Adapter highly available by configuring it on two machines in an Active/Passive configuration, so that only one instance of the Adapter is working at a time.

By default the AD Adapter does not have any option for making it highly available.

I want to use the Windows clustering features to configure it on two nodes and then make the two nodes Active/Passive. Can anybody please give me an idea of how to achieve this goal? Do I need to use Hyper-V for it?

Thanks and Regards,

Hamza

Disk in cluster

Hi,

When I open the cluster I can see DISK 1, DISK 2, DISK 3 ...

Are these disks from shared storage, or are they the disks in the cluster nodes? What are the disks in a cluster used for?

For a cluster handling critical applications, will these disks contain application data? Please clarify this for me.


Paramesh KA

Server 2012 R2 Hyper V: Live Migration crashes source and target host.

When using Live Migration on a shared CSV volume we experience crashes on both the source and target host. After a progress of 70-85% the virtual machine gets a state of "stopping", and there is no further progress. We are not able to cancel the move or connect to the virtual machine. Other virtual machines on the cluster are still running, but powering them off or moving them is not possible. Our two Hyper-V hosts have to be manually powered off, as a regular shutdown from the OS does not work.

We started experiencing this problem on the 24th of June, after our Hyper-V hosts installed updates using Cluster-Aware Updating. Ever since then we haven't been able to use live migration. Before the 24th, live migration worked with no issues. Uninstalling the update from the 24th (KB3161606) changes nothing.

What we have tested so far:

1. We are able to reproduce the issue using generation 2 virtual machines with no OS installed (i.e. running just in the BIOS), and with OSes such as server 2012 R and Ubuntu 14.04. With generation 1 virtual machines we don’t see the problem.
2. The issue occurs if the virtual machine is moved to another host, and files are not moved to another location (directory, disk). If the files are moved the problem does not happen.
3. The tests were done with Hyper V Manager as it allows for changing storage location when doing a live move.
4. Using "Failover Clustering Manager - Move -->Virtual Machine storage" also works fine, but this does not involve changing hosts for the virtual machine.

We have been in contact with the storage service provider, who have verified that the iSCSI storage is configured correctly. We have also been in contact with a Microsoft Partner who has verified that our Hyper V systems have been set up according to Best Practices.

Configuration: Dell PowerEdge R710 hosts with a shared iSCSI storage on a Compellent SC4020.

The only thing that is logged in Event viewer around the time of live migration is the following entry:

The description for Event ID 22040 from source Microsoft-Windows-Hyper-V-Worker cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event:

%%2147943860

0x800705B4

The locale specific resource for the desired message is not present
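The two values in that event look like the same number written two ways, which is easy to check. A minimal Python sketch, assuming the standard 32-bit HRESULT layout (top bit for failure, an 11-bit facility, a 16-bit code); `decode_hresult` is a hypothetical helper name used only for illustration.

```python
def decode_hresult(value):
    """Split a 32-bit HRESULT into its failure bit, facility, and code fields."""
    value &= 0xFFFFFFFF  # keep only the low 32 bits
    return {
        "hex": f"0x{value:08X}",
        "failure": bool(value >> 31),       # top bit set means failure
        "facility": (value >> 16) & 0x7FF,  # 11-bit facility field
        "code": value & 0xFFFF,             # low 16 bits: the error code
    }


if __name__ == "__main__":
    # The decimal value from the event, without the leading "%%".
    print(decode_hresult(2147943860))
```

The decimal value 2147943860 is 0x800705B4 in hex, so both lines carry the same result. Facility 7 is the Win32 facility, and code 1460 is the Win32 timeout error (ERROR_TIMEOUT), which at least fits a migration that stalls. It does not say what timed out.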


Migrating file services failover cluster to virtual machines running Windows Server 2012 R2

Our organization is looking to migrate our existing Failover Cluster from 2 physical servers running Windows Server 2008 R2 to 2 VMware guests running Windows Server 2012 R2. Hoping someone can point out a logical migration path.

In our current setup, we have 2 physical nodes (FS01 and FS02), both running Windows Server 2008 R2. For storage, we have 18 RDMs presented to both nodes from an HP 3PAR back end. All 18 disk drives are currently running from FS01. Within Failover Cluster Manager, we have a common Server Name (F01) and all shares are mapped using it.

My question is where is the logical place to start looking at migrating to 2 virtual nodes running Server 2012 R2? Will we be able to preserve the common Server Name (F01) that all of our shares currently point to? Is it best to spin up an entirely new file services cluster with the VMware guests and migrate our drives over one at a time, or can we add the new VMware guests to our existing failover cluster and move the drives to them one at a time? From everything I've read, OS mismatches are not permitted within a failover cluster.

Thanks


Drive of cluster nodes moves automatically

We have been facing this issue very frequently for the past week.

Scenario    :  There are two nodes in the cluster. One of the nodes is running an FTP service locally but using cluster drives for data.

Issue1        :  Sometimes the cluster drives move to the other node without any apparent reason.

Issue2        :  The iSCSI initiator doesn’t automatically connect the NAS drives after a restart.

Clustername error because object cannot be created in Active Directory

Hi all,

I was able to create a workgroup cluster using Windows Server 2016.

Everything is working except for the cluster name. The error says it cannot be created in Active Directory.

This is a workgroup cluster, so I need to use DNS.

The cluster virtual IP is working fine, but cluster validation is failing.

If my cluster name is cluster01, what can I do to have the cluster name resolve without error?

P.S. I did not configure the nodes' primary DNS suffix.

thanks,

joey


http://joeydj.com/


Windows Server 2012 R2 Clustering across datacenters

Hi Team,

We are setting up a Windows Server 2012 R2 Standard edition failover cluster between our primary and DR sites.

2 nodes will be in the primary datacenter and 1 node will be in the DR site. All 3 nodes will be virtual machines running on VMware infrastructure. Since one of the requirements for WSFC is a heartbeat network between nodes, I am wondering how it will be assigned in this scenario, when the nodes are in different locations. For heartbeat NICs we generally don't set a default gateway, DNS, etc. in the IP properties, and we use reserved private addresses in the 192.168.x.x, 172.16.x.x, or 10.0.x.x ranges.

Any article that explains how to go about configuring heartbeat networks and IP addressing for WSFC will be appreciated. Thanks

Regards, 
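When planning the addressing, it can help to check which private block a candidate heartbeat address falls in and whether two nodes would share a subnet. A minimal Python sketch using the standard `ipaddress` module; the addresses and the /24 prefix below are made-up examples, and `rfc1918_block` and `same_subnet` are hypothetical helper names.

```python
import ipaddress

# The three private IPv4 blocks reserved by RFC 1918.
RFC1918_NETWORKS = [
    ipaddress.ip_network(block)
    for block in ("10.0.0.0/8", "172.16.0.0/12", "192.168.0.0/16")
]


def rfc1918_block(address):
    """Return the RFC 1918 block containing the address, or None if it is not private."""
    ip = ipaddress.ip_address(address)
    for network in RFC1918_NETWORKS:
        if ip in network:
            return network
    return None


def same_subnet(address_a, address_b, prefix_len):
    """True if both addresses fall in the same network of the given prefix length."""
    network_a = ipaddress.ip_interface(f"{address_a}/{prefix_len}").network
    return ipaddress.ip_address(address_b) in network_a


if __name__ == "__main__":
    # Hypothetical heartbeat addresses: two primary-site nodes and one DR node.
    nodes = {"node1": "192.168.10.11", "node2": "192.168.10.12", "dr1": "192.168.20.11"}
    for name, addr in nodes.items():
        print(name, addr, rfc1918_block(addr))
    print("node1/node2 same /24:", same_subnet(nodes["node1"], nodes["node2"], 24))
    print("node1/dr1 same /24:", same_subnet(nodes["node1"], nodes["dr1"], 24))
```

In this made-up layout the DR node is on a different /24 from the primary-site nodes, so heartbeat traffic between the sites would have to be routed. That is the case the question about gateways is really asking about.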

Geographical cluster firewall port settings

Hi experts,

I am planning to create a two-node geographical cluster. Can anybody tell me which ports need to be opened for my cluster nodes?

Thanks.
