SQL Server 2012 Multi-Subnet Cluster Part 4

By:   |   Comments (12)   |   Related: 1 | 2 | 3 | 4 | > Clustering


Problem

In a previous tip on Install SQL Server 2008 on a Windows Server 2008 Cluster Part 1, we have seen how to install and configure a SQL Server 2008 on a Windows Server 2008 Failover Cluster. We now have a new requirement to deploy a SQL Server 2012 instance on a multi-subnet cluster that spans different geographical locations. How do I install and configure a SQL Server 2012 on a multi-subnet cluster?  Check out this tip to learn how.

Solution

To continue this series on Installing SQL Server 2012 on a Multi-Subnet Cluster, let's look at what we have accomplished so far.

  • In Part 1, we have configured the storage in both of the servers that we will be using as part of our cluster.
  • In Part 2, we've created the Windows Server 2008 R2 multi-subnet cluster, configured the cluster heartbeat settings and modified the cluster quorum configuration.
  • Part 3 completed with a working SQL Server 2012 multi-subnet cluster running on a single node.

In this tip, we will proceed to add a node in a SQL Server 2012 multi-subnet cluster and configure network properties of the cluster resource group.

Adding a Node on a SQL Server 2012 Multi-Subnet Cluster

Adding a node in a SQL Server 2012 multi-subnet cluster is no different than performing the same task in a single-subnet cluster - the steps have been highlighted in this tip. What I would like to point out in this tip, however, are the differences in the sections of the installation that pertains to the cluster spanning multiple subnets. As I've also mentioned in the previous tip, the number of nodes you can add in a failover cluster depends on the editions of SQL Server that you will use. Make sure you include that in your planning phase. I'm also skipping some sections because they're basically the same as when you were installing the SQL Server 2012 failover cluster.

To add a node on a SQL Server 2012 multi-subnet cluster:

  1. Run setup.exe from the installation media to launch SQL Server Installation Center.
  2. Click on the Installation link on the left-hand side. Click the Add node to a SQL Server failover cluster link. This will run the SQL Server 2012 Setup wizard.
    Adding a Node on a SQL Server 2012 Multi-Subnet Cluster
  3. In the Setup Support Rules dialog box, validate that the checks return successful results and click OK.
  4. In the Cluster Node Configuration dialog box, validate that the information for the existing SQL Server 2012 cluster is correct. Click Next.
    Cluster Node Configuration dialog box
  5. In the Cluster Network Configuration dialog box, enter the virtual IP address and subnet mask that your SQL Server 2012 cluster will use in the subnet that the second node is in. From the previous tip, this would be 192.168.0.13. Similar to the previous tip, notice that the setup process has detected the existence of two network subnets - LAN_DC1 and LAN_DC2. Since we've already configured the virtual IP address for LAN_DC1 in the first subnet, the section has been disabled. You'll also notice the message box that gives you a brief explanation of how the OR logic dependency works similar to how we've defined it in another previous tip. Click the Yes button in the message box. Click Next.
    enter the virtual IP address and subnet mask that your SQL Server 2012 cluster will use in the subnet that the second node is in
  6. In the Service Accounts dialog box, verify that the information is the same as what you have used to configure the first node. Click Next.
    The Service Accounts dialog box
  7. In the Error and Usage Reporting dialog box, click Next.
  8. In the Add Node Rules dialog box, verify that all checks are successful and click Next.
  9. In the Ready to Add Node dialog box, verify that all configurations are correct and click Install.
  10. In the Complete dialog box, click Close. This concludes adding a node to a SQL Server 2012 Multi-Subnet Cluster.
    Completing the Add a Failover Cluster Node wizard

You can validate your cluster installation by expanding the Services and Applications node in Failover Cluster Manager and check the cluster name of your SQL Server instance. You can now see an option to move the service to another node, in this case, the node you've just added in your failover cluster.

Check the cluster name of your SQL Server instance in the Failover Cluster Manager

You probably noticed that I was not able to successfully failover to the newly added node in my multi-subnet cluster. When you look at the clustered resources for the SQL Server resource group, the virtual IP address for the second subnet, the virtual network name and the clustered disks are all online. This helps me narrow down the issue and is definitely not a network nor a disk issue. The only clustered resource that would not go online is the SQL Server service (and, of course, the SQL Server Agent service because it is dependent on the SQL Server service.) If you look at the Windows Event Log entries on the node that was recently added to the cluster, you will see an error message similar to the one below. This will point you to the potential cause of the problem.

Windows Event Propertes - Event 5123, MSSQLSERVER CREATE FILE encountered operating system error 3
I highlighted in the previous tip that if you will be storing the tempdb database in a local drive instead of a clustered drive, you need to make sure that:
  • The same drive letter and folder structure exists in all of the nodes in the cluster
  • The SQL Server service account has the appropriate permissions on the folder where tempdb will be created
NOTE: I initially granted the SQL Server service account the Modify permissions on the folder where the tempdb 
database will be stored. However, that didn't allow the creation of the tempdb database during failover, 
thus, preventing the clustered instance to go online on the node. I needed to grant the Full Control 
permissions so I can successfully failover.

Tuning the SQL Server Failover Cluster Instance DNS Settings

System downtime is more often a perception than reality. Even if we manage to perform a failover between cluster nodes in a matter of seconds, users will still think that the system is down if the client applications could not connect to the database server. In a multi-subnet cluster, the perceived downtime is influenced by the behavior of the DNS client's ability to resolve the virtual server name. Client workstations and applications cache DNS entries for a period of time before checking with the DNS server to see if the name resolution has changed. This is called the Time-To-Live (TTL) value and, for cluster resources, the default value is 1200 seconds, or 20 minutes. Consider a situation where a client workstation has just checked in with the DNS server and immediately after that, the cluster resource failed over to the other node on a different subnet. Even though the cluster resource is already online on the other subnet, the client workstation will still have to wait for the next time that it has to check in with the DNS server. This can be addressed by reducing the TTL value of the DNS host record registered by the cluster resource, in this case, the SQL Server virtual server name. While we still do not have any recommendations for SQL Server multi-subnet clusters, we can use the one that is recommended for Exchange Servers. As per this Microsoft TechNet article, we can configure the DNS TTL value of the virtual server name for the SQL Server instance to 300 seconds or 5 minutes.

These properties are not visible via the Failover Cluster Management console so we will be using Windows PowerShell to check and modify this property.

#List different cluster resources
PS C:\>Get-ClusterResource | Select Name, ResourceType
#List parameters and their values of the SQL Server Network name
PS C:\>Get-ClusterResource "SQL Network Name (SQLMULTISUBCLUS)" | Get-ClusterParameter
#Set parameter value
PS C:\>Get-ClusterResource "SQL Network Name (SQLMULTISUBCLUS)" | Set-ClusterParameter HostRecordTTL 300
Tuning the SQL Server Failover Cluster Instance DNS Settings

The first line is simply to retrieve all of the cluster resource names in the cluster. We want to know the cluster resource name of the virtual server name of the SQL Server instance, in this case, SQL Server (MSSQLSERVER). The next line simply grabs all the cluster resource parameters for the cluster resource named SQL Server (MSSQLSERVER). The parameter that we want to modify is HostRecordTTL. As I've previously mentioned, the default value is 1200 seconds. The third line simply changes the HostRecordTTL parameter value to 300.

Once the cluster parameter value has been changed, the client applications will no longer have to wait for 20 minutes in order to get the name resolution correctly and, thus, minimizing "perceived" downtime.

Another thing that we need to look into is how the virtual server name gets registered in the DNS server. Microsoft DNS is usually implemented with Active Directory integration which means that an update in the DNS made in one Active Directory site in a subnet may take a while to replicate to another Active Directory-integrated DNS server on another site with a different subnet. This can be addressed by registering all of the virtual IP addresses for the virtual server name on all of the DNS servers even when the virtual IP address is not online. This gives the DNS client an opportunity to store all of the possible IP addresses for a virtual server name. However, this assumes that the client application is smart enough to retry a connection against all available IP addresses for a given virtual server name. The default behavior of a cluster resource name is to only register the virtual IP address that successfully comes online. In our example, the virtual IP address 172.16.0.113 will only be registered in the DNS server within that subnet when it successfully goes online. The same thing is true with the virtual IP address 192.168.0.113. This results in having just one entry per DNS server per subnet for the virtual server name instead of two. To modify this behavior, run this code:

#List parameters and their values of the SQL Server Network name
PS C:\>Get-ClusterResource "SQL Network Name (SQLMULTISUBCLUS)" | Get-ClusterParameter
#Set parameter value
PS C:\>Get-ClusterResource "SQL Network Name (SQLMULTISUBCLUS)" | Set-ClusterParameter RegisterAllProvidersIP 1
PowerShell code to capture the RegisterAllProvidersIP value

The first line simply grabs all the cluster resource parameters for the cluster resource named SQL Server (MSSQLSERVER). The parameter that we want to modify is RegisterAllProvidersIP. As I've previously mentioned, the default value is 0, which means only the virtual IP address that successfully goes online will be registered with the DNS server in that subnet. The second line simply changes the RegisterAllProvidersIP parameter value to 1.

After changing the parameter value, the virtual server name will now be able to register all of the virtual IP addresses in the DNS server corresponding to the virtual IP address that goes online in that subnet. This also means that the client workstation can now store multiple virtual IP addresses that get registered in the DNS server and, thus, also minimizing "perceived" downtime.

Once the SQL Server instance has been successfully failed over and failed back, you can verify in your DNS servers that the virtual IP addresses that you have defined for your SQL Server Failover Clustered Instance has been successfully registered.

verify in your DNS servers that the virtual IP addresses that you have defined for your SQL Server Failover Clustered Instance has been successfully registered in the DNS Manager

Be sure to test these settings with your applications and come up with a baseline measurement of your client connectivity, tweaking the parameter values based on your requirement, taking note that they should meet your availability objectives.

Testing Application Connectivity

In order for client applications to be automatically redirected during a cluster failover, they need to either be using

  1. the SQL Server Native Client 11.0
  2. the Data Provider for SQL Server in .NET Framework 4.02 or above
  3. the Microsoft JDBC Driver 4.0 for SQL Server

A new connection string parameter named MultiSubnetFailover is made available to allow applications to simultaneously try all the IP addresses assigned for the SQL Server multi-subnet cluster instance name and connects to the first one that responds. When dealing with legacy client libraries or 3rd-party data providers, the MultiSubnetFailover parameter cannot be used. The recommendation is to adjust the connection timeout setting by 21 seconds for each additional IP address. An example of using the MultiSubnetFailover parameter in a connection string is shown below.

A new connection string parameter named MultiSubnetFailover is made available

The application was created using the .NET Framework 4.0 with the SQL Server Native Client 11.1. Similarly, the parameter can also be used with SQL Server Management Studio under the Additional Connection Parameters tab.

SQL Server Management Studio Additional Connection Parameters to set the MultiSubnetFailover to True

Or the new -M parameter in sqlcmd.

Or the new -M parameter in sqlcmd.

Be sure to do a failover and failback test while your applications are connected to verify that they are indeed getting properly redirected to the correct subnet.

Congratulations! You now have a working two-node SQL Server 2012 multi-subnet cluster and have verified that client applications have been tested for connectivity.

Next Steps
  • Read more about SQL Server 2012 Multi-Subnet Clustering on MSDN.
  • Review the previous tips on Install SQL Server 2008 on a Windows Server 2008 Cluster Part 1, Part 2, Part 3 and Part 4.
  • Download and install an Evaluation copy of Windows Server 2008 R2 for this tip and SQL Server 2012 in preparation for the next one.
  • Start working on building your test environment in preparation for setting up a SQL Server 2012 multi-subnet cluster on Windows Server 2008 R2. This is a great opportunity to learn more about networking concepts and fundamentals as a SQL Server DBA.


sql server categories

sql server webinars

subscribe to mssqltips

sql server tutorials

sql server white papers

next tip



About the author
MSSQLTips author Edwin Sarmiento Edwin M Sarmiento is a Microsoft SQL Server MVP and Microsoft Certified Master from Ottawa, Canada specializing in high availability, disaster recovery and system infrastructures.

This author pledges the content of this article is based on professional experience and not AI generated.

View all my tips



Comments For This Article




Tuesday, December 2, 2014 - 3:49:20 PM - Jay Back To Top (35466)

 This is one of the most comprehensive ariticle for multisite cluster , with a SQL Server perspective. Thank you very much for writing this up, your articles answered so many of my questions that I got while referring other resources. 

 

 


Monday, June 23, 2014 - 7:41:59 PM - Clint Back To Top (32361)

Hi Edwin,

I am looking at building a SQL 2012 cross site Availability Group on a Windows 2012 Multi Subnet Cluster.

Would there be anything extra I would need to configure that is not contained in this blog post?  For example not needing to use shared storage?

 


Wednesday, April 30, 2014 - 11:13:34 AM - bass_player Back To Top (30566)

Jot,

Here's what you can do to test connectivity if you failover your SQL Server cluster to a different subnet. After failover, flush the DNS cache on your application server using ipconfig /flushdns. This will force the DNS client on the application server to request for new DNS entries. Since the IP address on the same subnet will no longer respond, the DNS client will use the IP on the other subnet to connect to SQL Server


Saturday, April 26, 2014 - 12:50:32 AM - Jot Back To Top (30526)

Thanks for the great article!

We always use ODBC data source in Admin Tools to test the connectivity to SQL from our Application Servers which does not work anymore as it does not support Multi Subnet Failover=true propert?

What do you suggest using now to test our SQL connectivit from our Application servers?


Thursday, October 10, 2013 - 9:28:03 PM - bass_player Back To Top (27111)

Hi Charles,

That warning is expected. Please refer to the Tip section on this MSDN article. While the topic refers to AlwaysOn Availability Groups, the same concept applies to multi-subnet SQL Server failover clustered instances


Thursday, October 10, 2013 - 4:37:38 PM - Charles Deaton Back To Top (27110)

Your recommendation for setting RegisterAllProvidersIP seems to be different this link.

http://www.doitfixit.com/index.php?option=com_content&view=article&id=204:how-to-change-registerallprovidersip-property-value&catid=48:active-directory&Itemid=53

 

 

 

 


Wednesday, June 19, 2013 - 4:57:31 PM - bass_player Back To Top (25493)

Yes, it is possible to build a multi-instance, multi-subnet cluster with both instances serving database requests. You just need to make sure that application reconnection is taken cared of when one instance fails over to one subnet. Because even if the SQL Server instance can quickly failover but if your application cannot connect to it, that's a perceived downtime that will factor in to your recovery objectives


Wednesday, June 19, 2013 - 8:10:38 AM - Sachin Back To Top (25480)

Hi Edwin,

Thanks for informative series.

Is it possible to build an Multi subnet active / active cluster? we want application should connect to local active instance.

 

Regards

Sachin

 

 


Tuesday, March 19, 2013 - 8:59:40 AM - kumar Back To Top (22868)

Thanks for your response.

Can you please let me know how i can assign only one ip to the existing sql network name,i dont want to use two ip's in different subnets.

This need to be done without unclustering the existing cluster


Monday, March 18, 2013 - 5:39:49 PM - bass_player Back To Top (22854)

I'm assuming that this is a multi-subnet cluster from your description. There might be a couple of things to look at here but worth considering this first: Are your clients getting the error message when they connect? Take a look at the DNS configuration on your cluster. The RegisterAllProvidersIP parameter on the cluster resource simply means that it will register all of the virtual IP address used by the cluster on the DNS. If this is not set to TRUE, only the active virtual IP address will be registered on your DNS server. Once you failover, the active virtual IP address will be registered to the DNS server. However, the client applications will not be aware of it immediately because of the default DNS TTL value of 20 minutes. This is why you also need to specify the HostRecordTTL parameter value on your cluster resource. You have to extensively test failover and failback from the SQL Server side and application side prior to going into production to make sure that your infrastructure and design will behave as expected.


Monday, March 18, 2013 - 2:33:09 PM - kumar Back To Top (22850)

Hi,

Few days back i added a node to the existing sql 2012 failover cluster.While performing the add node i got the same message which you got above in cluster network configuration page and i pressed YES.

Now i can see two IP's under the SQL Network name and i am not sure whether this exists earlier or not.

Now actual problem is tried failover few days back and sql server is online but client conections are not getting accepted  and getting below error:

SQL Server could not listen on IP address [] because the cluster resource 'New IP Address' is not online (state = 3). This is an informational message and may indicate that resource 'SQL Network Name ()' has OR type of dependency on several IP addresses some of which are currently offline or in a failed state. Further action is only required if it is generally possible to bind the IP address of the cluster resource 'New IP Address' to a network segment on the current hosting node. 

So now could you please confirm how this can be resolved and how can i remove one of the ip under the sql network name as i want to use only one ip for sql network name instead of two.


Wednesday, September 12, 2012 - 3:49:12 PM - Kapil Back To Top (19494)

Execellect Article. I learnt lot of things from this article.

 

Thanks

Kapil Mahershi















get free sql tips
agree to terms