home clear 64x64
en blue 200x116 de orange 200x116 info letter User
suche 36x36
Latest versionsfixlist
11.1.0.7 FixList
10.5.0.9 FixList
10.1.0.6 FixList
9.8.0.5 FixList
9.7.0.11 FixList
9.5.0.10 FixList
9.1.0.12 FixList
Have problems? - contact us.
Register for free anmeldung-x26
Contact form kontakt-x26

DB2 - Problem description

Problem IC95313 Status: Closed

TSA AUTOMATED HADR DATABASE DOES NOT FAILOVER AFTER UNPLUGGING PUBLIC
NETWORK CABLE FROM THE PRIMARY SERVER

product:
DB2 FOR LUW / DB2FORLUW / A50 - DB2
Problem description:
In a TSA-MP managed HADR environment, if the public network 
cable is unplugged from the HADR primary server, the HADR 
database is unable to failover to the standby server. 
 
See the following example for more details: 
 
- lssam output prior to unplugging the network cable: 
 
Online IBM.ResourceGroup:db2_db2inst1_db2inst1_HADRDB-rg 
Nominal=Online 
        |- Online 
IBM.Application:db2_db2inst1_db2inst1_HADRDB-rs 
                |- Online 
IBM.Application:db2_db2inst1_db2inst1_HADRDB-rs:node01 
                '- Offline 
IBM.Application:db2_db2inst1_db2inst1_HADRDB-rs:node02 
        '- Online IBM.ServiceIP:db2ip_10_10_3_111-rs 
                |- Online 
IBM.ServiceIP:db2ip_10_10_3_111-rs:node01 
                '- Offline 
IBM.ServiceIP:db2ip_10_10_3_111-rs:node02 
Online IBM.ResourceGroup:db2_db2inst1_node01_0-rg Nominal=Online 
        '- Online IBM.Application:db2_db2inst1_node01_0-rs 
                '- Online 
IBM.Application:db2_db2inst1_hostA_0-rs:node01 
Online IBM.ResourceGroup:db2_db2inst1_node02_0-rg Nominal=Online 
        '- Online IBM.Application:db2_db2inst1_node02_0-rs 
                '- Online 
IBM.Application:db2_db2inst1_node02_0-rs:node02 
Online IBM.Equivalency:db2_db2inst1_db2inst1_HADRDB-rg_group-equ 
        |- Online IBM.PeerNode:node01:node01 
        '- Online IBM.PeerNode:node02:node02 
Online IBM.Equivalency:db2_db2inst1_node01_0-rg_group-equ 
        '- Online IBM.PeerNode:node01:node01 
Online IBM.Equivalency:db2_db2inst1_node02_0-rg_group-equ 
        '- Online IBM.PeerNode:node02:node02 
Online IBM.Equivalency:db2_private_network_0 
        |- Online IBM.NetworkInterface:en1:node01 
        '- Online IBM.NetworkInterface:en1:node02 
Online IBM.Equivalency:db2_public_network_0 
        |- Online IBM.NetworkInterface:en2:node02 
        '- Online IBM.NetworkInterface:en2:node01 
 
 
- lssam output after the network cable is unplugged: 
 
Pending Online IBM.ResourceGroup:db2_db2inst1_db2inst1_HADRDB-rg 
Request=Lock Nominal=Online 
        |- Offline 
IBM.Application:db2_db2inst1_db2inst1_HADRDB-rs 
Control=StartInhibitedBecauseSuspended 
                |- Offline 
IBM.Application:db2_db2inst1_db2inst1_HADRDB-rs:node01 
                '- Offline 
IBM.Application:db2_db2inst1_db2inst1_HADRDB-rs:node02 
        '- Online IBM.ServiceIP:db2ip_10_10_3_111-rs 
                |- Online 
IBM.ServiceIP:db2ip_10_10_3_111-rs:node01 
                '- Offline 
IBM.ServiceIP:db2ip_10_10_3_111-rs:node02 
Failed offline IBM.ResourceGroup:db2_db2inst1_node01_0-rg 
Binding=Sacrificed Nominal=Online 
        '- Offline IBM.Application:db2_db2inst1_node01_0-rs 
                '- Offline 
IBM.Application:db2_db2inst1_hostA_0-rs:node01 
Online IBM.ResourceGroup:db2_db2inst1_node02_0-rg Nominal=Online 
        '- Online IBM.Application:db2_db2inst1_node02_0-rs 
                '- Online 
IBM.Application:db2_db2inst1_node02_0-rs:node02 
Online IBM.Equivalency:db2_db2inst1_db2inst1_HADRDB-rg_group-equ 
        |- Online IBM.PeerNode:node01:node01 
        '- Online IBM.PeerNode:node02:node02 
Online IBM.Equivalency:db2_db2inst1_node01_0-rg_group-equ 
        '- Online IBM.PeerNode:node01:node01 
Online IBM.Equivalency:db2_db2inst1_node02_0-rg_group-equ 
        '- Online IBM.PeerNode:node02:node02 
Online IBM.Equivalency:db2_private_network_0 
        |- Online IBM.NetworkInterface:en1:node01 
        '- Online IBM.NetworkInterface:en1:node02 
Online IBM.Equivalency:db2_public_network_0 
        |- Online IBM.NetworkInterface:en2:node02 
        '- Offline IBM.NetworkInterface:en2:node01 
 
As displayed in the above lssam output, HADR is stopped 
(resource is set to "Offline") on the original primary (node01), 
but node02 does not takeover the primary HADR role, i.e. the 
HADR resource for node02 is also set as "Offline". 
 
In addition to this, the virtual IP address (IBM.ServiceIP 
resource) still binds to the original primary server. (node01) 
 
---------------------------------------------------------------- 
 
 
In the above scenario, whereby the public network cable is 
unplugged, the IBM.ServiceIP resource is not brought offline by 
TSA on the primary node(node01). There needs to be an additional 
dependency created from the HADR resource to the public network 
equivalency which will allow the HADR failover process to be 
initiated in the case of a public network cable pull. With this 
additional dependency in place, the HADR resource will be able 
to successfully failover from the primary to the standby in the 
event of a public network cable pull.
Problem Summary:
**************************************************************** 
* USERS AFFECTED:                                              * 
* ALL                                                          * 
**************************************************************** 
* PROBLEM DESCRIPTION:                                         * 
* See Error Description                                        * 
**************************************************************** 
* RECOMMENDATION:                                              * 
* Update to 10.5.0.3                                           * 
****************************************************************
Local Fix:
Verify if there exists a dependency from the HADR resource to 
the public network by issuing the 'lsrel -Ab' command as the DB2 
instance owner. If the dependency exists, here is how it would 
be displayed: 
 
Managed Relationship 1: 
        Class:Resource:Node[Source] = 
IBM.Application:db2_db2inst1_db2inst1_HADRDB-rs 
        Class:Resource:Node[Target] = 
{IBM.Equivalency:db2_public_network_0} 
        Relationship                = DependsOn 
        Conditional                 = NoCondition 
        Name                        = 
db2_db2inst1_db2inst1_HADRDB-rs_DependsOn_db2_public_network_0-r 
el 
        ActivePeerDomain            = hadr_dom 
        ConfigValidity              = 
 
If this dependency does not exist, then create it as follows: 
 
1) Bring the cluster into maintenance mode by running the 
"db2haicu -disable" command as the DB2 instance owner. 
 
2) As root from either node, run the following: 
"export CT_MANAGEMENT_SCOPE=2" 
"mkrel -p dependson -S 
IBM.Application:db2_db2inst1_db2inst1_HADRDB-rs -G 
IBM.Equivalency:db2_public_network_0 
db2_db2inst1_db2inst1_HADRDB-rs_DependsOn_db2_public_network_0-r 
el" 
 
3) Verify that the dependency is now created via the "lsrel -Ab" 
command. 
 
4) Once verified that the dependency exists, exit cluster 
maintenance mode by running the "db2haicu" command as the DB2 
instance owner.
available fix packs:
DB2 Version 10.5 Fix Pack 3 for Linux, UNIX, and Windows
DB2 Version 10.5 Fix Pack 3a for Linux, UNIX, and Windows
DB2 Cancun Release 10.5.0.4 (also known as Fix Pack 4) for Linux, UNIX, and Windows
DB2 Version 10.5 Fix Pack 9 for Linux, UNIX, and Windows

Solution
Problem Fixed In 10.5.0.3
Workaround
not known / see Local fix
Timestamps
Date  - problem reported    :
Date  - problem closed      :
Date  - last modified       :
27.08.2013
03.03.2014
03.03.2014
Problem solved at the following versions (IBM BugInfos)
Problem solved according to the fixlist(s) of the following version(s)
10.5.0.3 FixList
10.5.0.3 FixList