DB2 - Problem description
Problem IC74633 | Status: Closed |
A PANIC MAY OCCUR AFTER ISSUING DB2STOP FORCE ON HADR STANDBY WHILE TAKEOVER HADR DB OPERATION IS IN PROGRESS | |
product: | |
DB2 FOR LUW / DB2FORLUW / 950 - DB2 | |
Problem description: | |
Running db2 takeover hadr on db on a standby machine followed by db2 stop force before the takeover command completes successfully can cause memory to be freed twice. This would lead to a panic in the system. This APAR can only occur on DB2 9.5 FP7. 2010-11-25-17.42.00.931548-300 I68908668E377 LEVEL: Warning PID : 21039 TID : 47382024677696PROC : db2sysc INSTANCE: svtdbm NODE : 000 EDUID : 310 EDUNAME: db2hadrp (SAKE) FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrSDoTakeover, probe:47003 MESSAGE : Info: Standby has completed takeover (now primary). 2010-11-25-17.42.00.944158-300 I68909046E342 LEVEL: Warning PID : 21039 TID : 47382024677696PROC : db2sysc INSTANCE: svtdbm NODE : 000 EDUID : 310 EDUNAME: db2hadrp (SAKE) FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduS, probe:20342 MESSAGE : Info: Standby Finished. 2010-11-25-17.42.00.945189-300 I68909389E553 LEVEL: Warning PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, base sys utilities, sqeAgentServices::RequestAgent, probe:12 MESSAGE : Coordinator request rejected. DATA #1 : Hexdump, 4 bytes 0x00002B17E2FE9400 : 0700 0000 .... 2010-11-25-17.42.00.948897-300 I68909943E461 LEVEL: Error PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, base sys utilities, sqeAgentServices::RequestAgent, probe:70 MESSAGE : Agent not allocated, sqlcode = -6036 2010-11-25-17.42.00.950313-300 E68910405E495 LEVEL: Warning PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, base sys utilities, sqeAgentServices::RequestAgent, probe:100 MESSAGE : ADM7519W DB2 could not allocate an agent. The SQLCODE is "-6036". 2010-11-25-17.42.00.954150-300 I68910901E456 LEVEL: Severe PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, base sys utilities, sqeIcoordCB::DispatchIndependentDBAgent, probe:110 RETCODE : ZRC=0xFFFFE86C=-6036 2010-11-25-17.42.00.955750-300 I68911358E544 LEVEL: Error PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrSpawnICoord, probe:30100 MESSAGE : HADR ICoord Error: DATA #1 : Hexdump, 4 bytes 0x00002B17E2FEA9B0 : 6D00 0580 m... 2010-11-25-17.42.00.957140-300 I68911903E533 LEVEL: Error PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrSpawnICoord, probe:30101 MESSAGE : sqlcode: DATA #1 : Hexdump, 4 bytes 0x00002B17E2FEA9F0 : 6CE8 FFFF l... 2010-11-25-17.42.00.958549-300 I68912437E558 LEVEL: Warning PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrTakeoverHdrRouteIn, probe:55603 RETCODE : ZRC=0x8005006D=-2147155859=SQLE_CA_BUILT "SQLCA has been built and saved in component specific control block." 2010-11-25-17.42.00.954201-300 I68912996E341 LEVEL: Warning PID : 21039 TID : 47382024677696PROC : db2sysc INSTANCE: svtdbm NODE : 000 EDUID : 310 EDUNAME: db2hadrp (SAKE) FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduP, probe:20301 MESSAGE : Info: Primary Started. 2010-11-25-17.42.00.965391-300 I68913338E389 LEVEL: Warning PID : 21039 TID : 47382024677696PROC : db2sysc INSTANCE: svtdbm NODE : 000 EDUID : 310 EDUNAME: db2hadrp (SAKE) FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrSetTcpWindowSize, probe:32201 MESSAGE : Info: HADR Socket send buffer size, SO_SNDBUF: 16384 bytes 2010-11-25-17.42.00.966825-300 I68913728E392 LEVEL: Warning PID : 21039 TID : 47382024677696PROC : db2sysc INSTANCE: svtdbm NODE : 000 EDUID : 310 EDUNAME: db2hadrp (SAKE) FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrSetTcpWindowSize, probe:32251 MESSAGE : Info: HADR Socket receive buffer size, SO_RCVBUF: 87380 bytes 2010-11-25-17.42.00.965075-300 I68914121E1738 LEVEL: Severe PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, SQO Memory Management, sqloDiagnoseFreeBlockFailure, probe:10 MESSAGE : Possible memory corruption detected. DATA #1 : ZRC, PD_TYPE_ZRC, 4 bytes 0x820F0002 DATA #2 : Corrupt block address, PD_TYPE_CORRUPT_BLK_PTR, 8 bytes 0x00002b1789615100 DATA #3 : Block header, PD_TYPE_BLK_HEADER, 24 bytes 0x00002B17896150E8 : 1596 78B1 0200 B0FA 0000 0000 0000 0000 ..x............. 0x00002B17896150F8 : 0000 0000 0000 0000 ........ DATA #4 : Data header, PD_TYPE_BLK_DATA_HEAD, 48 bytes 0x00002B1789615100 : 8000 0000 0000 0000 DDDD DDDD DDDD DDDD ................ 0x00002B1789615110 : DDDD DDDD DDDD DDDD DDDD DDDD DDDD DDDD ................ CALLSTCK: [0] 0x00002B1761A3CEEC pdLog + 0x18C [1] 0x00002B1763F7042A /home/svtdbm/sqllib/lib64/libdb2e.so.1 + 0x382E42A [2] 0x00002B1763F70F52 sqlofmblkEx + 0x130 [3] 0x00000000004147B6 _Z9sqlofmblkPv + 0x18 [4] 0x00002B1763EE040A sqlo_acs_fblk + 0x1C [5] 0x00002B176259A889 _Z26sqlexAgentCleanUserSecInfoP16SQLEX_AUTHINFO_TP5sqlca + 0xA3 [6] 0x00002B1762385917 _ZN14sqeApplication12AppStopUsingEP8sqeAgenthP5sqlca + 0xF67 [7] 0x00002B1762532B3C _Z11sqlesrspWrpP14db2UCinterface + 0x2D8 [8] 0x00002B176253306D sqleUCagentConnectReset + 0x17D [9] 0x00002B17626B852C /home/svtdbm/sqllib/lib64/libdb2e.so.1 + 0x1F7652C 2010-11-25-17.42.00.976550-300 I68915860E456 LEVEL: Severe PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, SQO Memory Management, sqloDiagnoseFreeBlockFailure, probe:30 DATA #1 : String, 12 bytes Double free. 2010-11-25-17.42.00.978148-300 E68916317E928 LEVEL: Critical PID : 21039 TID : 47381592664384PROC : db2sysc INSTANCE: svtdbm NODE : 000 DB : SAKE APPHDL : 0-28727 APPID: *LOCAL.svtdbm.101126110514 AUTHID : SVTDBM EDUID : 630 EDUNAME: db2agent (SAKE) FUNCTION: DB2 UDB, base sys utilities, sqle_panic, probe:10 MESSAGE : ADM14001C An unexpected and critical error has occurred: "Panic". The instance may have been shutdown as a result. "Automatic" FODC (First Occurrence Data Capture) has been invoked and diagnostic information has been recorded in directory "/home/svtdbm/sqllib/db2dump/FODC_Panic_2010-11-25-17.42.00.9780 28/". Please look in this directory for detailed evidence about what happened and contact IBM support if necessary to diagnose the problem. | |
Problem Summary: | |
**************************************************************** * USERS AFFECTED: * * HADR environments * **************************************************************** * PROBLEM DESCRIPTION: * * Running db2 takeover hadr on db on a standby machine * * followed by * * db2 stop force before the takeover command completes * * * * successfully can cause memory to be freed twice. This would * * lead * * to a panic in the system. This APAR can only occur on DB2 * * 9.5 * * FP7. * **************************************************************** * RECOMMENDATION: * * Upgrade to DB2 9.5 FP8 or higher * **************************************************************** | |
Local Fix: | |
Don't issue db2stop force on standby instance while HADR takeover database operation is being performed. | |
available fix packs: | |
DB2 Version 9.5 Fix Pack 8 for Linux, UNIX, and Windows | |
Solution | |
The fix to avoid the double free of memory in this situation is included in DB2 9.5 FP8. | |
Workaround | |
not known / see Local fix | |
Timestamps | |
Date - problem reported : Date - problem closed : Date - last modified : | 22.02.2011 27.06.2011 27.06.2011 |
Problem solved at the following versions (IBM BugInfos) | |
9.5.FP8 | |
Problem solved according to the fixlist(s) of the following version(s) | |
9.5.0.8 |