DB2 - Problem description
Problem IT03811 | Status: Closed |
SANITY CHECK DUAL LOG ENTRIES IN THE CASE CRASH RECOVERY HANG DUE TO DUAL ARRAY IS FULL | |
product: | |
DB2 FOR LUW / DB2FORLUW / A50 - DB2 | |
Problem description: | |
The stack trace of the hung db2loggw EDU is incomplete:

Stack:
======================
0x09000000004D8570 _p_nsleep + 0x10
0x090000000002B744 nsleep + 0xE4
0x09000000001555A8 nanosleep + 0x188
0x090000000F367808 ossSleep + 0xA8
0x090000001FDAA16C sqlorest + 0x188
0x090000001D729930 sqlpgStartMfhEntry__FP9SQLP_DBCBCUiCUl + 0x8B8

However, consider the following messages:

2014-03-01-12.44.43.531921-300 I36007A436 LEVEL: Info
PID : 11206932 TID : 12338 PROC : db2sysc 0
INSTANCE: db2inst1 NODE : 000 DB : SAMPLE<<<<<
HOSTNAME: host1
EDUID : 12338 EDUNAME: db2loggr (SAMPLE) 0
FUNCTION: DB2 UDB, data protection services, sqlpgSwitchProcessAllLogStreams, probe:1600 //the switch phase
DATA #1 : <preformatted>
Log stream 0, TailIndex 59, extNum 303894

2014-03-01-12.44.43.573499-300 I36444A576 LEVEL: Info
PID : 11206932 TID : 13109 PROC : db2sysc 0
INSTANCE: db2inst1 NODE : 000 DB : SAMPLE<<<<<
HOSTNAME: host1
EDUID : 13109 EDUNAME: db2loggw (SAMPLE) 0
FUNCTION: DB2 UDB, data protection services, sqlpgOpenLogExtent, probe:850
MESSAGE : ZRC=0x860F000A=-2045837302=SQLO_FNEX "File not found."
DIA8411C A file "" could not be found.
DATA #1 : <preformatted>
Error opening database log /db2/SAMPLE/log_dir2/NODE0000/LOGSTREAM0000/S0303894.LOG.

sqlpgSwitchProcessAllLogStreams, probe:1600 means that the redo phase is nearly complete and DB2 is performing a 'switch' phase in the loggr EDU. Before redo completes, there is only one place where the loggw EDU could be involved: sqlpgCrashRecoveryPostRedo. loggw may have needed to write something to the log file but failed to write to the mirror log, so it called sqlpgStartOrCompleteMfhEntryAndUpdateMFHIfNecessary to mark the mirror log path BAD. It then found that the dual log array was full:

2014-03-01-12.44.43.592614-300 I37021A504 LEVEL: Info
PID : 11206932 TID : 13109 PROC : db2sysc 0
INSTANCE: db2inst1 NODE : 000 DB : SAMPLE<<<<<
HOSTNAME: host1
EDUID : 13109 EDUNAME: db2loggw (SAMPLE) 0
FUNCTION: DB2 UDB, data protection services, sqlpgStartMfhEntry, probe:7240
DATA #1 : <preformatted>
Dual log array is full. Checking if any new logs have been archived. end: 366957, lowestUnarchivedExtNum: 303893, path: 9

2014-03-01-12.44.43.594471-300 I37526A409 LEVEL: Info
PID : 11206932 TID : 13109 PROC : db2sysc 0
INSTANCE: db2inst1 NODE : 000 DB : SAMPLE<<<<<
HOSTNAME: host1
EDUID : 13109 EDUNAME: db2loggw (SAMPLE) 0
FUNCTION: DB2 UDB, data protection services, sqlpgStartMfhEntry, probe:7280
MESSAGE : Wait 5 minutes for log file to be archived

2014-03-01-12.44.43.595863-300 I37936A171 LEVEL: Warning
PID:11206932 TID:13109 NODE:000 Title: SQLP_LFH
Dump File:/db2/db2inst1/db2dump/11206932.13109.000.dump.bin

This APAR adds sanity checking and also "corrects" any dual log entries that can be determined to be incorrect and for which an appropriate replacement value can be determined. | |
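The diagnostic values quoted in the problem description can be cross-checked with a few lines of Python. This is a minimal sketch, not DB2 tooling. The helper names (zrc_to_signed, extent_file_name, parse_dual_array_message) are hypothetical. The file-name pattern is inferred only from the pairing of extNum 303894 with S0303894.LOG in the log excerpt, and the regular expression assumes the exact message wording shown in the probe:7240 entry.

```python
import re


def zrc_to_signed(zrc_hex: str) -> int:
    """Interpret a 32-bit return code given in hex as a signed integer."""
    value = int(zrc_hex, 16)
    # A set top bit means the value is negative in 32-bit two's complement.
    return value - (1 << 32) if value & 0x80000000 else value


def extent_file_name(ext_num: int) -> str:
    """Assumed naming: 'S' + extent number zero-padded to 7 digits + '.LOG'."""
    return f"S{ext_num:07d}.LOG"


# Assumed layout, copied from the "Dual log array is full" message text.
DUAL_ARRAY_RE = re.compile(
    r"end: (?P<end>\d+), "
    r"lowestUnarchivedExtNum: (?P<lowest>\d+), "
    r"path: (?P<path>\d+)"
)


def parse_dual_array_message(text: str) -> dict:
    """Extract the numeric fields from a 'Dual log array is full' message."""
    match = DUAL_ARRAY_RE.search(text)
    if match is None:
        raise ValueError("not a 'Dual log array is full' message")
    return {name: int(value) for name, value in match.groupdict().items()}


if __name__ == "__main__":
    print(zrc_to_signed("0x860F000A"))   # -2045837302, as in the ZRC line
    print(extent_file_name(303894))      # S0303894.LOG
    message = (
        "Dual log array is full. Checking if any new logs have been "
        "archived. end: 366957, lowestUnarchivedExtNum: 303893, path: 9"
    )
    print(parse_dual_array_message(message))
```

The first two checks confirm that the numbers in the sqlpgOpenLogExtent entry are mutually consistent. The parsed fields show that end (366957) lies far beyond the extent being opened (303894); the source does not state whether this gap is the condition the new sanity check tests for.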
Problem Summary: | |
****************************************************************
* USERS AFFECTED:
* ALL
****************************************************************
* PROBLEM DESCRIPTION:
* See Error Description
****************************************************************
* RECOMMENDATION:
* Upgrade to DB2 V10.5 FP5 or higher.
**************************************************************** | |
Local Fix: | |
Solution | |
Problem fixed in DB2 V10.5 FP5 | |
Workaround | |
not known / see Local fix | |
Timestamps | |
Date - problem reported : | 15.08.2014 |
Date - problem closed : | 31.03.2015 |
Date - last modified : | 31.03.2015 |
Problem solved at the following versions (IBM BugInfos) | |
Problem solved according to the fixlist(s) of the following version(s) | |
10.5.0.5 |