DB2 - Problem description
Problem IT06221 | Status: Closed
Q CAPTURE WARM START MIGHT FAIL DUE TO -1273 FROM DB2LOGREAD AFTER RESTORE AND RFWD, IF YOU DELETE LOGS IN PREVIOUS LOG CHAIN
Product:
DB2 FOR LUW / DB2FORLUW / A50 - DB2
Problem description:
A Q Capture warm start might fail because the db2ReadLog API returns SQLCODE -1273 after a restore and rollforward operation on pureScale, even if a Q Capture cold start was performed afterwards, if you delete the logs in the previous log chain.

This can happen if, after the rollforward and the subsequent QREP cold start, the workload is driven on only a subset of the members. When the idle members later start logging, a warm QREP scan always needs to retrieve logs from the previous log chain. If the logs in the previous log chain have been deleted, that retrieval fails and the scan ends with -1273.

The capture log might contain the following messages.

ASN8041D "Q Capture" : "ASN01" : "LogrdThread" : db2LogRead API is sending us backwards in the DB2 Log: First LSN is "0000:0000:35cb:b965:0000:0001:f4aa:ed74" while Next Start LSN is "0000:0000:0000:0000:0000:0000:0000:0000"
ASN0552E "Q Capture" : "ASN01" : "logrd" : The program encountered an SQL error. The server name is "SAMPLE". The SQL request is "db2ReadLog". The table name is "". The SQLCODE is "-1273". The SQLSTATE is " " ....
ASN0005E CAPTURE "ASN01" : "LogrdThread". The Capture program encountered an error when reading the DB2 log. The log sequence number is "0000:0000:35CB:B965:0000:0001:F4AA:ED74", the SQLCODE is "-1273", and the reason code is "".
ASN7156I "Q Capture" : "ASN01" : "WorkerThread" : The Q Capture program stopped processing log records in log file "/db2/actlog/NODE0000/LOGSTREAM0004/S0001753.LOG".

The corresponding messages in db2diag.log look like this.

2014-12-03-00.34.56.533231+540 I343011474A1222 LEVEL: Info
PID : 22282450 TID : 37914 PROC : db2sysc 4
INSTANCE: db2inst1 NODE : 004 DB : SAMPLE
HOSTNAME: host01
EDUID : 37914 EDUNAME: db2lfr.4 (SAMPLE) 4
FUNCTION: DB2 UDB, data protection services, sqlpgOpenLogExtent, probe:2164
MESSAGE : ZRC=0x860F000A=-2045837302=SQLO_FNEX "File not found."
          DIA8411C A file "" could not be found.
DATA #1 : String, 19 bytes
Log is not found...
DATA #2 : Codepath, 8 bytes
18:23:25
DATA #3 : String, 99 bytes
extNum / logLocation / logStreamId / callerFlag / logFileName1 / logFileName2 / callerSpecifiedPath
DATA #4 : SQLPG_EXTENT_NUM, PD_TYPE_SQLPG_EXTENT_NUM, 4 bytes
1753
DATA #5 : unsigned integer, 4 bytes
0
DATA #6 : db2LogStreamIDType, PD_TYPE_DB2_LOG_STREAM_ID, 2 bytes
4
DATA #7 : Hex integer, 8 bytes
0x0000000000000401
DATA #8 : String, 0 bytes
Object not dumped: Address: 0x0A000000107FB038 Size: 0 Reason: Zero-length data
DATA #9 : String, 0 bytes
Object not dumped: Address: 0x0A000000107FB160 Size: 0 Reason: Zero-length data
DATA #10: String, 0 bytes
Object not dumped: Address: 0x0000000000000000 Size: 0 Reason: Address is NULL

2014-12-03-00.34.56.536369+540 E343012697A597 LEVEL: Info
PID : 22282450 TID : 37143 PROC : db2sysc 4
INSTANCE: db2inst1 NODE : 004 DB : SAMPLE
HOSTNAME: host01
EDUID : 37143 EDUNAME: db2logmgr.4 (SAMPLE) 4
FUNCTION: DB2 UDB, data protection services, sqlpgRetrieveLogFile, probe:4130
DATA #1 : <preformatted>
Started retrieve for log file S0001753.LOG for log stream 4 from log chain 23.
Retrieval destination: /db2/actlog/NODE0000/LOGSTREAM0004/LOGSTREAM0004/
Actual retrieval method: USE_METHOD1

2014-12-03-00.34.56.543735+540 I343013295A513 LEVEL: Info
PID : 22282450 TID : 37143 PROC : db2sysc 4
INSTANCE: db2inst1 NODE : 004 DB : SAMPLE
HOSTNAME: host01
EDUID : 37143 EDUNAME: db2logmgr.4 (SAMPLE) 4
FUNCTION: DB2 UDB, data protection services, sqlpgRetrieveLogFile, probe:4148
DATA #1 : <preformatted>
Completed retrieve for log file S0001753.LOG on chain 23 to /db2/actlog/NODE0000/LOGSTREAM0004/LOGSTREAM0004/.

2014-12-03-00.34.56.544729+540 I343013809A521 LEVEL: Severe
PID : 22282450 TID : 37914 PROC : db2sysc 4
INSTANCE: db2inst1 NODE : 004 DB : SAMPLE
HOSTNAME: host01
EDUID : 37914 EDUNAME: db2lfr.4 (SAMPLE) 4
FUNCTION: DB2 UDB, data protection services, sqlplfrGetFileHandleToReadFrom, probe:5250
DATA #1 : <preformatted>
LFR Scan Num = 10720
LFR Scan Caller's EDUID = 208328
Both log paths seem to be bad, try path1. extentNum = 1753
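To check a Q Capture log for this signature, the messages quoted above can be matched with a few regular expressions. The following is only a minimal sketch: the function name summarize_capture_log and the returned dictionary layout are hypothetical, and the patterns are derived solely from the message texts shown in this report, so real logs with different wrapping or wording may need adjusted patterns.

```python
import re

# Patterns taken from the message texts quoted in this report (assumption:
# real capture logs use the same wording; lines may be wrapped differently).
SQLCODE_RE = re.compile(r'SQLCODE is "(-?\d+)"')
STOPPED_RE = re.compile(r'ASN7156I.*?log file "([^"]+)"', re.DOTALL)
EXTENT_RE = re.compile(r'LOGSTREAM(\d+)/S(\d+)\.LOG$')


def summarize_capture_log(text):
    """Report whether SQLCODE -1273 appears and where Q Capture stopped."""
    sqlcodes = {int(code) for code in SQLCODE_RE.findall(text)}
    stopped = []
    for path in STOPPED_RE.findall(text):
        match = EXTENT_RE.search(path)
        stream = int(match.group(1)) if match else None
        extent = int(match.group(2)) if match else None
        stopped.append({"path": path, "log_stream": stream, "extent": extent})
    return {"hit_1273": -1273 in sqlcodes, "stopped_at": stopped}


# Shortened sample built from the messages above.
SAMPLE = (
    'ASN0552E "Q Capture" : "ASN01" : "logrd" : The program encountered an '
    'SQL error. The SQL request is "db2ReadLog". The SQLCODE is "-1273".\n'
    'ASN7156I "Q Capture" : "ASN01" : "WorkerThread" : The Q Capture program '
    'stopped processing log records in log file '
    '"/db2/actlog/NODE0000/LOGSTREAM0004/S0001753.LOG".\n'
)

result = summarize_capture_log(SAMPLE)
```

The log stream and extent number extracted from the ASN7156I path can then be compared with the extNum and logStreamId values in the db2diag.log entries to confirm that both logs describe the same missing extent.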
Problem Summary:
****************************************************************
* USERS AFFECTED:                                              *
* All who use Q Capture on pureScale environment               *
****************************************************************
* PROBLEM DESCRIPTION:                                         *
* See Error Description                                        *
****************************************************************
* RECOMMENDATION:                                              *
* Upgrade to Version 10.5 Fix Pack 7.                          *
****************************************************************
Local Fix:
Before the cold start, run a dummy workload (create, update, and drop dummy tables) on all the members so that every log stream has written log records. The full sequence is:
1) RESTORE/RFWD
2) Create/update/drop dummy tables on all the members
3) Successful QREP cold start
4) Successful QREP warm start
5) (Move/rename logs in previous log chain, for your safety)
6) Delete logs in previous chain
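Step 2 of the local fix can be sketched as a small generator of per-member SQL statements. This is an illustration under stated assumptions, not a supplied script: the function name dummy_workload, the schema QREPFIX, and the table naming are all hypothetical, the member numbers must come from your own cluster, and binding each session to a specific member before running its statements is left to the caller.

```python
def dummy_workload(members, schema="QREPFIX"):
    """Build create/insert/update/drop statements for each member.

    `schema` and the DUMMY_M<n> table names are hypothetical placeholders.
    Each member's statements must be executed (and committed) in a session
    connected to that member so that its log stream writes log records.
    """
    plan = {}
    for member in members:
        table = f"{schema}.DUMMY_M{member}"
        plan[member] = [
            f"CREATE TABLE {table} (ID INTEGER NOT NULL, NOTE VARCHAR(32))",
            f"INSERT INTO {table} VALUES (1, 'before')",
            f"UPDATE {table} SET NOTE = 'after' WHERE ID = 1",
            f"DROP TABLE {table}",
        ]
    return plan


# Example for a cluster assumed to have members 0, 1 and 2.
plan = dummy_workload([0, 1, 2])
for member, statements in plan.items():
    for statement in statements:
        print(f"member {member}: {statement}")
```

Using a separate table per member keeps the statements independent, so the workload on one member does not wait on locks held by another.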
Solution
Problem was first fixed in DB2 Version 10.5 Fix Pack 7.
Workaround
not known / see Local fix
Timestamps
Date - problem reported : 24.12.2014
Date - problem closed : 08.02.2016
Date - last modified : 08.02.2016
Problem solved at the following versions (IBM BugInfos)
Problem solved according to the fixlist(s) of the following version(s)
10.5.0.7
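Whether an installed level already contains the fix can be checked by comparing it with 10.5.0.7. The sketch below assumes the level is available as a dotted string such as "10.5.0.5"; the function names are hypothetical, and because this report only lists the 10.5 stream, any other release returns None rather than a guess.

```python
FIXED_IN = (10, 5, 0, 7)  # first level listed as fixed in this report


def parse_level(level):
    """Turn a dotted level string such as 'v10.5.0.7' into a tuple of ints."""
    return tuple(int(part) for part in level.strip().lstrip("vV").split("."))


def has_fix(level):
    """Return True/False within the 10.5 stream, None for other releases."""
    parsed = parse_level(level)
    if parsed[:2] != FIXED_IN[:2]:
        return None  # not covered by this report
    return parsed >= FIXED_IN


checks = {lvl: has_fix(lvl) for lvl in ("10.5.0.5", "10.5.0.7", "v10.5.0.8", "11.1.0.0")}
```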