DB2 - Problem description
Problem IC77518 | Status: Closed |
Instance hung at function sqmFastWriterQueueMgr::allocFastWriterRecords when activity event monitors are enabled | |
product: | |
DB2 FOR LUW / DB2FORLUW / 970 - DB2 | |
Problem description: | |
When an activity event monitor is active and collection is enabled, agents generating activity records may block waiting for fast writer records while holding activity list latches (SQLO_LT_sqlrr_curr_activity_cb__sqlrr_curr_activity_cb_latch). This can result in a hang, because the fast writers (which release records for reuse) end up waiting on those same latches. | |
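The circular wait described above can be illustrated with a small wait-for graph. This is only a simplified model written for this note, not DB2 code: the node names and the `find_cycle` helper are hypothetical, and each edge just restates one clause of the description (the agent waits for a record, records are released by fast writers, the fast writer waits for the latch, the latch is held by the agent).

```python
from typing import Dict, List, Optional

# Hypothetical, simplified wait-for graph: each key is blocked on its value.
WAITS_FOR: Dict[str, str] = {
    "agent": "free fast writer record",        # agent blocked waiting for a record
    "free fast writer record": "fast writer",  # records are released by fast writers
    "fast writer": "activity list latch",      # fast writer needs the latch
    "activity list latch": "agent",            # latch is held by the blocked agent
}


def find_cycle(graph: Dict[str, str], start: str) -> Optional[List[str]]:
    """Follow wait-for edges from start; return the cycle if one is reached."""
    path: List[str] = []
    position: Dict[str, int] = {}
    node: Optional[str] = start
    while node is not None:
        if node in position:
            # We came back to a node already on the path: that suffix is the cycle.
            return path[position[node]:]
        position[node] = len(path)
        path.append(node)
        node = graph.get(node)
    return None


cycle = find_cycle(WAITS_FOR, "agent")
print(" -> ".join(cycle + [cycle[0]]) if cycle else "no cycle")
```

Because every participant in the cycle is waiting on the next one, none of them can make progress, which matches the hang reported here.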
Problem Summary: | |
****************************************************************
* USERS AFFECTED:
* All users that use activity event monitors
****************************************************************
* PROBLEM DESCRIPTION:
* Two fast writers are waiting on 2 different SQLP_LHSH__xhshlatches. One (22345.98.000.stack.txt) is performing an insert and is waiting on one of them. The other one ("22345.97.000.stack.txt") is committing some work and is waiting on the other one so it can release some locks:
*
* <StackTrace>
* (0) 0xc000000027da3120 ossDumpStackTrace
* (1) 0xc000000027d9ba80 OSSTrapFile 0x100 ossstack.C:1971
* (2) 0xc000000051ff02d0 sqlo_trce
* (3) 0xc0000000520c6370 sqloDumpDiagInfoHandler
* (4) 0xe0000001205c6f80 ---- Signal 16 (SIGUSR1) delivered ----
* (5) 0xc0000000012f21d0 sched_yield
* (6) 0xc00000004c8b8ec0 sqloSpinLockConflict
* (7) 0xc00000004cd50df0 sqlplrm
* (8) 0xc00000004cdad030 sqlpxcm1 sqlpxcm1.C:502
* (9) 0xc00000004c9f2010 sqlrrcom_dps
* (10) 0xc00000004c90e060 sqlrrcom
* (11) 0xc00000004fbe54a0 sqmRecordTypeArray::commitInserts
* (12) 0xc00000004fb58ac0 sqmRecordTypeArray::processRecord engn/include/sqlm_fast_writer_target_inlines.h:3096
* (13) 0xc00000004fbccc50 sqmFastWriter::processWriterRecord sqlm_fast_writer.C:365
* (14) 0xc00000004c5e8100 sqmFastWriter::executeFastWriter
* (15) 0xc00000004c5ed0e0 sqleIndCoordProcessRequest
* (16) 0xc00000004c6fbef0 sqeAgent::RunEDU sqle_agent.C:1237
* (17) 0xc00000004d11d170 sqlzRunEDU sqlz_edu_obj.C:371
* (18) 0xc00000004c7545c0 sqloEDUEntry
* (19) 0xc0000000000fb220 pthread_bound_body /ux/core/libs/threadslibs/src/common/pthreads/pthread.c:4851
* </StackTrace>
*
* Sadly, both of those are waiting on two different SQLO_LT_sqlrr_curr_activity_cb__sqlrr_curr_activity_cb_latch latches. Those latches are held by 22345.461.000.stack.txt & 22345.461.000.stack.txt, and both of those are in allocFastWriterRecords waiting for fast writer records.
*
* The other 4 fast writers (22345.95.000.stack.txt 22345.96.000.stack.txt 22345.99.000.stack.txt 22345.100.000.stack.txt) are performing an INSERT and are stuck in sqlbVerifyAndLatchPage. I believe they're stuck because the bufferpool pages from the above fast writers haven't been committed and so cannot be ejected from the bufferpool.
*
* <StackTrace>
* (0) 0xc000000027da3120 ossDumpStackTrace
* (1) 0xc000000027d9ba80 OSSTrapFile ossstack.C:1971
* (2) 0xc000000051ff02d0 sqlo_trce sqlodump.C:1595
* (3) 0xc0000000520c6370 sqloDumpDiagInfoHandler sqloedu.C:5085
* (4) 0xe0000001205c6f80 ---- Signal 16 (SIGUSR1) delivered ----
* (5) 0xc00000000041abf0 _semop_sys
* (6) 0xc00000000042c560 _semop ../../../../../core/libs/libc/shared_em_64_perf/../core/syscalls/t_semop.c:19
* (7) 0xc00000004c8c4390 SQLO_SLATCH_CAS::getConflictComplex
* (8) 0xc00000004c8267a0 SQLO_SLATCH_CAS::getConflictEm
* (9) 0xc00000004cb17580 sqloltch_notrack::sqlo_latch /view/db2_v97fp4_hpipf64_s110330/vbs/engn/include/sqloLatch.h:234
* (10) 0xc00000004cecde80 sqlbVerifyAndLatchPage /view/db2_v97fp4_hpipf64_s110330/vbs/engn/sqb/inc/sqlbslat.h:417
* (11) 0xc00000004cb7dfa0 sqlbfix
* (12) 0xc00000004c862c60 sqldGetPageForAppend
* (13) 0xc00000004ce01e70 sqldInsertRow
* (14) 0xc00000004cd9e9a0 sqldRowInsert sqldmisr.C:825
* (15) 0xc00000004ce4a750 sqlrinsr sqlri_aux.C:729
* (16) 0xc00000004fb59360 sqmRecordTypeArray::processRecord /view/db2_v97fp4_hpipf64_s110330/vbs/engn/include/sqlm_fast_writer_target_inlines.h:481
* (17) 0xc00000004fbccc50 sqmFastWriter::processWriterRecord sqlm_fast_writer.C:365
* (18) 0xc00000004c5e8100 sqmFastWriter::executeFastWriter
* (19) 0xc00000004c5ed0e0 sqleIndCoordProcessRequest
* (20) 0xc00000004c6fbef0 sqeAgent::RunEDU sqle_agent.C:1237
* (21) 0xc00000004d11d170 sqlzRunEDU sqlz_edu_obj.C:371
* (22) 0xc00000004c7545c0 sqloEDUEntry
* (23) 0xc0000000000fb220 __pthread_bound_body /ux/core/libs/threadslibs/src/common/pthreads/pthread.c:4851
* </StackTrace>
****************************************************************
* RECOMMENDATION:
* Upgrade to Version 9.7 Fix Pack 5
****************************************************************
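When comparing several stack files like the ones quoted in the summary, it can help to skip the diagnostic frames and look only at what each EDU was doing when the dump signal arrived. The sketch below is a hypothetical helper, not an IBM tool: it assumes one frame per line in the form `(n) 0xaddress function [source]`, and it treats the "Signal ... delivered" frame as the boundary between the dump handler and the interrupted work. The sample frames are copied from the first trace in this record.

```python
import re
from typing import List

# Assumed frame layout: "(index) 0xaddress rest-of-line"
FRAME_RE = re.compile(r"^\((\d+)\)\s+(0x[0-9a-f]+)\s+(.*)$")


def frames_after_signal(trace: str) -> List[str]:
    """Return function names of the frames that follow the signal marker."""
    names: List[str] = []
    after_signal = False
    for line in trace.splitlines():
        match = FRAME_RE.match(line.strip())
        if not match:
            continue  # ignore anything that does not look like a frame
        rest = match.group(3)
        if "Signal" in rest and "delivered" in rest:
            after_signal = True  # frames before this belong to the dump handler
            continue
        if after_signal:
            names.append(rest.split()[0])  # first token is the function name
    return names


SAMPLE = """\
(3) 0xc0000000520c6370 sqloDumpDiagInfoHandler
(4) 0xe0000001205c6f80 ---- Signal 16 (SIGUSR1) delivered ----
(5) 0xc0000000012f21d0 sched_yield
(6) 0xc00000004c8b8ec0 sqloSpinLockConflict
(7) 0xc00000004cd50df0 sqlplrm
"""

print(frames_after_signal(SAMPLE))
```

For the sample this lists `sched_yield`, `sqloSpinLockConflict` and `sqlplrm`, which is the spin-lock wait described for the committing fast writer.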
Local Fix: | |
available fix packs: | |
DB2 Version 9.7 Fix Pack 5 for Linux, UNIX, and Windows | |
Solution | |
Problem was first fixed in Version 9.7 Fix Pack 5 | |
Workaround | |
not known / see Local fix | |
Timestamps | |
Date - problem reported : | 13.07.2011 |
Date - problem closed : | 21.12.2011 |
Date - last modified : | 21.12.2011 |
Problem solved at the following versions (IBM BugInfos) | |
9.7.FP5 | |
Problem solved according to the fixlist(s) of the following version(s) | |
9.7.0.5 |