DB2 - Problem description
Problem IT19462 | Status: Closed
WINDOWS EVENT HANDLE LEAK WITH PARALLELIZED QUERY PLANS (INTRA_PARALLEL YES) - MAY CAUSE OSERR 1450
Product:
DB2 FOR LUW / DB2FORLUW / B10 - DB2
Problem description:
DB2 on Windows leaks event handles when parallelized plans are executed with INTRA_PARALLEL YES set, i.e. when many subagents are spawned. On Windows the event handles can be monitored with the Task Manager or the "handle" utility from Microsoft Sysinternals. Running the handle tool against db2sysc shows the number of Event handles growing over time:

G:\cds>handle -s -p 6584

Nthandle v4.1 - Handle viewer
Copyright (C) 1997-2016 Mark Russinovich
Sysinternals - www.sysinternals.com

Handle type summary:
  ALPC Port           : 5
  Desktop             : 1
  Directory           : 3
  EtwRegistration     : 56
  Event               : 1025591 <<<<
  File                : 368
  IoCompletion        : 2
  IRTimer             : 2
  Key                 : 44
  Mutant              : 260
  Process             : 3
  Section             : 12
  Semaphore           : 215
  Thread              : 455
  Token               : 2
  TpWorkerFactory     : 1
  WaitCompletionPacket: 3
  WindowStation       : 2
Total handles: 1027025

This can go undetected for a while, until the Windows kernel limit on event handles is reached. At that point you will most likely see OSERR 1450 logged in db2diag.log. Once that happens DB2 misbehaves, for example with hangs, SQL1034C errors and/or db2diag.log entries like the following:

- SQLO_NORES during read operations

2017-01-15-16.13.51.394000+060 I10890739F686          LEVEL: Severe
PID     : 8896                 TID : 5896             PROC : db2syscs.exe
INSTANCE: db2inst1             NODE : 000             DB   : SAMPLE
APPHDL  : 0-24450              APPID: ::1.53330.170115150048
AUTHID  : SAPSR3               HOSTNAME: localhost
EDUID   : 5896                 EDUNAME: db2agent (SAMPLE) 0
FUNCTION: DB2 UDB, buffer pool services, sqlbReadPage, probe:1140
MESSAGE : ZRC=0x870F00F2=-2029059854=SQLO_NORES
          "no resources to create process or thread"
DATA #1 : <preformatted>
Failed to read page from disk on attempt number 1. Retrying operation. Only subsequent failures will be logged.

- 1450 error logged by sqloInitIPCWaitPost()

2017-01-15-16.13.54.113000+060 E10946809F621          LEVEL: Error (OS)
PID     : 8896                 TID : 9260             PROC : db2syscs.exe
INSTANCE: db2inst1             NODE : 000             DB   : SAMPLE
APPHDL  : 0-24668              APPID: ::1.53720.170115151322
AUTHID  : SAPSR3               HOSTNAME: localhost
EDUID   : 9260                 EDUNAME: db2agent (SAMPLE) 0
FUNCTION: DB2 UDB, oper system services, sqloInitIPCWaitPost, probe:20
MESSAGE : ZRC=0x830005AA=-2097150550
CALLED  : OS, -, CreateEvent
OSERR   : 1450 "Insufficient system resources exist to complete the requested service."

- 1450 error logged by db2agentX threads (parallel sort):

2017-01-15-16.13.52.878000+060 I10926483F924          LEVEL: Severe
PID     : 8896                 TID : 9468             PROC : db2syscs.exe
INSTANCE: db2inst1             NODE : 000             DB   : SAMPLE
APPHDL  : 0-24652              APPID: ::1.53710.170115151301
AUTHID  : SAPSR3               HOSTNAME: localhost
EDUID   : 9468                 EDUNAME: db2agnts (SAMPLE) 0
FUNCTION: DB2 UDB, relation data serv, sqlrr_dump_ffdc, probe:250
MESSAGE : ZRC=0x830005AA=-2097150550
DATA #1 : SQLCA, PD_DB2_TYPE_SQLCA, 136 bytes
 sqlcaid : SQLCA     sqlcabc: 136   sqlcode: -901   sqlerrml: 4
 sqlerrmc: 1450
 sqlerrp : SQLRI14A
 sqlerrd : (1) 0x830005AA      (2) 0x000005AA      (3) 0x00000000
           (4) 0x00000000      (5) 0xFFFFFD09      (6) 0x00000000
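To catch the leak before the kernel limit is reached, the Event count in the "handle -s" summary can be checked periodically. The following is a minimal sketch in Python, not part of the APAR: it parses the summary text shown above and compares the Event count against a threshold. The function names and the threshold of 100000 are hypothetical choices for illustration; pick a threshold from the normal handle counts of your own instance.

```python
import re
import subprocess

# Matches summary lines such as "Event : 1025591 <<<<" or "ALPC Port : 5".
_SUMMARY_LINE = re.compile(r"^\s*([A-Za-z][A-Za-z ]*?)\s*:\s*(\d+)\b")

# Hypothetical alert threshold; not a value documented by IBM or Microsoft.
EVENT_THRESHOLD = 100_000


def parse_handle_summary(text):
    """Return {handle type: count} from the text printed by `handle -s`."""
    counts = {}
    in_summary = False
    for line in text.splitlines():
        if line.strip().startswith("Handle type summary"):
            in_summary = True  # everything before this line is banner text
            continue
        if not in_summary:
            continue
        match = _SUMMARY_LINE.match(line)
        if match:
            counts[match.group(1)] = int(match.group(2))
    return counts


def event_handles_exceed(counts, threshold=EVENT_THRESHOLD):
    """True if the Event handle count has reached the given threshold."""
    return counts.get("Event", 0) >= threshold


def run_handle(pid):
    """Run `handle -s -p <pid>` (Windows only; handle.exe must be on PATH)."""
    result = subprocess.run(
        ["handle", "-s", "-p", str(pid)],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


# Shortened copy of the output quoted in the problem description.
SAMPLE = """\
Nthandle v4.1 - Handle viewer
Copyright (C) 1997-2016 Mark Russinovich

Handle type summary:
  ALPC Port       : 5
  Event           : 1025591 <<<<
  Thread          : 455
Total handles: 1027025
"""

sample_counts = parse_handle_summary(SAMPLE)
```

On a live system, parse_handle_summary(run_handle(pid)) with the PID of db2sysc gives the current counts. A steadily rising Event count while parallelized queries run matches the leak described here.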
Problem Summary:
****************************************************************
* USERS AFFECTED:                                              *
* ALL                                                          *
****************************************************************
* PROBLEM DESCRIPTION:                                         *
* See Error Description                                        *
****************************************************************
* RECOMMENDATION:                                              *
* Upgrade to DB2 V11.1m1fp2                                    *
****************************************************************
Local Fix:
Set the database manager configuration parameter INTRA_PARALLEL to NO, e.g. db2 update dbm cfg using INTRA_PARALLEL NO
Available fix packs:
DB2 Version 11.1 Mod1 Fix Pack1 iFix001 for Linux, UNIX, and Windows
Solution
Upgrade to DB2 V11.1m1fp2
Workaround
Not known / see Local Fix
Timestamps
Date - problem reported : 28.02.2017
Date - problem closed : 19.04.2017
Date - last modified : 19.04.2017
Problem solved at the following versions (IBM BugInfos) | |
Problem solved according to the fixlist(s) of the following version(s) |