본문 바로가기

Project/pureScale Demo 구축

[2011-08-15] DB Crash 현상에 대한 LAB의 답변

IBM Lab 에서 답변이 왔다.
결론은 Udapl 을 업그레이드 하라는 내용이다

아래는 Lab에서 보내온 결과이다.
 

분석 결과 발생한 현상은 AIX defect 787216 유사합니다. AIX level udapl.rte.7.1.0.15 올릴것을 권고해 드립니다.

물론 AIX level 6.1이라고 하셨지만, defect 대한 check 주십시오.

 

LAB 답변 참고하십시오.

-----------------------------------------------------------------

Reviewed the db2diag.log.

 

2011-08-10-10.50.43.929840+540 E315407A1206 LEVEL: Critical

PID : 598200 TID : 19534 PROC : db2sysc 0

INSTANCE: db2sdin1 NODE : 000

APPHDL : 0-52 APPID: *N0.db2sdin1.110810015041

AUTHID : DB2SDIN1

EDUID : 19534 EDUNAME: db2agntp 0

FUNCTION: DB2 UDB, oper system services, sqloEDUCodeTrapHandler,

probe:90

MESSAGE : ADM14011C A critical failure has caused the following type of

error:

"Trap". The DB2 database manager cannot recover from the

failure.

First Occurrence Data Capture (FODC) was invoked in the

following

mode: "Automatic". FODC diagnostic information is located in

the

following directory:

 

"/db2/db2sdin1/sqllib/db2dump/FODC_Trap_2011-08-10-10.50.43.614265/".

DATA #1 : Signal Number Recieved, 4 bytes

5

DATA #2 : Siginfo, 64 bytes

0x070000003BFC9430 : 0000 0005 0000 0000 0000 0008 0000 0000

.................

0x070000003BFC9440 : 0000 0000 0000 0000 0900 0000 1042 2EF8

..............B..

0x070000003BFC9450 : 0000 0000 0000 0000 0000 0000 0000 0000

.................

0x070000003BFC9460 : 0000 0000 0000 0000 0000 0000 0000 0000

.................

 

Trap shows

 

0x0900000010422EF8 GxlQpPostSend + 0x418

0x0900000010408CA0 IbQpPostSend + 0x780

0x09000000103ACF0C dapls_ib_post_send + 0x24C

0x09000000103AA9F4 dapl_ep_post_send_req + 0x194

0x09000000103C0BA4 dapl_ep_post_rdma_write + 0x84

0x0900000010387788 dat_ep_post_rdma_write + 0xA8

0x090000001042CA30 cmd_send + 0xBD0

0x090000000DD6AA64 readlsc + 0x104

0x090000000DD6EF3C list_get_status + 0x1C

0x090000000DD864B8 CAGetStatus + 0x7D8

0x09000000074EECC8

sqleCaCeStructureStatus__23SQLE_CA_CONN_ENTRY_DATAFPv27SAL_STRUCTURE_STA

TUS_ACTIONP20SAL_STRUCTURE_STATUS + 0x104

0x09000000074EE9F4

SAL_StructureStatus__20SAL_CA_STRUCT_HANDLEFCUiRUlPPcC27SAL_STRUCTURE_ST

ATUS_ACTION + 0x260

0x0900000006195A10

SAL_IsStructInPeer__20SAL_CA_STRUCT_HANDLEFCUiPPcPbRUlCb + 0xB4

0x0900000006194D04 SAL_IsSAInPeer__13SAL_SA_HANDLEFCUiPbRUlCb + 0x24

0x090000000637EB24 SAL_IsSAInPeer__13SAL_SA_HANDLEFCUiPbRUlCb@glue2BB +

0x7C

0x090000000637E228

SAL_ConnectStruct__13SAL_SA_HANDLEFC24SAL_ENCODED_CA_INDEX_SETCUiCUl +

0x414

0x0900000006F48FB8

SAL_OpenSAHandle__13SAL_SA_HANDLEFPP13SAL_SA_HANDLEP16sqeLocalDatabaseCU

l + 0x24C

0x0900000006F48CF8 SA_HANDLE_Initialize__9SA_HANDLEFCP16sqeLocalDatabase

+ 0x38

0x0900000006F48C5C sqleSmartArrayInit + 0x88

0x0900000006F49E34

@78@sqledint__FP8sqeAgentP16sqeLocalDatabaseP5sqlcacPciPb + 0x3C8

0x0900000006F353E0

FirstConnect__16sqeLocalDatabaseFP8SQLE_BWARcP8sqeAgentP8sqlo_gmtiT5Pb +

0x1560

0x090000000704B6C8

StartUsingLocalDatabase__8sqeDBMgrFP8SQLE_BWAP8sqeAgentRccP8sqlo_gmtPb +

0x3C8

0x0900000007047BDC

AppStartUsing__14sqeApplicationFP8SQLE_BWAP8sqeAgentcT3P5sqlcaPc + 0x514

0x0900000007F43010

@78@sqleStartDb__FsP8SQLE_BWAP10sqledbdescP13sqledbdescextT1PcT2iT1lUl +

0x6A0

0x0900000008A5D784

sqleCreateDb__FsP8SQLE_BWAP10sqledbdescP13sqledbdescextPcT5T1N25T2iT1T11

_P13SQLE_CFG_RECSP5sqlca + 0x11F4

 

The above stacks looks similar to AIX Defect 787216. The recommonded AIX level is udapl.rte.7.1.0.15.