Posted to issues@trafodion.apache.org by "Alice Chen (JIRA)" <ji...@apache.org> on 2015/07/22 20:16:01 UTC

[jira] [Created] (TRAFODION-431) LP Bug: 1339912 - select count(*)from hive table may result in MXOSRVR core

Alice Chen created TRAFODION-431:
------------------------------------

             Summary: LP Bug: 1339912 - select count(*)from hive table may result in MXOSRVR core
                 Key: TRAFODION-431
                 URL: https://issues.apache.org/jira/browse/TRAFODION-431
             Project: Apache Trafodion
          Issue Type: Bug
          Components: sql-exe
            Reporter: Julie Thai
            Assignee: Selvaganesan Govindarajan
            Priority: Critical


A select count(*) from a Hive table intermittently fails with ERROR[1], and an MXOSRVR core file is produced.
Encountered on a 4-node Cloudera cluster, build 0707_0230.

SQL>select count(*) from hive.hive.lineitem;
*** ERROR[1] The message id: problem_with_server_read
*** ERROR[1] The message id: header_not_long_enough
*** ERROR[1] The message id: problem_with_server_read
*** ERROR[1] The message id: header_not_long_enough

From the MXOSRVR core file:
Core was generated by `mxosrvr -ZkHost rhel-cdh1:2181,rhel-cdh2:2181,rhel-cdh3:2181 -RZ rhel-cdh4.hpl.'.
Program terminated with signal 6, Aborted.
#0  0x00000039b5e32925 in raise () from /lib64/libc.so.6
Missing separate debuginfos, use: debuginfo-install boost-filesystem-1.41.0-18.el6.x86_64 boost-program-options-1.41.0-18.el6.x86_64 boost-system-1.41.0-18.el6.x86_64 cyrus-sasl-lib-2.1.23-13.el6_3.1.x86_64 glibc-2.12-1.132.el6.x86_64 java-1.7.0-openjdk-1.7.0.55-2.4.7.1.el6_5.x86_64 keyutils-libs-1.4-4.el6.x86_64 krb5-libs-1.10.3-15.el6_5.1.x86_64 libcom_err-1.41.12-18.el6.x86_64 libgcc-4.4.7-4.el6.x86_64 libselinux-2.0.94-5.3.el6_4.1.x86_64 libstdc++-4.4.7-4.el6.x86_64 libuuid-2.17.2-12.14.el6.x86_64 nspr-4.10.2-1.el6_5.x86_64 nss-3.15.3-6.el6_5.x86_64 nss-softokn-freebl-3.14.3-10.el6_5.x86_64 nss-util-3.15.3-1.el6_5.x86_64 openldap-2.4.23-34.el6_5.1.x86_64 openssl-1.0.1e-16.el6_5.7.x86_64 qpid-cpp-client-0.14-22.el6_3.x86_64 zlib-1.2.3-29.el6.x86_64
(gdb) where
#0  0x00000039b5e32925 in raise () from /lib64/libc.so.6
#1  0x00000039b5e34105 in abort () from /lib64/libc.so.6
#2  0x00007f29cb9a38c5 in ?? ()
   from /usr/lib/jvm/java/jre/lib/amd64/server/libjvm.so
#3  0x00007f29cbb1178f in ?? ()
   from /usr/lib/jvm/java/jre/lib/amd64/server/libjvm.so
#4  0x00007f29cb9a8a92 in JVM_handle_linux_signal ()
   from /usr/lib/jvm/java/jre/lib/amd64/server/libjvm.so
#5  <signal handler called>
#6  0x00007f29cb7ccef2 in ?? ()
   from /usr/lib/jvm/java/jre/lib/amd64/server/libjvm.so
#7  0x00007f29cb7e472b in ?? ()
   from /usr/lib/jvm/java/jre/lib/amd64/server/libjvm.so
#8  0x00007f29c9b342f5 in JNIEnv_::CallLongMethod (this=0x2a749d8, 
    obj=0x10500000001, methodID=0x2aeb1a8)
    at /usr/lib/jvm/java-1.7.0-openjdk.x86_64/include/jni.h:1004
#9  0x00007f29c9b455ca in HiveClient_JNI::getRedefTime (this=0x7f29aa1dd258, 
    schName=0x7f29bbaf6a48 "default", tabName=0x7f29bbaf6a18 "LINEITEM", 
    redefTime=@0x7f29bbaf66f0) at ../executor/HBaseClient_JNI.cpp:4283
#10 0x00007f29c9b0e54a in HiveMetaData::validate (this=0x7f29aa56ed18, 
    tableId=0, redefTS=1404844822, schName=0x7f29bbaf6a48 "default", 
    tblName=0x7f29bbaf6a18 "LINEITEM") at ../executor/hiveHook.cpp:777
#11 0x00007f29c4a276be in NATableDB::get (this=0x7f29b9a928e0,
    key=0x7f29aa524ca8, bindWA=0x7f29bbaf8c00, findInCacheOnly=0)
    at ../optimizer/NATable.cpp:6467
#12 0x00007f29c4a2884b in NATableDB::get (this=0x7f29b9a928e0, corrName=..., 
    bindWA=0x7f29bbaf8c00, inTableDescStruct=0x0)
    at ../optimizer/NATable.cpp:6830
#13 0x00007f29c4791332 in BindWA::getNATable (this=0x7f29bbaf8c00, 
    corrName=..., catmanCollectTableUsages=1, inTableDescStruct=0x0)
    at ../optimizer/BindRelExpr.cpp:1443
#14 0x00007f29c479e8a6 in RelRoot::bindNode (this=0x7f29aa525430, 
    bindWA=0x7f29bbaf8c00) at ../optimizer/BindRelExpr.cpp:4601
#15 0x00007f29c62a6f68 in CmpMain::compile (this=0x7f29bbafae30, 
    input_str=0x7f29aa53b610 "select count(*) from hive.hive.lineitem", 
    charset=15, queryExpr=@0x7f29bbafad38, gen_code=0x7f29aa538b90, 
    gen_code_len=0x7f29aa538b88, heap=0x7f29ba615928, phase=CmpMain::END, 
    fragmentDir=0x7f29bbafaf88, op=3004, useQueryCache=1, 
    cacheable=0x7f29bbafad24, begTime=0x7f29bbafad40, shouldLog=0)
    at ../sqlcomp/CmpMain.cpp:1795
#16 0x00007f29c62a5f70 in CmpMain::sqlcomp (this=0x7f29bbafae30, 
    input_str=0x7f29aa53b610 "select count(*) from hive.hive.lineitem", 
    charset=15, queryExpr=@0x7f29bbafad38, gen_code=0x7f29aa538b90, 
    gen_code_len=0x7f29aa538b88, heap=0x7f29ba615928, phase=CmpMain::END, 
    fragmentDir=0x7f29bbafaf88, op=3004, useQueryCache=1, 
    cacheable=0x7f29bbafad24, begTime=0x7f29bbafad40, shouldLog=0)
    at ../sqlcomp/CmpMain.cpp:1462
#17 0x00007f29c62a4321 in CmpMain::sqlcomp (this=0x7f29bbafae30, input=..., 
    gen_code=0x7f29aa538b90, gen_code_len=0x7f29aa538b88, heap=0x7f29ba615928, 
    phase=CmpMain::END, fragmentDir=0x7f29bbafaf88, op=3004)
    at ../sqlcomp/CmpMain.cpp:797
#18 0x00007f29c3b3d9f7 in CmpStatement::process (this=0x7f29aa51af18, 
    sqltext=...) at ../arkcmp/CmpStatement.cpp:486
#19 0x00007f29c3b32481 in CmpContext::compileDirect (this=0x7f29b9a8d090, 
    data=0x7f29ba6318a8 "\200", data_len=168, outHeap=0x7f29bb095660, 
    charset=15, op=CmpMessageObj::SQLTEXT_COMPILE, gen_code=@0x7f29bbafb4a0, 
    gen_code_len=@0x7f29bbafb4ac, parserFlags=4194304, diagsArea=
    0x7f29ba631958) at ../arkcmp/CmpContext.cpp:689
#20 0x00007f29cad36204 in CliStatement::prepare2 (this=0x7f29ba636978, 
    source=0x7f29aa1b3eb8 "select count(*) from hive.hive.lineitem", 
    diagsArea=..., passed_gen_code=0x0, passed_gen_code_len=0, charset=15, 
    unpackTdbs=1, cliFlags=129) at ../cli/Statement.cpp:1797
#21 0x00007f29cad3504a in CliStatement::prepare (this=0x7f29ba636978, 
    source=0x7f29aa1b3eb8 "select count(*) from hive.hive.lineitem", 
    diagsArea=..., passed_gen_code=0x0, passed_gen_code_len=0, charset=15, 
    unpackTdbs=1, cliFlags=129) at ../cli/Statement.cpp:1388
#22 0x00007f29cace99f5 in SQLCLI_Prepare2 (cliGlobals=0x2a51b80, 
    statement_id=0x3ed3e18, sql_source=0x7f29bbafb8e0, gencode_ptr=0x0, 
    gencode_len=0, ret_gencode_len=0x0, query_cost_info=0x3ed4048,
    query_comp_stats_info=0x3ed4098, uniqueStmtId=0x3ed3f80 "", 
    uniqueStmtIdLen=0x3ed3f7c, flags=1) at ../cli/Cli.cpp:5904
#23 0x00007f29cad5528a in SQL_EXEC_Prepare2 (statement_id=0x3ed3e18, 
    sql_source=0x7f29bbafb8e0, gencode_ptr=0x0, gencode_len=0, 
    ret_gencode_len=0x0, query_cost_info=0x3ed4048, comp_stats_info=0x3ed4098, 
    uniqueStmtId=0x3ed3f80 "", uniqueStmtIdLen=0x3ed3f7c, flags=1)
    at ../cli/CliExtern.cpp:4913
#24 0x00007f29cc4cb5a1 in SRVR::WSQL_EXEC_Prepare2 (statement_id=0x3ed3e18, 
    sql_source=0x7f29bbafb8e0, gencode_ptr=0x0, gencode_len=0, 
    ret_gencode_len=0x0, query_cost_info=0x3ed4048, comp_stats_info=0x3ed4098, 
    uniqueQueryId=0x3ed3f80 "", uniqueQueryIdLen=0x3ed3f7c, flags=1)
    at SQLWrapper.cpp:800
#25 0x00007f29cc4b2283 in SRVR::PREPARE2 (pSrvrStmt=0x3ed3800, 
    isFromExecDirect=true) at sqlinterface.cpp:5016
#26 0x00007f29cc509767 in odbc_SQLSvc_Prepare2_sme_ (inputRowCnt=0, 
    sqlStmtType=1, stmtLabel=0x3ed35c8 "SQL_CUR_2", 
    sqlString=0x3ed3594 "select count(*) from hive.hive.lineitem", 
    holdableCursor=0, returnCode=0x7f29bbafd124, 
    sqlWarningOrErrorLength=0x7f29bbafd120, sqlWarningOrError=@0x7f29bbafd118, 
    sqlQueryType=0x7f29bbafd110, stmtHandle=0x7f29bbafd098, 
    estimatedCost=0x7f29bbafd10c, inputDescLength=0x7f29bbafd0bc, 
    inputDesc=@0x7f29bbafd0b0, outputDescLength=0x7f29bbafd0ac, 
    outputDesc=@0x7f29bbafd0a0, isFromExecDirect=true) at srvrothers.cpp:926
#27 0x00000000005a07e6 in odbc_SQLSrvr_ExecDirect_ame_ (objtag_=0x3a24150, 
    call_id_=0x3a241a8, dialogueId=891659981, stmtLabel=0x3ed35c8 "SQL_CUR_2", 
    cursorName=0x0, stmtExplainLabel=0x0, stmtType=0, sqlStmtType=1, 
    sqlString=0x3ed3594 "select count(*) from hive.hive.lineitem", 
    sqlAsyncEnable=0, queryTimeout=0, inputRowCnt=0, txnID=0, holdableCursor=0)
    at SrvrConnect.cpp:6877
#28 0x000000000053c1cb in SQLEXECUTE_IOMessage (objtag_=0x3a24150, 
    call_id_=0x3a241a8, operation_id=3012) at Interface/odbcs_srvr.cpp:1734
#29 0x000000000053c497 in DISPATCH_TCPIPRequest (objtag_=0x3a24150, 
    call_id_=0x3a241a8, operation_id=3012) at Interface/odbcs_srvr.cpp:1799
#30 0x000000000046380e in BUILD_TCPIP_REQUEST (pnode=0x3a24150)
    at ../Common/TCPIPSystemSrvr.cpp:593
#31 0x0000000000463755 in PROCESS_TCPIP_REQUEST (pnode=0x3a24150)
    at ../Common/TCPIPSystemSrvr.cpp:571
#32 0x00000000004d24c4 in CNSKListenerSrvr::CheckTCPIPRequest (this=0x28de220, 
    ipnode=0x3a24150) at Interface/Listener_srvr.cpp:61
#33 0x00000000004e3add in CNSKListenerSrvr::tcpip_listener (arg=0x28de220)
    at Interface/linux/Listener_srvr_ps.cpp:397
#34 0x00007f29cafc1a80 in sb_thread_sthr_disp (pp_arg=0x2a30030)
    at threadl.cpp:253
#35 0x00000039b62079d1 in start_thread () from /lib64/libpthread.so.0
#36 0x00000039b5ee8b6d in clone () from /lib64/libc.so.6

The attached script may intermittently reproduce the error/core.
See README in attachment.


