Posted to issues@hawq.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/02/01 04:07:39 UTC

[jira] [Commented] (HAWQ-373) QE coredump in pg_encoding_mbcliplen when run concurrent copy in stress

    [ https://issues.apache.org/jira/browse/HAWQ-373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15125669#comment-15125669 ] 

ASF GitHub Bot commented on HAWQ-373:
-------------------------------------

GitHub user liming01 opened a pull request:

    https://github.com/apache/incubator-hawq/pull/309

    HAWQ-373: Fixed QE coredump in pg_encoding_mbcliplen when run concurrent copy in stress
    
    Root cause:
    copy_in_error_callback() must account for the possibility that the data being processed is incomplete, because the master may quit at any time when an error occurs.
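
The root cause above can be illustrated with a minimal, hedged sketch in C. It is not the code from the pull request: DemoCopyState, its line_complete flag, and demo_format_context() are hypothetical names invented for illustration, and the real HAWQ structures and checks may differ. The idea shown is only that an error callback should verify its buffer before clipping and printing it, instead of trusting a pointer and length that may be garbage when the input stream ended early.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical, simplified stand-in for the COPY state (not HAWQ's real struct). */
typedef struct DemoCopyState
{
    const char *line_buf;      /* current input line, may be NULL */
    size_t      line_len;      /* number of valid bytes in line_buf */
    bool        line_complete; /* assumed flag: set only after a whole line was read */
} DemoCopyState;

#define DEMO_MAX_PRINTOUT 100  /* mirrors limit=100 seen in the backtrace */

/*
 * Build an error-context message for the current line.
 * Returns false and writes a generic message when the state cannot be
 * trusted, so the buffer is never dereferenced in that case.
 * Note: this clips by bytes; real code would clip on multibyte
 * character boundaries.
 */
static bool
demo_format_context(const DemoCopyState *cs, char *out, size_t outsz)
{
    size_t n;

    if (cs == NULL || cs->line_buf == NULL || !cs->line_complete)
    {
        snprintf(out, outsz, "COPY: input data incomplete");
        return false;
    }

    n = cs->line_len < DEMO_MAX_PRINTOUT ? cs->line_len : DEMO_MAX_PRINTOUT;
    snprintf(out, outsz, "COPY, line: \"%.*s\"", (int) n, cs->line_buf);
    return true;
}
```

With a state like the one in the backtrace (an invalid buffer and an implausible length), this sketch falls back to the generic message rather than reading from the buffer. How the real callback detects incomplete data is an assumption here; the flag is only one possible way to do it.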

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/liming01/incubator-hawq mli/copy_crash_at_dispatcher

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-hawq/pull/309.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #309
    
----
commit 8f01bbf2add71028f3253a5f0df13bca5321e3b4
Author: Ming LI <ml...@pivotal.io>
Date:   2016-02-01T03:03:48Z

    HAWQ-373: Fixed QE coredump in pg_encoding_mbcliplen when run concurrent copy in stress

----


> QE coredump in pg_encoding_mbcliplen when run concurrent copy in stress 
> ------------------------------------------------------------------------
>
>                 Key: HAWQ-373
>                 URL: https://issues.apache.org/jira/browse/HAWQ-373
>             Project: Apache HAWQ
>          Issue Type: Bug
>          Components: Dispatcher
>            Reporter: Ming LI
>            Assignee: Ming LI
>
> When we run a stress test with concurrent TPC-H, insert, and copy queries, the QE core dumps:
> (gdb) bt
> #0  0x0000003e2440f5db in raise () from /lib64/libpthread.so.0
> #1  0x0000000000873412 in SafeHandlerForSegvBusIll (processName=<value optimized out>, postgres_signal_arg=11) at elog.c:4515
> #2  <signal handler called>
> #3  0x0000000000888df0 in pg_encoding_mbcliplen (mbstr=0x1 <Address 0x1 out of bounds>, len=50043862, limit=100) at mbutils.c:983
> #4  pg_mbcliplen (mbstr=0x1 <Address 0x1 out of bounds>, len=50043862, limit=100) at mbutils.c:962
> #5  0x00000000005e881d in limit_printout_length (arg=<value optimized out>) at copy.c:6322
> #6  copy_in_error_callback (arg=<value optimized out>) at copy.c:6278
> #7  0x0000000000876c37 in errfinish (dummy=<value optimized out>) at elog.c:535
> #8  0x00000000006b42f2 in pq_getmessage (s=<value optimized out>, maxlen=<value optimized out>) at pqcomm.c:1181
> #9  0x00000000005e82e2 in CopyGetData (cstate=0x31cdc00, databuf=0x31d1d4d, datasize=<value optimized out>) at copy.c:629
> #10 0x00000000005f0e22 in CopyFrom (cstate=0x0) at copy.c:3898
> #11 0x00000000005f4c27 in DoCopy (stmt=0x30c08c8, queryString=<value optimized out>) at copy.c:1644
> #12 0x00000000007be437 in ProcessUtility (parsetree=0x30c08c8, queryString=<value optimized out>, params=0x0, isTopLevel=1 '\001',
>     dest=0x7fba60f556b0, completionTag=0x7fff37788ed0 "") at utility.c:1091
> #13 0x00000000007b909a in PortalRunUtility (portal=0x30dbc40, utilityStmt=0x30c08c8, isTopLevel=42 '*', dest=0x7fba60f556b0,
>     completionTag=0x7fff37788ed0 "") at pquery.c:1887
> #14 0x00000000007bb48e in PortalRunMulti (portal=0x30dbc40, isTopLevel=1 '\001', dest=0x7fba60f556b0, altdest=<value optimized out>,
>     completionTag=0x7fff37788ed0 "") at pquery.c:1997
> #15 0x00000000007bba52 in PortalRun (portal=<value optimized out>, count=<value optimized out>, isTopLevel=-42 '\326',
>     dest=<value optimized out>, altdest=<value optimized out>, completionTag=<value optimized out>) at pquery.c:1514
> #16 0x00000000007b7f30 in exec_mpp_query (argc=<value optimized out>, argv=<value optimized out>, username=<value optimized out>)
>     at postgres.c:1404
> #17 PostgresMain (argc=<value optimized out>, argv=<value optimized out>, username=<value optimized out>) at postgres.c:4886
> #18 0x0000000000769943 in BackendRun (port=0x2fa2750) at postmaster.c:5875
> #19 BackendStartup (port=0x2fa2750) at postmaster.c:5468
> #20 0x000000000076a0ad in ServerLoop () at postmaster.c:2147
> #21 0x000000000076bebe in PostmasterMain (argc=9, argv=0x2fb85b0) at postmaster.c:1439
> #22 0x00000000006c5f3a in main (argc=9, argv=0x2fb8570) at main.c:226


