You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hawq.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/06/20 08:24:05 UTC

[jira] [Commented] (HAWQ-839) Libyarn coredump when failover to standby RM

    [ https://issues.apache.org/jira/browse/HAWQ-839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15339149#comment-15339149 ] 

ASF GitHub Bot commented on HAWQ-839:
-------------------------------------

GitHub user linwen opened a pull request:

    https://github.com/apache/incubator-hawq/pull/733

    HAWQ-839. Fix libyarn coredump when failover to standby RM

    Please review, Thanks! 

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/linwen/incubator-hawq hawq-839

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-hawq/pull/733.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #733
    
----
commit 117d669f7bbaad2a0613129f1cefa89b42310975
Author: Wen Lin <wl...@pivotal.io>
Date:   2016-06-20T08:15:30Z

    HAWQ-839. Fix libyarn coredump when failover to standby RM

----


> Libyarn coredump when failover to standby RM
> --------------------------------------------
>
>                 Key: HAWQ-839
>                 URL: https://issues.apache.org/jira/browse/HAWQ-839
>             Project: Apache HAWQ
>          Issue Type: Bug
>          Components: libyarn
>            Reporter: Lin Wen
>            Assignee: Lin Wen
>
> Start hawq with yarn mode and kill Hadoop Yarn resource manager, coredump happens, the stack is below: 
> #0  0x0000003e054325e5 in raise () from /lib64/libc.so.6
> #1  0x0000003e05433dc5 in abort () from /lib64/libc.so.6
> #2  0x00007f04980b1109 in libyarn::HandleYarnFailoverException (e=...)
>     at /home/gpadmin/workspace/hawq/incubator-hawq/depends/libyarn/src/libyarnclient/ApplicationClient.cpp:170
> #3  0x00007f04980b3211 in libyarn::ApplicationClient::getNewApplication (this=0x1f17cd0)
>     at /home/gpadmin/workspace/hawq/incubator-hawq/depends/libyarn/src/libyarnclient/ApplicationClient.cpp:215
> #4  0x00007f049809d639 in libyarn::LibYarnClient::createJob (this=0x1f1e500, jobName="hawq", queue="default",
>     jobId="")
>     at /home/gpadmin/workspace/hawq/incubator-hawq/depends/libyarn/src/libyarnclient/LibYarnClient.cpp:163
> #5  0x00007f04980987b8 in createJob (client=0x1f25950, jobName=Unhandled dwarf expression opcode 0xf3
> )
>     at /home/gpadmin/workspace/hawq/incubator-hawq/depends/libyarn/src/libyarnclient/LibYarnClientC.cpp:61
> #6  createJob (client=0x1f25950, jobName=Unhandled dwarf expression opcode 0xf3
> )
>     at /home/gpadmin/workspace/hawq/incubator-hawq/depends/libyarn/src/libyarnclient/LibYarnClientC.cpp:180
> #7  0x00000000008e1117 in RB2YARN_registerYARNApplication ()
> #8  0x00000000008e31ad in RB2YARN_initializeConnection ()
> #9  0x00000000008e358b in ResBrokerMainInternal ()
> #10 0x00000000008e38e8 in ResBrokerMain ()
> #11 0x00000000008dfb66 in RB_LIBYARN_start ()
> #12 0x000000000090ae5e in MainHandlerLoop ()
> #13 0x000000000090b46a in ResManagerMainServer2ndPhase ()
> #14 0x000000000090ba14 in ResManagerMain ()
> #15 0x000000000090bd71 in ResManagerProcessStartup ()
> #16 0x0000000000767f98 in CommenceNormalOperations ()
> #17 0x0000000000768d44 in do_reaper ()
> #18 0x000000000076dbed in ServerLoop ()
> #19 0x000000000076f73e in PostmasterMain ()
> #20 0x00000000006c828a in main ()



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)