You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hawq.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/04/07 10:36:25 UTC

[jira] [Commented] (HAWQ-635) QE process does not exit in libhdfs

    [ https://issues.apache.org/jira/browse/HAWQ-635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15229915#comment-15229915 ] 

ASF GitHub Bot commented on HAWQ-635:
-------------------------------------

GitHub user liming01 opened a pull request:

    https://github.com/apache/incubator-hawq/pull/564

    HAWQ-635. QE process does not exit in libhdfs

    1) The problem is caused by the wrong refs in RpcChannelImpl class, so it run a dead loop when process exiting. I suspect it is called by the exception thrown by RpcClientImpl::getChannel(), which already addRef(), but doesn't call close() when exception occurs.
    
    2) This problem cannot be fixed by SIGNAL, because the clean up process is already called when dead loop occurs.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/liming01/incubator-hawq mli/process_not_exit_libhdfs

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-hawq/pull/564.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #564
    
----
commit 3156d8a0fa4197ab154ac3fbc3f6f42aa1715dbf
Author: Ming LI <ml...@pivotal.io>
Date:   2016-04-07T08:32:39Z

    HAWQ-635. QE process does not exit in libhdfs

----


> QE process does not exit in libhdfs
> -----------------------------------
>
>                 Key: HAWQ-635
>                 URL: https://issues.apache.org/jira/browse/HAWQ-635
>             Project: Apache HAWQ
>          Issue Type: Bug
>            Reporter: Ming LI
>            Assignee: Lei Chang
>
> The QE process cannot exit. 
> The calling stack is:
> [gpadmin@sdw3 ~]$ pstack 489333
> #0  0x00000033f560ef3d in nanosleep () from /lib64/libpthread.so.0
> #1  0x00007ff75309c74a in boost::this_thread::hiden::sleep_for(timespec const&) () from /data/pulse-agent-data/HAWQ-main-FeatureTest-opt-sanity/product/hawq/./lib/libboost_thread.so.1.53.0
> #2  0x00007ff755b850b8 in Hdfs::Internal::RpcChannelImpl::waitForExit() () from /data/pulse-agent-data/HAWQ-main-FeatureTest-opt-sanity/product/hawq/./lib/libhdfs3.so.1
> #3  0x00007ff755b97eff in Hdfs::Internal::RpcClientImpl::close() () from /data/pulse-agent-data/HAWQ-main-FeatureTest-opt-sanity/product/hawq/./lib/libhdfs3.so.1
> #4  0x00007ff755b98094 in Hdfs::Internal::RpcClientImpl::~RpcClientImpl() () from /data/pulse-agent-data/HAWQ-main-FeatureTest-opt-sanity/product/hawq/./lib/libhdfs3.so.1
> #5  0x0000000000540c59 in boost::detail::shared_count::~shared_count() ()
> #6  0x00000033f52361bd in __cxa_finalize () from /lib64/libc.so.6
> #7  0x00007ff755b04456 in __do_global_dtors_aux () from /data/pulse-agent-data/HAWQ-main-FeatureTest-opt-sanity/product/hawq/./lib/libhdfs3.so.1
> #8  0x0000000000000000 in ?? ()



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)