You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (Jira)" <ji...@apache.org> on 2019/11/05 03:47:00 UTC

[jira] [Commented] (ARROW-7038) [Python] Reading from HDFS after ctrl+c(SIGTERM) causes python hangs

    [ https://issues.apache.org/jira/browse/ARROW-7038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16967210#comment-16967210 ] 

Wes McKinney commented on ARROW-7038:
-------------------------------------

I'm not sure the reason. You're using a pretty old version of pyarrow, you should see if the issue is still present with 0.15.1 (though it probably is). I don't know how the Ctrl-C signal get propagated when the execution is inside the Java libraries

> [Python] Reading from HDFS after ctrl+c(SIGTERM) causes python hangs
> --------------------------------------------------------------------
>
>                 Key: ARROW-7038
>                 URL: https://issues.apache.org/jira/browse/ARROW-7038
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>         Environment: RHEL 7.4
> python 3.6.7
> pyarrow 0.11.1
> hadoop 2.7.2
>            Reporter: Kevin Jung
>            Priority: Major
>
> Python always hangs when it reads the parquet file from hdfs twice after sending ctrl+c.
> To reproduce,
> {code:java}
> import pyarrow.parquet as pq
> pq.read_table('hdfs://localhost:9000/path/to/parquet')
> # Ctrl+C here
> pq.read_table('hdfs://localhost:9000/path/to/parquet')
> # hangs{code}
> There's the weird thing ctrl+c at line 3 doesn't produce KeyboardInterrupt even though using the local file produces it at all.
> For example,
> {code:java}
> import pyarrow.parquet as pq
> pq.read_table('/home/path/to/localfile')
> # doing Ctrl+C here prints out 'KeyboardInterrupt'
> pq.read_table('/home/path/to/localfile')
> # run properly{code}
> Is this a sort of bug from JNI?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)