You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Kevin Jung (Jira)" <ji...@apache.org> on 2019/11/04 00:16:00 UTC

[jira] [Updated] (ARROW-7038) [Python] Reading from HDFS after ctrl+c(SIGTERM) causes python hangs

     [ https://issues.apache.org/jira/browse/ARROW-7038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kevin Jung updated ARROW-7038:
------------------------------
    Description: 
Python always hangs when it reads the parquet file from hdfs twice after sending ctrl+c.

To reproduce,
{code:java}
import pyarrow.parquet as pq
pq.read_table('hdfs://localhost:9000/path/to/parquet')
# Ctrl+C here
pq.read_table('hdfs://localhost:9000/path/to/parquet')
# hangs{code}
There's the weird thing ctrl+c at line 3 doesn't produce KeyboardInterrupt even though using the local file produces it at all.

For example,
{code:java}
import pyarrow.parquet as pq
pq.read_table('/home/path/to/localfile')
# doing Ctrl+C here prints out 'KeyboardInterrupt'
pq.read_table('/home/path/to/localfile')
# run properly{code}
Is this a sort of bug from JNI?

 

  was:
Python always hangs when it reads the parquet file from hdfs twice after sending ctrl+c.

To reproduce,
{code:java}
import pyarrow.parquet as pq
pq.read_table('hdfs://localhost:9000/path/to/parquet')
# Ctrl+C here
pq.read_table('hdfs://localhost:9000/path/to/parquet')
# hangs{code}
There's the weird thing ctrl+c at line 3 doesn't produce KeyboardInterrupt even though using the local file produces it at all.

For example,
{code:java}
import pyarrow.parquet as pq
pq.read_table('/home/brightics/t5y69ppsctftwbdg')
# doing Ctrl+C here prints out 'KeyboardInterrupt'
pq.read_table('/home/brightics/t5y69ppsctftwbdg')
# run properly{code}
Is this a sort of bug from JNI?

 


> [Python] Reading from HDFS after ctrl+c(SIGTERM) causes python hangs
> --------------------------------------------------------------------
>
>                 Key: ARROW-7038
>                 URL: https://issues.apache.org/jira/browse/ARROW-7038
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>         Environment: RHEL 7.4
> python 3.6.7
> pyarrow 0.11.1
> hadoop 2.7.2
>            Reporter: Kevin Jung
>            Priority: Major
>
> Python always hangs when it reads the parquet file from hdfs twice after sending ctrl+c.
> To reproduce,
> {code:java}
> import pyarrow.parquet as pq
> pq.read_table('hdfs://localhost:9000/path/to/parquet')
> # Ctrl+C here
> pq.read_table('hdfs://localhost:9000/path/to/parquet')
> # hangs{code}
> There's the weird thing ctrl+c at line 3 doesn't produce KeyboardInterrupt even though using the local file produces it at all.
> For example,
> {code:java}
> import pyarrow.parquet as pq
> pq.read_table('/home/path/to/localfile')
> # doing Ctrl+C here prints out 'KeyboardInterrupt'
> pq.read_table('/home/path/to/localfile')
> # run properly{code}
> Is this a sort of bug from JNI?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)