Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2022/03/08 16:46:00 UTC
[jira] [Updated] (ARROW-14787) [Python] read an HDFS file by line failed when the open_mode is "rb"
[ https://issues.apache.org/jira/browse/ARROW-14787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joris Van den Bossche updated ARROW-14787:
------------------------------------------
Summary: [Python] read an HDFS file by line failed when the open_mode is "rb" (was: read an HDFS file by line failed when the open_mode is "rb")
> [Python] read an HDFS file by line failed when the open_mode is "rb"
> --------------------------------------------------------------------
>
> Key: ARROW-14787
> URL: https://issues.apache.org/jira/browse/ARROW-14787
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 6.0.0
> Environment: System: Ubuntu 18.04
> fsspec: 2021.10.1
> pyarrow: 6.0.0
> Reporter: nero
> Priority: Major
>
> Hi there,
> I ran into a problem when using *fsspec* to read an HDFS file line by line with the open mode "rb". It works fine when the open mode is "r" or when the file is stored locally.
> A snippet that reproduces it:
> {code:python}
> import fsspec
>
> hdfs_file_path = "hdfs://xxxxxx"
> with fsspec.open(hdfs_file_path, "rb") as f:
>     # raises UnsupportedOperation
>     f.readline()
> {code}
>
> Error logs:
>
> /opt/conda/lib/python3.7/site-packages/pyarrow/io.pxi in pyarrow.lib.NativeFile.readline()
> UnsupportedOperation:
> Originally from: https://github.com/fsspec/filesystem_spec/issues/810
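A possible workaround sketch, not part of the original report: if only readline() is unsupported on the binary handle while read() still works, lines can be split by hand from fixed-size chunks. The names iter_lines and NoReadline are hypothetical. NoReadline merely imitates the failure with an in-memory stream and is not pyarrow's NativeFile, so whether this helps on a real HDFS file is an assumption.

```python
import io


def iter_lines(f, chunk_size=64 * 1024):
    """Yield lines from a binary file-like object using only read()."""
    pending = b""
    while True:
        chunk = f.read(chunk_size)
        if not chunk:
            break
        pending += chunk
        # Everything before the last b"\n" is complete; the tail may be partial.
        *complete, pending = pending.split(b"\n")
        for line in complete:
            yield line + b"\n"
    if pending:
        # Final line without a trailing newline.
        yield pending


class NoReadline(io.BytesIO):
    """Hypothetical stand-in for a stream whose readline() is unsupported."""

    def readline(self, *args):
        raise io.UnsupportedOperation("readline")


stream = NoReadline(b"first\nsecond\nlast")
lines = list(iter_lines(stream, chunk_size=4))
```

With a real file, the same helper would be called as iter_lines(f) inside the fsspec.open(..., "rb") block, in place of f.readline().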