You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "cheng pan (Jira)" <ji...@apache.org> on 2021/08/03 12:07:00 UTC

[jira] [Created] (ARROW-13535) connect hdfs error

cheng pan created ARROW-13535:
---------------------------------

             Summary: connect hdfs error
                 Key: ARROW-13535
                 URL: https://issues.apache.org/jira/browse/ARROW-13535
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 5.0.0
            Reporter: cheng pan


when i use pyarrow to connect my hdfs, I meet error 

I use 

from pyarrow import fs
print(fs.FileSystem.from_uri("hdfs://"))

the error shows loadFileSystems error:
(unable to get root cause for java.lang.NoClassDefFoundError)
(unable to get stack trace for java.lang.NoClassDefFoundError)
hdfsBuilderConnect(forceNewInstance=1, nn=hdfs://, port=0, kerbTicketCachePath=(NULL), userName=(NULL)) error:
(unable to get root cause for java.lang.NoClassDefFoundError)
(unable to get stack trace for java.lang.NoClassDefFoundError)
/arrow/cpp/src/arrow/filesystem/hdfs.cc:51: Failed to disconnect hdfs client: IOError: HDFS hdfsFS::Disconnect failed, errno: 9 (Bad file descriptor)
Traceback (most recent call last):
 File "/home/tdops/fucheng.pan/ray-code/read.py", line 15, in <module>
 print(fs.FileSystem.from_uri("hdfs:"))
 File "pyarrow/_fs.pyx", line 347, in pyarrow._fs.FileSystem.from_uri
 File "pyarrow/error.pxi", line 122, in pyarrow.lib.pyarrow_internal_check_status
 File "pyarrow/error.pxi", line 99, in pyarrow.lib.check_status
OSError: HDFS connection failed



--
This message was sent by Atlassian Jira
(v8.3.4#803005)