You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Bradley Grantham (JIRA)" <ji...@apache.org> on 2019/01/29 09:04:00 UTC
[jira] [Created] (ARROW-4413) [Python] pyarrow.hdfs.connect()
failing
Bradley Grantham created ARROW-4413:
---------------------------------------
Summary: [Python] pyarrow.hdfs.connect() failing
Key: ARROW-4413
URL: https://issues.apache.org/jira/browse/ARROW-4413
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.12.0
Environment: Python 2.7
Hadoop distribution: Amazon 2.7.3
Hive 2.1.1
Spark 2.1.1
Tez 0.8.4
Linux 4.4.35-33.55.amzn1.x86_64
Reporter: Bradley Grantham
Trying to connect to hdfs using the below snippet. Using {{hadoop-libhdfs}}.
This error appears in {{v0.12.0}}. It doesn't appear in {{v0.11.1}}. (I used the same environment when testing that it still worked on {{v0.11.1}})
{code:java}
In [1]: import pyarrow as pa
In [2]: fs = pa.hdfs.connect()
---------------------------------------------------------------------------
TypeError Traceback (most recent call last)
<ipython-input-2-e0007ad7fa95> in <module>()
----> 1 fs = pa.hdfs.connect()
/usr/local/lib64/python2.7/site-packages/pyarrow/hdfs.pyc in connect(host, port, user, kerb_ticket, driver, extra_conf)
205 fs = HadoopFileSystem(host=host, port=port, user=user,
206 kerb_ticket=kerb_ticket, driver=driver,
--> 207 extra_conf=extra_conf)
208 return fs
/usr/local/lib64/python2.7/site-packages/pyarrow/hdfs.pyc in __init__(self, host, port, user, kerb_ticket, driver, extra_conf)
36 _maybe_set_hadoop_classpath()
37
---> 38 self._connect(host, port, user, kerb_ticket, driver, extra_conf)
39
40 def __reduce__(self):
/usr/local/lib64/python2.7/site-packages/pyarrow/io-hdfs.pxi in pyarrow.lib.HadoopFileSystem._connect()
72 if host is not None:
73 conf.host = tobytes(host)
---> 74 self.host = host
75
76 conf.port = port
TypeError: Expected unicode, got str
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)