Posted to issues@arrow.apache.org by "Krisztian Szucs (JIRA)" <ji...@apache.org> on 2019/01/21 10:10:00 UTC
[jira] [Commented] (ARROW-4308) [Python] pyarrow has a hard dependency on pandas
[ https://issues.apache.org/jira/browse/ARROW-4308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16747816#comment-16747816 ]
Krisztian Szucs commented on ARROW-4308:
----------------------------------------
I couldn't reproduce it. You must have pandas installed or otherwise available on the path, because the else branch wouldn't run otherwise:
{code:python}
Python 3.6.7 | packaged by conda-forge | (default, Nov 20 2018, 18:37:09)
[GCC 4.2.1 Compatible Clang 4.0.1 (tags/RELEASE_401/final)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import pyarrow
>>> import pandas
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
ModuleNotFoundError: No module named 'pandas'
>>>
{code}
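For context, the "else branch" refers to an optional-import guard of the kind visible in the table.pxi fragment of the traceback quoted below. The following is only a minimal sketch of that general pattern, not pyarrow's actual code; the helper name optional_import and the placeholder module name are hypothetical and chosen for illustration.

```python
import importlib


def optional_import(name):
    """Return the imported module, or None if it cannot be imported.

    Hypothetical helper for illustration only; not part of pyarrow.
    """
    try:
        module = importlib.import_module(name)
    except ImportError:
        # The dependency is absent, so treat it as a soft dependency.
        return None
    else:
        # This branch runs only when the top-level import succeeded.
        return module


# Placeholder name that is assumed not to be installed anywhere.
missing = optional_import("surely_not_an_installed_module_xyz")
# A standard-library module, to exercise the success path.
present = optional_import("json")
```

A guard like this only checks that the top-level name can be imported. Something else named pandas on the path could pass the check and still fail later on a submodule import, which would be consistent with the 'pandas.core' error in the report, though that is a guess.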
> [Python] pyarrow has a hard dependency on pandas
> ------------------------------------------------
>
> Key: ARROW-4308
> URL: https://issues.apache.org/jira/browse/ARROW-4308
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Reporter: Wes McKinney
> Priority: Major
> Fix For: 0.13.0
>
>
> We either need to make pandas a soft dependency (as it was in the past) or add it to the package requirements. After running {{pip install pyarrow}} for 0.12.0 in a fresh environment, importing pyarrow fails with:
> {code}
> In [1]: import pyarrow as pa
> ---------------------------------------------------------------------------
> ModuleNotFoundError Traceback (most recent call last)
> <ipython-input-1-852643f3aad4> in <module>
> ----> 1 import pyarrow as pa
> ~/miniconda/envs/pyarrow-pip-3.7/lib/python3.7/site-packages/pyarrow/__init__.py in <module>
> 52
> 53
> ---> 54 from pyarrow.lib import cpu_count, set_cpu_count
> 55 from pyarrow.lib import (null, bool_,
> 56 int8, int16, int32, int64,
> ~/miniconda/envs/pyarrow-pip-3.7/lib/python3.7/site-packages/pyarrow/table.pxi in init pyarrow.lib()
> 26 pass
> 27 else:
> ---> 28 import pyarrow.pandas_compat as pdcompat
> 29
> 30
> ~/miniconda/envs/pyarrow-pip-3.7/lib/python3.7/site-packages/pyarrow/pandas_compat.py in <module>
> 22 import re
> 23
> ---> 24 import pandas.core.internals as _int
> 25 import numpy as np
> 26 import pandas as pd
> ModuleNotFoundError: No module named 'pandas.core'
> {code}