Posted to issues@arrow.apache.org by "Krisztian Szucs (JIRA)" <ji...@apache.org> on 2019/01/21 10:11:00 UTC

[jira] [Comment Edited] (ARROW-4308) [Python] pyarrow has a hard dependency on pandas

    [ https://issues.apache.org/jira/browse/ARROW-4308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16747816#comment-16747816 ] 

Krisztian Szucs edited comment on ARROW-4308 at 1/21/19 10:10 AM:
------------------------------------------------------------------

I couldn't reproduce it. You must have pandas installed or otherwise available on the import path, because the else branch wouldn't run otherwise:

 
{code}
Python 3.6.7 | packaged by conda-forge | (default, Nov 20 2018, 18:37:09)
[GCC 4.2.1 Compatible Clang 4.0.1 (tags/RELEASE_401/final)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import pyarrow
>>> import pandas
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ModuleNotFoundError: No module named 'pandas'
>>>
{code}
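
To illustrate the point about the else branch: the traceback in the issue shows an {{else:}} block that imports {{pyarrow.pandas_compat}}, which suggests a try/except/else import guard. The sketch below is not the pyarrow source. {{load_compat}}, {{demo}} and the {{fakepandas_*}} package names are hypothetical and exist only for this example. It shows that such a guard skips the else branch when the package is absent, and that an empty directory with the package's name on the path (one possible cause, offered as an assumption and not as a diagnosis of the reporter's environment) is enough to make the top-level import succeed and the else branch fail later.

```python
import importlib
import os
import sys
import tempfile


def load_compat(pkg_name):
    """Soft-dependency guard (hypothetical helper, not pyarrow code)."""
    try:
        importlib.import_module(pkg_name)
    except ImportError:
        # Optional package is missing: skip the compat layer entirely.
        return "skipped"
    else:
        # Only reached when the top-level import above succeeded.
        try:
            importlib.import_module(pkg_name + ".core")
        except ImportError as exc:
            return "else branch failed: " + str(exc)
        return "loaded"


def demo():
    results = {}
    # Case 1: nothing by that name on the path, so the guard skips.
    results["absent"] = load_compat("fakepandas_absent")

    # Case 2: an empty directory with the package's name is on the path.
    # Python imports it as a namespace package, so the else branch runs.
    with tempfile.TemporaryDirectory() as tmp:
        os.mkdir(os.path.join(tmp, "fakepandas_stray"))
        sys.path.insert(0, tmp)
        importlib.invalidate_caches()
        try:
            results["stray"] = load_compat("fakepandas_stray")
        finally:
            # Undo the changes made to the interpreter state.
            sys.path.remove(tmp)
            sys.modules.pop("fakepandas_stray", None)
    return results


if __name__ == "__main__":
    for case, outcome in demo().items():
        print(case, "->", outcome)
```

On Python 3.3 or later the second case should report a missing {{.core}} submodule, which has the same shape as the {{No module named 'pandas.core'}} error in the report. Checking {{pandas.__file__}} or {{pandas.__path__}} in the failing environment would show where the partial import comes from.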



> [Python] pyarrow has a hard dependency on pandas
> ------------------------------------------------
>
>                 Key: ARROW-4308
>                 URL: https://issues.apache.org/jira/browse/ARROW-4308
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>            Reporter: Wes McKinney
>            Priority: Major
>             Fix For: 0.13.0
>
>
> We either need to make pandas a soft dependency (as it was in the past) or add it to the package requirements. After {{pip install pyarrow}} for 0.12.0 in a fresh environment, importing pyarrow fails with:
> {code}
> In [1]: import pyarrow as pa
> ---------------------------------------------------------------------------
> ModuleNotFoundError                       Traceback (most recent call last)
> <ipython-input-1-852643f3aad4> in <module>
> ----> 1 import pyarrow as pa
> ~/miniconda/envs/pyarrow-pip-3.7/lib/python3.7/site-packages/pyarrow/__init__.py in <module>
>      52 
>      53 
> ---> 54 from pyarrow.lib import cpu_count, set_cpu_count
>      55 from pyarrow.lib import (null, bool_,
>      56                          int8, int16, int32, int64,
> ~/miniconda/envs/pyarrow-pip-3.7/lib/python3.7/site-packages/pyarrow/table.pxi in init pyarrow.lib()
>      26     pass
>      27 else:
> ---> 28     import pyarrow.pandas_compat as pdcompat
>      29 
>      30 
> ~/miniconda/envs/pyarrow-pip-3.7/lib/python3.7/site-packages/pyarrow/pandas_compat.py in <module>
>      22 import re
>      23 
> ---> 24 import pandas.core.internals as _int
>      25 import numpy as np
>      26 import pandas as pd
> ModuleNotFoundError: No module named 'pandas.core'
> {code}


