You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Henrikh Kantuni (Jira)" <ji...@apache.org> on 2021/10/08 21:40:00 UTC
[jira] [Updated] (ARROW-14267) Cannot convert DataFrame with
geometry `numpy.dtype` cells to Table
[ https://issues.apache.org/jira/browse/ARROW-14267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Henrikh Kantuni updated ARROW-14267:
------------------------------------
Description:
Example:
{code:java}
import geopandas as gpd
import pandas as pd
import pyarrow as pa
path = gpd.datasets.get_path("naturalearth_lowres")
data = gpd.read_file(path)
df = pd.DataFrame(data)
table = pa.Table.from_pandas(df)
print(table)
{code}
Throws the following error:
{code:java}
Traceback (most recent call last):
File "/Users/Henrikh/Desktop/tmp.py", line 8, in <module>
table = pa.Table.from_pandas(df)
File "pyarrow/table.pxi", line 1553, in pyarrow.lib.Table.from_pandas
File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 594, in dataframe_to_arrays
arrays = [convert_column(c, f)
File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 594, in <listcomp>
arrays = [convert_column(c, f)
File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 581, in convert_column
raise e
File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 575, in convert_column
result = pa.array(col, type=type_, from_pandas=True, safe=safe)
File "pyarrow/array.pxi", line 302, in pyarrow.lib.array
File "pyarrow/array.pxi", line 79, in pyarrow.lib._ndarray_to_array
File "pyarrow/array.pxi", line 67, in pyarrow.lib._ndarray_to_type
File "pyarrow/error.pxi", line 120, in pyarrow.lib.check_status
pyarrow.lib.ArrowTypeError: ('Did not pass numpy.dtype object', 'Conversion failed for column geometry with type geometry'){code}
was:
Example:
{code:java}
import geopandas as gpd
import pandas as pd
import pyarrow as pa
path = gpd.datasets.get_path("naturalearth_lowres")
data = gpd.read_file(path)
df = pd.DataFrame(data)
table = pa.Table.from_pandas(df)
print(table)
{code}
Throws the following error:
{code:java}
Traceback (most recent call last):
File "/Users/Henrikh/Desktop/tmp.py", line 8, in <module>
table = pa.Table.from_pandas(df)
File "pyarrow/table.pxi", line 1553, in pyarrow.lib.Table.from_pandas
File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 594, in dataframe_to_arrays
arrays = [convert_column(c, f)
File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 594, in <listcomp>
arrays = [convert_column(c, f)
File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 581, in convert_column
raise e
File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 575, in convert_column
result = pa.array(col, type=type_, from_pandas=True, safe=safe)
File "pyarrow/array.pxi", line 302, in pyarrow.lib.array
File "pyarrow/array.pxi", line 79, in pyarrow.lib._ndarray_to_array
File "pyarrow/array.pxi", line 67, in pyarrow.lib._ndarray_to_type
File "pyarrow/error.pxi", line 120, in pyarrow.lib.check_status
pyarrow.lib.ArrowTypeError: ('Did not pass numpy.dtype object', 'Conversion failed for column geometry with type geometry'){code}
> Cannot convert DataFrame with geometry `numpy.dtype` cells to Table
> -------------------------------------------------------------------
>
> Key: ARROW-14267
> URL: https://issues.apache.org/jira/browse/ARROW-14267
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 5.0.0
> Reporter: Henrikh Kantuni
> Priority: Minor
> Labels: pyarrow
>
> Example:
> {code:java}
> import geopandas as gpd
> import pandas as pd
> import pyarrow as pa
> path = gpd.datasets.get_path("naturalearth_lowres")
> data = gpd.read_file(path)
> df = pd.DataFrame(data)
> table = pa.Table.from_pandas(df)
> print(table)
> {code}
> Throws the following error:
> {code:java}
> Traceback (most recent call last):
> File "/Users/Henrikh/Desktop/tmp.py", line 8, in <module>
> table = pa.Table.from_pandas(df)
> File "pyarrow/table.pxi", line 1553, in pyarrow.lib.Table.from_pandas
> File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 594, in dataframe_to_arrays
> arrays = [convert_column(c, f)
> File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 594, in <listcomp>
> arrays = [convert_column(c, f)
> File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 581, in convert_column
> raise e
> File "/usr/local/lib/python3.9/site-packages/pyarrow/pandas_compat.py", line 575, in convert_column
> result = pa.array(col, type=type_, from_pandas=True, safe=safe)
> File "pyarrow/array.pxi", line 302, in pyarrow.lib.array
> File "pyarrow/array.pxi", line 79, in pyarrow.lib._ndarray_to_array
> File "pyarrow/array.pxi", line 67, in pyarrow.lib._ndarray_to_type
> File "pyarrow/error.pxi", line 120, in pyarrow.lib.check_status
> pyarrow.lib.ArrowTypeError: ('Did not pass numpy.dtype object', 'Conversion failed for column geometry with type geometry'){code}
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)