You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Matthias Vallentin (Jira)" <ji...@apache.org> on 2022/09/29 16:24:00 UTC
[jira] [Comment Edited] (ARROW-17839) [Python] Cannot create RecordBatch with nested struct containing extension type
[ https://issues.apache.org/jira/browse/ARROW-17839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17611125#comment-17611125 ]
Matthias Vallentin edited comment on ARROW-17839 at 9/29/22 4:23 PM:
---------------------------------------------------------------------
Since I didn't manage (yet) to try out the branch under development, I have one other issue, per the attached example. In this case I'm getting:
{code:java}
TypeError: Incompatible storage type dictionary<values=string, indices=int8, ordered=0> for extension type extension<vast.enumeration<EnumType>>{code}
I'm not sure whether that's considered a "cast" internally or whether I'm simply not creating an {{ExtensionArray}} properly from a dictionary. Any guidance would be much appreciated.
[^enum.py]
was (Author: mavam):
Since I didn't manage (yet) to try out the branch under development, I have one other issue, per the attached example. In this case I'm getting:
{code:java}
TypeError: Incompatible storage type dictionary<values=string, indices=int8, ordered=0> for extension type extension<vast.enumeration<EnumType>>{code}
I'm not sure whether that's considered a "cast" internally or whether I'm simply not creating an {{ExtensionArray}} properly from a dictionary. Any guidance would be much appreciated.
[^enum.py]
> [Python] Cannot create RecordBatch with nested struct containing extension type
> -------------------------------------------------------------------------------
>
> Key: ARROW-17839
> URL: https://issues.apache.org/jira/browse/ARROW-17839
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 9.0.0
> Environment: macOS 12.5.1 on an Apple M1 Ultra.
> Reporter: Matthias Vallentin
> Priority: Blocker
> Attachments: enum.py, example.py
>
>
> I'm running into the following issue:
> {code:java}
> pyarrow.lib.ArrowNotImplementedError: Unsupported cast to extension<vast.address<AddressType>> from fixed_size_binary[16]{code}
> Use case: I want to create a record batch that contains this type:
> {code:java}
> pa.struct([("address", AddressType()), ("length", pa.uint8())]){code}
> Here, {{AddressType}} is an extension type that models an IP address ({{{}pa.binary(16){}}}).
> Please find attached a self-contained example that illustrates the issue.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)