You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Matthias Vallentin (Jira)" <ji...@apache.org> on 2022/09/29 16:24:00 UTC

[jira] [Comment Edited] (ARROW-17839) [Python] Cannot create RecordBatch with nested struct containing extension type

    [ https://issues.apache.org/jira/browse/ARROW-17839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17611125#comment-17611125 ] 

Matthias Vallentin edited comment on ARROW-17839 at 9/29/22 4:23 PM:
---------------------------------------------------------------------

Since I didn't manage (yet) to try out the branch under development, I have one other issue, per the attached example. In this case I'm getting:
{code:java}
TypeError: Incompatible storage type dictionary<values=string, indices=int8, ordered=0> for extension type extension<vast.enumeration<EnumType>>{code}
I'm not sure whether that's considered a "cast" internally or whether I'm simply not creating an {{ExtensionArray}} properly from a dictionary. Any guidance would be much appreciated.

[^enum.py]


was (Author: mavam):
Since I didn't manage (yet) to try out the branch under development, I have one other issue, per the attached example. In this case I'm getting:

 
{code:java}
TypeError: Incompatible storage type dictionary<values=string, indices=int8, ordered=0> for extension type extension<vast.enumeration<EnumType>>{code}
 

I'm not sure whether that's considered a "cast" internally or whether I'm simply not creating an {{ExtensionArray}} properly from a dictionary. Any guidance would be much appreciated.

[^enum.py]

> [Python] Cannot create RecordBatch with nested struct containing extension type
> -------------------------------------------------------------------------------
>
>                 Key: ARROW-17839
>                 URL: https://issues.apache.org/jira/browse/ARROW-17839
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 9.0.0
>         Environment: macOS 12.5.1 on an Apple M1 Ultra.
>            Reporter: Matthias Vallentin
>            Priority: Blocker
>         Attachments: enum.py, example.py
>
>
> I'm running into the following issue:
> {code:java}
> pyarrow.lib.ArrowNotImplementedError: Unsupported cast to extension<vast.address<AddressType>> from fixed_size_binary[16]{code}
> Use case: I want to create a record batch that contains this type:
> {code:java}
> pa.struct([("address", AddressType()), ("length", pa.uint8())]){code}
> Here, {{AddressType}} is an extension type that models an IP address ({{{}pa.binary(16){}}}).
> Please find attached a self-contained example that illustrates the issue.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)