You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Matthias Vallentin (Jira)" <ji...@apache.org> on 2022/09/29 06:24:00 UTC

[jira] [Commented] (ARROW-17839) [Python] Cannot create RecordBatch with nested struct containing extension type

    [ https://issues.apache.org/jira/browse/ARROW-17839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17610822#comment-17610822 ] 

Matthias Vallentin commented on ARROW-17839:
--------------------------------------------

Thanks for the point, [~jorisvandenbossche]. Glad to see that a fix is underway.

Would you mind pointing me to instructions on how to do the test that you performed? I am using Poetry and couldn't get the branch to compile. In theory, I thought this should do the trick:
{code:java}
[tool.poetry.dependencies]
#pyarrow = "^9.0"
pyarrow = { git = "https://github.com/milesgranger/arrow.git", branch = "ARROW-15545_cast-of-extension-types", subdirectory = "python" }{code}
But this fails to compile due to missing dependencies. (I managed to workaround OpenSSL by providing the right env var, but now I'm stuck with Flight not being found.) I was hoping that there is some sort of dev guide that shows how to get going.

> [Python] Cannot create RecordBatch with nested struct containing extension type
> -------------------------------------------------------------------------------
>
>                 Key: ARROW-17839
>                 URL: https://issues.apache.org/jira/browse/ARROW-17839
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 9.0.0
>         Environment: macOS 12.5.1 on an Apple M1 Ultra.
>            Reporter: Matthias Vallentin
>            Priority: Blocker
>         Attachments: example.py
>
>
> I'm running into the following issue:
> {code:java}
> pyarrow.lib.ArrowNotImplementedError: Unsupported cast to extension<vast.address<AddressType>> from fixed_size_binary[16]{code}
> Use case: I want to create a record batch that contains this type:
> {code:java}
> pa.struct([("address", AddressType()), ("length", pa.uint8())]){code}
> Here, {{AddressType}} is an extension type that models an IP address ({{{}pa.binary(16){}}}).
> Please find attached a self-contained example that illustrates the issue.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)