Posted to jira@arrow.apache.org by "Krisztian Szucs (Jira)" <ji...@apache.org> on 2020/09/23 13:03:00 UTC

[jira] [Assigned] (ARROW-6281) [Python] Produce chunked arrays for nested types in pyarrow.array

     [ https://issues.apache.org/jira/browse/ARROW-6281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Krisztian Szucs reassigned ARROW-6281:
--------------------------------------

    Assignee: Krisztian Szucs

> [Python] Produce chunked arrays for nested types in pyarrow.array
> -----------------------------------------------------------------
>
>                 Key: ARROW-6281
>                 URL: https://issues.apache.org/jira/browse/ARROW-6281
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Wes McKinney
>            Assignee: Krisztian Szucs
>            Priority: Major
>             Fix For: 2.0.0
>
>
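> As a follow-up to ARROW-5028 and other issues, in a case like
> {code}
> import pyarrow as pa
> # (2 << 20) + 1 rows of one 1024-byte string each: just over 2 GiB of string data
> vals = [['x' * 1024]] * ((2 << 20) + 1)
> arr = pa.array(vals)
> {code}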
> the child array of the ListArray cannot hold all of the string data. After the patch for ARROW-5028, an exception is raised rather than a malformed array being returned. We could (with some effort) instead produce a chunked array of list<string> type.
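
The arithmetic behind the failure, and one possible way to split the input, can be sketched in plain Python. This is only an illustration, not the approach taken in the eventual patch: it assumes the limit comes from the 32-bit offsets of the string child array, that the input contains no nulls, and `split_by_child_bytes` is a hypothetical helper, not part of pyarrow.

```python
INT32_MAX = 2**31 - 1  # assumed limit: largest offset a 32-bit-offset string array can store


def split_by_child_bytes(rows, limit=INT32_MAX):
    """Split rows (lists of str) into chunks whose combined UTF-8 string
    bytes stay within `limit`. Hypothetical helper, not a pyarrow API.
    A single row larger than `limit` still ends up alone in its own chunk."""
    chunks, current, used = [], [], 0
    for row in rows:
        size = sum(len(s.encode("utf-8")) for s in row)
        # Start a new chunk when adding this row would exceed the limit.
        if current and used + size > limit:
            chunks.append(current)
            current, used = [], 0
        current.append(row)
        used += size
    if current:
        chunks.append(current)
    return chunks


# The case from the issue: (2 << 20) + 1 rows, one 1024-byte string per row.
n_rows = (2 << 20) + 1
total_bytes = n_rows * 1024
overflows = total_bytes > INT32_MAX

# Small-scale demonstration with a tiny limit so that it runs instantly.
demo = split_by_child_bytes([["x" * 4]] * 5, limit=8)
```

Each resulting chunk could then presumably be converted on its own with `pa.array` and the pieces combined with `pa.chunked_array`, which is roughly the result this issue asks `pa.array` to produce by itself.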



--
This message was sent by Atlassian Jira
(v8.3.4#803005)