Posted to jira@arrow.apache.org by "David Li (Jira)" <ji...@apache.org> on 2022/07/26 18:00:00 UTC

[jira] [Updated] (ARROW-17216) [C++] Support joining tables with non-key fields as list

     [ https://issues.apache.org/jira/browse/ARROW-17216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

David Li updated ARROW-17216:
-----------------------------
    Summary: [C++] Support joining tables with non-key fields as list  (was: Support joining tables with non-key fields as list)

> [C++] Support joining tables with non-key fields as list
> --------------------------------------------------------
>
>                 Key: ARROW-17216
>                 URL: https://issues.apache.org/jira/browse/ARROW-17216
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Jayjeet Chakraborty
>            Priority: Major
>              Labels: query-engine
>
> I am trying to join two Arrow tables in which some columns are of {{list<float>}} data type. Note that my join columns/keys are primitive data types, while some of my non-join columns are of {{list<float>}} type. However, PyArrow {{join()}} cannot join such tables, although pandas can. It says
> {{ArrowInvalid: Data type list<item: float> is not supported in join non-key field}}
> when I execute this piece of code
> {{joined_table = table_1.join(table_2, ['k1', 'k2', 'k3'])}}
> A [Stack Overflow|https://stackoverflow.com/questions/73071105/listitem-float-not-supported-in-join-non-key-field] response pointed out that Arrow currently cannot handle non-fixed types for joins. Can this be fixed? Or is this intentional?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)