You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/06/02 10:59:00 UTC

[jira] [Resolved] (ARROW-16685) [Python] Using Python's set in inner Table.join has undeterministic behaviour

     [ https://issues.apache.org/jira/browse/ARROW-16685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Antoine Pitrou resolved ARROW-16685.
------------------------------------
    Resolution: Fixed

Issue resolved by pull request 13281
[https://github.com/apache/arrow/pull/13281]

> [Python] Using Python's set in inner Table.join has undeterministic behaviour
> -----------------------------------------------------------------------------
>
>                 Key: ARROW-16685
>                 URL: https://issues.apache.org/jira/browse/ARROW-16685
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>            Reporter: Alenka Frim
>            Assignee: Alessandro Molina
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 9.0.0
>
>          Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> For the inner join of pa.Table we are using Python's set, which causes undeterministic behaviour. This should be fixed so that the ordering of the output columns is stable.
> See:
>  * [https://github.com/apache/arrow/pull/13260#issuecomment-1141906725] 
>  * [https://github.com/apache/arrow/runs/6615749316?check_suite_focus=true#step:6:5590]



--
This message was sent by Atlassian Jira
(v8.20.7#820007)