You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Alessandro Molina (Jira)" <ji...@apache.org> on 2021/10/12 08:27:00 UTC

[jira] [Created] (ARROW-14293) Basic Join functionality in PyArrow

Alessandro Molina created ARROW-14293:
-----------------------------------------

             Summary: Basic Join functionality in PyArrow
                 Key: ARROW-14293
                 URL: https://issues.apache.org/jira/browse/ARROW-14293
             Project: Apache Arrow
          Issue Type: Improvement
            Reporter: Alessandro Molina
             Fix For: 7.0.0


We want to expose a {{Table.join}} and {{Dataset.join}} functionalities in PyArrow which can leverage our join feature from the ExecPlan to expose.

The {{Table.join}} can easily return a new {{Table}}, questions about what {{Dataset.join}} might return are more complex as it probably doesn't make much sense to return a new {{Dataset}} given that the result won't map to any files on disk



--
This message was sent by Atlassian Jira
(v8.3.4#803005)