Posted to dev@arrow.apache.org by "Scott Wilson (Jira)" <ji...@apache.org> on 2020/03/24 19:59:00 UTC

[jira] [Created] (ARROW-8199) Guidance for creating multi-column sort on Table example?

Scott Wilson created ARROW-8199:
-----------------------------------

             Summary: Guidance for creating multi-column sort on Table example?
                 Key: ARROW-8199
                 URL: https://issues.apache.org/jira/browse/ARROW-8199
             Project: Apache Arrow
          Issue Type: Wish
          Components: C++ - Compute
    Affects Versions: 0.16.0
            Reporter: Scott Wilson


I'm just coming up to speed with Arrow and am noticing a dearth of examples ... maybe I can help here.

I'd like to implement multi-column sorting for Tables and just want to ensure that I'm not duplicating existing work or proposing a bad design.

My thought was to create a Table-specific version of SortToIndices() where you can specify the columns and sort order.
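As a minimal sketch of the idea, assuming plain std::vector columns of int64 values in place of real Arrow arrays: the names Column, Table, SortKey, SortOrder and SortTableToIndices below are hypothetical and are not part of the Arrow API. The function returns row indices ordered by a list of (column, direction) keys and leaves the data untouched.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical stand-ins, not Arrow types: a column is a plain vector of
// int64 values and a table is a vector of equal-length columns.
using Column = std::vector<std::int64_t>;
using Table = std::vector<Column>;

enum class SortOrder { Ascending, Descending };

// One sort key: which column to compare and in which direction.
struct SortKey {
  std::size_t column;
  SortOrder order;
};

// Returns row indices that order `table` by `keys`, most significant key
// first. The table itself is not modified. std::stable_sort keeps rows that
// tie on every key in their original relative order.
std::vector<std::size_t> SortTableToIndices(const Table& table,
                                            const std::vector<SortKey>& keys) {
  const std::size_t num_rows = table.empty() ? 0 : table.front().size();
  for (const Column& col : table) {
    assert(col.size() == num_rows);  // all columns must have the same length
  }
  for (const SortKey& key : keys) {
    assert(key.column < table.size());  // each key must name a real column
  }

  // Start from the identity permutation 0, 1, ..., num_rows - 1.
  std::vector<std::size_t> indices(num_rows);
  std::iota(indices.begin(), indices.end(), std::size_t{0});

  std::stable_sort(indices.begin(), indices.end(),
                   [&](std::size_t lhs, std::size_t rhs) {
                     for (const SortKey& key : keys) {
                       const std::int64_t a = table[key.column][lhs];
                       const std::int64_t b = table[key.column][rhs];
                       if (a == b) continue;  // tie: try the next key
                       return key.order == SortOrder::Ascending ? a < b : a > b;
                     }
                     return false;  // rows are equal on every key
                   });
  return indices;
}
```

A call such as SortTableToIndices(table, {{0, SortOrder::Ascending}, {1, SortOrder::Descending}}) would sort by the first column ascending and break ties on the second column descending. A real implementation would also need to handle nulls, chunked columns and non-integer types, which this sketch ignores.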

Then I'd create Array "views" that use the indices to remap the original Array values into sorted order. (The original data would not be reordered, although that could be done as a second step.) I noticed that some of the array list variants keep offsets, but I didn't see anything that supports remapping through a list of indices. Is that just my oversight?
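The "view" idea can be sketched in the same spirit, again over a plain std::vector and not a real Arrow Array. IndexedView and Materialize are hypothetical names chosen for illustration. The view stores only a pointer to the original values plus the index list, so the underlying data stays in its original order.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical helper, not an Arrow class: a read-only view that presents
// `values` in the order given by `indices` without copying or moving them.
template <typename T>
class IndexedView {
 public:
  IndexedView(const std::vector<T>& values, std::vector<std::size_t> indices)
      : values_(&values), indices_(std::move(indices)) {
    for (std::size_t i : indices_) {
      assert(i < values_->size());  // every index must be in range
    }
  }

  std::size_t size() const { return indices_.size(); }

  // Element i of the view is element indices[i] of the underlying data.
  const T& operator[](std::size_t i) const { return (*values_)[indices_[i]]; }

  // Optional second step: copy the values out in view order.
  std::vector<T> Materialize() const {
    std::vector<T> out;
    out.reserve(indices_.size());
    for (std::size_t i : indices_) {
      out.push_back((*values_)[i]);
    }
    return out;
  }

 private:
  const std::vector<T>* values_;      // not owned; must outlive the view
  std::vector<std::size_t> indices_;  // one entry per element of the view
};
```

Reading view[i] costs one extra indirection per access. Materialize() corresponds to the optional second step of actually producing the data in sorted order.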

Thanks in advance, Scott



--
This message was sent by Atlassian Jira
(v8.3.4#803005)