Posted to jira@arrow.apache.org by "Yesh (Jira)" <ji...@apache.org> on 2021/09/14 00:17:00 UTC

[jira] [Created] (ARROW-13991) [C++][Dataset] - Dataset write to accept mask array to include/exclude rows

Yesh created ARROW-13991:
----------------------------

             Summary: [C++][Dataset] - Dataset write to accept mask array to include/exclude rows 
                 Key: ARROW-13991
                 URL: https://issues.apache.org/jira/browse/ARROW-13991
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: Yesh


Currently, a filesystem union dataset is built from individual in-memory tables. Before these tables are assembled into the union dataset, a complex filter step has already determined which rows to exclude or include, expressed as a mask array. The ask is for the writer side of the union dataset (or of its fragments) to accept this mask array, so that rows are excluded from or included in the final file at write time.

If incorporated, this would eliminate an additional step in the application: filtering rows out of each table before writing.
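
To make the request concrete, here is a minimal, library-agnostic sketch in plain Python. It does not use any Arrow API: the columnar dict, the CSV output, and the function names `filter_then_write` and `write_rows` are all hypothetical and exist only for illustration. It contrasts the current two-step flow (filter a copy, then write) with a writer that takes the mask directly.

```python
import csv
import io
from itertools import compress


def write_rows(columns, out, mask=None):
    """Hypothetical writer: an optional boolean mask selects rows at write time."""
    names = list(columns)
    n_rows = len(columns[names[0]]) if names else 0
    if mask is not None and len(mask) != n_rows:
        raise ValueError("mask length must match the number of rows")
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(names)
    rows = zip(*(columns[name] for name in names))
    if mask is not None:
        rows = compress(rows, mask)  # keep only rows whose mask entry is truthy
    writer.writerows(rows)


def filter_then_write(columns, mask, out):
    """Current flow: the application builds a filtered copy first, then writes it."""
    filtered = {name: list(compress(values, mask)) for name, values in columns.items()}
    write_rows(filtered, out)


# Illustrative data only: four rows, of which the mask keeps the first and third.
columns = {"id": [1, 2, 3, 4], "value": ["a", "b", "c", "d"]}
mask = [True, False, True, False]

two_step, one_step = io.StringIO(), io.StringIO()
filter_then_write(columns, mask, two_step)
write_rows(columns, one_step, mask=mask)
assert two_step.getvalue() == one_step.getvalue()
```

Both paths produce the same file contents; the difference is that the second one avoids materializing a filtered copy of the table in application code.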



--
This message was sent by Atlassian Jira
(v8.3.4#803005)