Posted to jira@arrow.apache.org by "Yesh (Jira)" <ji...@apache.org> on 2021/09/14 00:17:00 UTC
[jira] [Created] (ARROW-13991) [C++][Dataset] - Dataset write to
accept mask array to include/exclude rows
Yesh created ARROW-13991:
----------------------------
Summary: [C++][Dataset] - Dataset write to accept mask array to include/exclude rows
Key: ARROW-13991
URL: https://issues.apache.org/jira/browse/ARROW-13991
Project: Apache Arrow
Issue Type: Improvement
Components: C++
Reporter: Yesh
Currently, a filesystem union dataset is built from individual in-memory tables. Before these tables are assembled into the union dataset, a complex filter step has already determined which rows to exclude or include, expressed as a mask array. The ask is for this mask array to be accepted on the writer side of the union dataset (or its fragments), so that rows are excluded from or included in the final file.
If incorporated, this would eliminate the additional application-side step of filtering rows out of each table before writing.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)