You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Jonathan Keane (Jira)" <ji...@apache.org> on 2021/10/21 19:22:00 UTC
[jira] [Assigned] (ARROW-14426) [C++] Add a minimum_row_group_size
to dataset writing
[ https://issues.apache.org/jira/browse/ARROW-14426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jonathan Keane reassigned ARROW-14426:
--------------------------------------
Assignee: Weston Pace
> [C++] Add a minimum_row_group_size to dataset writing
> -----------------------------------------------------
>
> Key: ARROW-14426
> URL: https://issues.apache.org/jira/browse/ARROW-14426
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++
> Reporter: Jonathan Keane
> Assignee: Weston Pace
> Priority: Major
>
> Right now we right whatever chunks we get, but if those chunks are exceptionally small, we should bundle them up and write out a configurable minimum row group size
--
This message was sent by Atlassian Jira
(v8.3.4#803005)