You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Todd Farmer (Jira)" <ji...@apache.org> on 2022/07/12 14:05:03 UTC

[jira] [Assigned] (ARROW-15589) [C++] Add support for sliced Substrait reads

     [ https://issues.apache.org/jira/browse/ARROW-15589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Todd Farmer reassigned ARROW-15589:
-----------------------------------

    Assignee:     (was: Ariana Villegas)

This issue was last updated over 90 days ago, which may be an indication it is no longer being actively worked. To better reflect the current state, the issue is being unassigned. Please feel free to re-take assignment of the issue if it is being actively worked, or if you plan to start that work soon.

> [C++] Add support for sliced Substrait reads
> --------------------------------------------
>
>                 Key: ARROW-15589
>                 URL: https://issues.apache.org/jira/browse/ARROW-15589
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Weston Pace
>            Priority: Major
>              Labels: substrait
>
> The Substrait format allows for "sliced reads" which only read a part of the file and would most likely be used if a read operation were distributed across multiple files.
> For each file a start byte and length is specified.  For files that contain indivisible "groups" (e.g. Parquet row groups) this is handled by picking some heuristic.  For example, read all row groups whose midpoint is contained in the interval.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)