Posted to jira@arrow.apache.org by "Weston Pace (Jira)" <ji...@apache.org> on 2021/09/07 17:58:00 UTC

[jira] [Commented] (ARROW-13338) [C++][Dataset] Make async Scanner the default

    [ https://issues.apache.org/jira/browse/ARROW-13338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17411394#comment-17411394 ] 

Weston Pace commented on ARROW-13338:
-------------------------------------

Just an update: this is currently blocked.  The async write is coming along fine and should be available for 6.0.0, but it is not yet clear to me that 6.0.0's async scanner will support back pressure.  If the async scanner does not have that feature, I would be reluctant to make it the default.  I will try to resolve this soon.

> [C++][Dataset] Make async Scanner the default
> ---------------------------------------------
>
>                 Key: ARROW-13338
>                 URL: https://issues.apache.org/jira/browse/ARROW-13338
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Ben Kietzman
>            Priority: Major
>              Labels: dataset, pull-request-available
>             Fix For: 6.0.0
>
>          Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> The synchronous Scanner is not integrated with ExecPlans and never outperforms the asynchronous scanner, so we should start migrating away from this extraneous code path so that it can be deprecated and then deleted.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)