You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2022/10/24 18:26:00 UTC

[jira] [Assigned] (ARROW-12282) [R] Refactor collect and compute methods

     [ https://issues.apache.org/jira/browse/ARROW-12282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neal Richardson reassigned ARROW-12282:
---------------------------------------

    Assignee: Neal Richardson

> [R] Refactor collect and compute methods
> ----------------------------------------
>
>                 Key: ARROW-12282
>                 URL: https://issues.apache.org/jira/browse/ARROW-12282
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: R
>            Reporter: Ian Cook
>            Assignee: Neal Richardson
>            Priority: Major
>
> ARROW-11754 added a {{compute()}} method which calls {{collect(as_data_frame = FALSE)}}. This was a clean and simple way to add {{compute()}}, but it would be more logical if {{collect()}} instead called {{compute()}}, or if both called an internal function that forced computation of the lazy ops. But this is complicated by the need to call {{restore_dplyr_features()}} in both {{compute()}} and {{collect()}} and muddied by some uncertainty about where exactly {{ensure_group_vars()}} and {{ensure_arrange_vars()}} should be called. Take a closer look at this later date and consider whether to refactor it.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)