You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2021/07/30 16:57:00 UTC

[jira] [Issue Comment Deleted] (ARROW-13498) [C++] ScanNode takes filter but doesn't filter

     [ https://issues.apache.org/jira/browse/ARROW-13498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neal Richardson updated ARROW-13498:
------------------------------------
    Comment: was deleted

(was: Maybe this is my confusion in ARROW-13344? Because the ScanNode takes a filter and projection, I don't ever create a FilterNode because I assume that the filter is already applied--why else would I provide a filter to the ScanNode? But maybe that's mistaken?)

> [C++] ScanNode takes filter but doesn't filter
> ----------------------------------------------
>
>                 Key: ARROW-13498
>                 URL: https://issues.apache.org/jira/browse/ARROW-13498
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>            Reporter: Neal Richardson
>            Priority: Major
>             Fix For: 6.0.0
>
>
> This turned out to be my confusion in ARROW-13344: because the ScanNode takes a filter and projection, I wasn't creating a FilterNode because I assume that the filter is already applied--why else would I provide a filter to the ScanNode? But it turns out that if you don't Filter again, you get unfiltered results:
> {code}
> Table$create(
>   group=c(1, 2), 
>   value=c(5, 6)
> ) %>% 
>   filter(value > 5) %>% 
>   group_by(group) %>% 
>   summarize(sum(value)) %>% 
>   collect()
> # A tibble: 2 x 2
>   group `sum(value)`
>   <dbl>        <dbl>
> 1     1            5
> 2     2            6
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)