Posted to jira@arrow.apache.org by "Ben Kietzman (Jira)" <ji...@apache.org> on 2021/05/05 18:49:00 UTC
[jira] [Commented] (ARROW-12659) [C++][Compute] Support SimplifyWithGuarantee(is_null(foo), invert(is_null(foo)))
[ https://issues.apache.org/jira/browse/ARROW-12659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17339840#comment-17339840 ]
Ben Kietzman commented on ARROW-12659:
--------------------------------------
I can definitely see the utility of producing a row group statistics expression which includes non-nullability, but I think it would be {{foo >= 3 and foo <= 4 and is_valid(foo)}} rather than {{foo >= 3 and foo <= 4 and invert(is_null(foo))}}.
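As a rough illustration of why the two spellings should behave the same as guarantees, here is a toy C++ sketch. It does not use Arrow's Expression API: NullTest, Invert, and SimplifyWithNullGuarantee are hypothetical names invented for this example, and the model only covers a single null check on a single field.

```cpp
#include <cassert>
#include <optional>
#include <string>

// Toy model only (an assumption, not Arrow's Expression type):
// a null check on one named field.
struct NullTest {
  std::string field;
  bool is_null;  // true: is_null(field); false: is_valid(field)
};

// invert(is_null(x)) is logically the same test as is_valid(x), and
// invert(is_valid(x)) is the same test as is_null(x).
NullTest Invert(NullTest test) {
  test.is_null = !test.is_null;
  return test;
}

// Hypothetical helper: returns the constant the predicate reduces to when
// the guarantee is known to hold, or std::nullopt when the guarantee says
// nothing about the predicate's field.
std::optional<bool> SimplifyWithNullGuarantee(const NullTest& predicate,
                                              const NullTest& guarantee) {
  if (predicate.field != guarantee.field) return std::nullopt;
  // Same test as the guarantee: always true. Opposite test: always false.
  return predicate.is_null == guarantee.is_null;
}
```

In this model, a guarantee built as Invert(is_null(foo)) is indistinguishable from is_valid(foo), so the predicate is_null(foo) reduces to false under either one. A real simplifier would get the same effect by normalizing the inverted form before matching.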
> [C++][Compute] Support SimplifyWithGuarantee(is_null(foo), invert(is_null(foo)))
> --------------------------------------------------------------------------------
>
> Key: ARROW-12659
> URL: https://issues.apache.org/jira/browse/ARROW-12659
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: David Li
> Priority: Major
> Labels: dataset, datasets, expression, expressions
>
> Simplifying a predicate using is_null with a guarantee using invert(is_null()) doesn't lead to any simplification.
> Once supported, tests should be added for Datasets to exercise row group filtering (and to generate the necessary clauses from statistics).