Posted to issues-all@impala.apache.org by "Wenzhe Zhou (Jira)" <ji...@apache.org> on 2020/06/06 23:31:00 UTC

[jira] [Resolved] (IMPALA-3741) Push bloom filters to Kudu scanners

     [ https://issues.apache.org/jira/browse/IMPALA-3741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wenzhe Zhou resolved IMPALA-3741.
---------------------------------
     Fix Version/s: Impala 4.0
    Target Version: Kudu_Impala, Impala 4.0  (was: Kudu_Impala)
        Resolution: Fixed

> Push bloom filters to Kudu scanners
> -----------------------------------
>
>                 Key: IMPALA-3741
>                 URL: https://issues.apache.org/jira/browse/IMPALA-3741
>             Project: IMPALA
>          Issue Type: Task
>          Components: Backend
>    Affects Versions: Kudu_Impala
>            Reporter: Matthew Jacobs
>            Assignee: Wenzhe Zhou
>            Priority: Major
>              Labels: kudu, performance
>             Fix For: Impala 4.0
>
>
> Impala relies on bloom filters to reduce the number of rows coming out of the scan node for selective joins.
> Queries get up to a 20x speedup, so not having bloom filter support in Kudu will create a big performance gap between Parquet and Kudu.
> https://github.com/cloudera/Impala/blob/cdh5-trunk/be/src/util/bloom-filter.h
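
To illustrate why pushing the filter into the scan matters, here is a minimal Python sketch of the general idea: the build side of a join populates a bloom filter with its join keys, and the scan drops rows whose key cannot match before they reach the join. This is not Impala's or Kudu's implementation. The class and function names (BloomFilter, build_filter, scan_with_filter), the hashing scheme, and the filter sizes are all assumptions made for the example.

```python
import hashlib


class BloomFilter:
    """Toy bloom filter (hypothetical; not the filter in bloom-filter.h)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive num_hashes bit positions by hashing the key with different seeds.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        # False means "definitely absent"; True means "possibly present".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key)
        )


def build_filter(build_rows, key):
    """Populate a filter from the join keys of the (small) build side."""
    bf = BloomFilter()
    for row in build_rows:
        bf.add(row[key])
    return bf


def scan_with_filter(probe_rows, key, bf):
    """Yield only the probe rows whose join key might be on the build side."""
    for row in probe_rows:
        if bf.might_contain(row[key]):
            yield row
```

Because a bloom filter has no false negatives, every row that would join survives the scan. The occasional false positive is passed through and then discarded by the join itself, so the query result is unchanged while far fewer rows leave the scan for a selective join.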


