You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Junjie Chen (Jira)" <ji...@apache.org> on 2021/02/01 09:44:00 UTC

[jira] [Comment Edited] (PARQUET-1805) Refactor the configuration for bloom filters

    [ https://issues.apache.org/jira/browse/PARQUET-1805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17276198#comment-17276198 ] 

Junjie Chen edited comment on PARQUET-1805 at 2/1/21, 9:43 AM:
---------------------------------------------------------------

I think what [~yumwang] concern is we enable all columns' bloom filter when {{parquet.bloom.filter.enabled}} is set to true. That behaviour is a bit odd consider if we have a table with a heap of columns. We could change to use {{parquet.bloom.filter.enabled#column.path}} to enable the bloom filter for the specific column after setting {{parquet.bloom.filter.enabled}}.


was (Author: junjie):
I think what [~yumwang] concern is we enable all columns' bloom filter when {{parquet.bloom.filter.enabled}} is set to true. That behaviour is a bit odd, we could change to use {{parquet.bloom.filter.enabled#column.path}} to enable the bloom filter for the specific column after setting {{parquet.bloom.filter.enabled}}.

> Refactor the configuration for bloom filters
> --------------------------------------------
>
>                 Key: PARQUET-1805
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1805
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: Gabor Szadovszky
>            Assignee: Gabor Szadovszky
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.12.0
>
>
> Refactor the hadoop configuration for bloom filters according to PARQUET-1784.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)