You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Tim Armstrong (Jira)" <ji...@apache.org> on 2020/12/22 22:16:00 UTC

[jira] [Created] (IMPALA-10405) Consider setting parquet_page_row_count_limit to 20000 by default

Tim Armstrong created IMPALA-10405:
--------------------------------------

             Summary: Consider setting parquet_page_row_count_limit to 20000 by default
                 Key: IMPALA-10405
                 URL: https://issues.apache.org/jira/browse/IMPALA-10405
             Project: IMPALA
          Issue Type: Improvement
          Components: Backend
            Reporter: Tim Armstrong


PARQUET-1414 did some experiments for parquet-mr that concluded that this setting would enhance page filtering without giving up much compression. It's the default over there.

We should probably just do the same in Impala because we already have that evidence that it's better and we can avoid it being a confounding factor.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)