Posted to issues-all@impala.apache.org by "Csaba Ringhofer (JIRA)" <ji...@apache.org> on 2019/01/14 15:53:00 UTC

[jira] [Updated] (IMPALA-2735) Push down conjunct evaluation into Parquet column readers

     [ https://issues.apache.org/jira/browse/IMPALA-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Csaba Ringhofer updated IMPALA-2735:
------------------------------------
    Labels: parquet performance  (was: performance)

> Push down conjunct evaluation into Parquet column readers
> ---------------------------------------------------------
>
>                 Key: IMPALA-2735
>                 URL: https://issues.apache.org/jira/browse/IMPALA-2735
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>    Affects Versions: Impala 2.3.0
>            Reporter: Tim Armstrong
>            Assignee: Michael Ho
>            Priority: Minor
>              Labels: parquet, performance
>
> Currently EvalConjuncts() operates on an entire row, so values must be materialised into the row format before any conjunct can be evaluated. In some cases we partially materialise each tuple before evaluating conjuncts, but this is not cache-efficient if we want to materialise column by column.
> In some cases, e.g. the Parquet scanner, it would help to evaluate conjuncts that reference only a single slot directly on a plain value outside the row format. We could then materialise a dense array of that column's values and filter out the rows that don't pass the conjuncts.
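
The idea described above can be sketched roughly as follows. This is an illustration only, written in Python rather than Impala's C++ backend, and it is not Impala's implementation: the function names (filter_column, refine_selection, materialise_rows), the dict-of-lists column layout, and the example predicates are all hypothetical. It assumes each conjunct references exactly one column, so it can be applied to that column's dense array on its own to produce a list of surviving row indices, and only those rows are later assembled into row format.

```python
from typing import Any, Callable, Dict, List, Sequence

# Hypothetical helper: apply a single-slot conjunct to one dense column
# and return the indices of the rows that pass.
def filter_column(values: Sequence[Any],
                  predicate: Callable[[Any], bool]) -> List[int]:
    return [i for i, v in enumerate(values) if predicate(v)]

# Hypothetical helper: apply a further single-slot conjunct, looking only
# at rows that survived the earlier conjuncts.
def refine_selection(values: Sequence[Any],
                     selected: Sequence[int],
                     predicate: Callable[[Any], bool]) -> List[int]:
    return [i for i in selected if predicate(values[i])]

# Hypothetical helper: build row-format output only for surviving rows.
def materialise_rows(columns: Dict[str, Sequence[Any]],
                     selected: Sequence[int]) -> List[Dict[str, Any]]:
    return [{name: col[i] for name, col in columns.items()}
            for i in selected]

# Example data: two columns stored as dense arrays (assumed layout).
columns = {
    "id": [1, 2, 3, 4, 5],
    "price": [10, 55, 30, 70, 5],
}

# Conjunct 1 references only "price"; conjunct 2 references only "id".
selected = filter_column(columns["price"], lambda p: p > 25)
selected = refine_selection(columns["id"], selected, lambda i: i != 3)

rows = materialise_rows(columns, selected)
```

In this sketch the rows that fail the first conjunct are never touched again, and no row is assembled until every single-slot conjunct has been applied, which is the saving the description is after.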



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
