Posted to issues@impala.apache.org by "Tim Armstrong (JIRA)" <ji...@apache.org> on 2017/09/20 20:45:01 UTC
[jira] [Created] (IMPALA-5965) Avoid per-value switch on
NeedsConversionInline() when decoding dictionary-encoded strings and
timestamps
Tim Armstrong created IMPALA-5965:
-------------------------------------
Summary: Avoid per-value switch on NeedsConversionInline() when decoding dictionary-encoded strings and timestamps
Key: IMPALA-5965
URL: https://issues.apache.org/jira/browse/IMPALA-5965
Project: IMPALA
Issue Type: Bug
Components: Backend
Affects Versions: Impala 2.10.0
Reporter: Tim Armstrong
Assignee: Tim Armstrong
Priority: Minor
I noticed a minor inefficiency in how the Parquet scanner handles NeedsConversion(). In cases where its result is not a runtime constant, such as dictionary-encoded strings and timestamps, we switch on it for every value. That is probably only a few instructions per value, but in this part of the code it matters.
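To illustrate the kind of change this suggests, here is a minimal C++ sketch. It is not the actual Impala scanner code: DictColumn, Convert(), DecodePerValue(), DecodeLoop() and DecodeHoisted() are hypothetical names, and it assumes the conversion flag is fixed for the whole batch of values but only known at runtime. The first loop tests the flag for every value; the second picks a template instantiation once, outside the loop, so the per-value check becomes a compile-time constant inside it.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for a dictionary-encoded column (not Impala's classes).
struct DictColumn {
  std::vector<std::string> dict;  // dictionary entries
  std::vector<uint32_t> codes;    // one dictionary index per value
  bool needs_conversion;          // assumed fixed per batch, known only at runtime
};

// Hypothetical conversion step: truncate the value to one character.
inline std::string Convert(const std::string& v) { return v.substr(0, 1); }

// Before: the runtime flag is tested once per decoded value.
std::vector<std::string> DecodePerValue(const DictColumn& col) {
  std::vector<std::string> out;
  out.reserve(col.codes.size());
  for (uint32_t code : col.codes) {
    const std::string& v = col.dict[code];
    if (col.needs_conversion) {
      out.push_back(Convert(v));
    } else {
      out.push_back(v);
    }
  }
  return out;
}

// After: the flag is a template parameter, so the branch inside the loop
// is on a compile-time constant and the compiler can drop the dead arm.
template <bool NEEDS_CONVERSION>
std::vector<std::string> DecodeLoop(const DictColumn& col) {
  std::vector<std::string> out;
  out.reserve(col.codes.size());
  for (uint32_t code : col.codes) {
    const std::string& v = col.dict[code];
    if (NEEDS_CONVERSION) {
      out.push_back(Convert(v));
    } else {
      out.push_back(v);
    }
  }
  return out;
}

// The runtime flag is inspected once per batch instead of once per value.
std::vector<std::string> DecodeHoisted(const DictColumn& col) {
  return col.needs_conversion ? DecodeLoop<true>(col) : DecodeLoop<false>(col);
}
```

Both versions produce the same output; the only difference is where the decision on the flag is made.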
I did a quick benchmark and saw scan time improve from ~2.25s to ~2.11s on this query:
{code}
use tpch_parquet;
set num_nodes=1;
set mt_dop=1;
select min(l_returnflag), min(l_linestatus) from biglineitem;
summary;
{code}