Posted to dev@impala.apache.org by "Tim Armstrong (Code Review)" <ge...@cloudera.org> on 2016/04/01 02:52:23 UTC

[Impala-CR](cdh5-trunk) PREVIEW ONLY: IMPALA-2737: per-partition processing in agg

Tim Armstrong has uploaded a new patch set (#2).

Change subject: PREVIEW ONLY: IMPALA-2737: per-partition processing in agg
......................................................................

PREVIEW ONLY: IMPALA-2737: per-partition processing in agg

Separates evaluation of agg input rows from insertion into the hash
table. This lets us radix-sort rows by hash value and process one
partition at a time.
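
As a rough illustration of the idea only (not the code in this patch),
the sketch below groups already-evaluated rows by the top bits of their
hash using a single counting-sort pass, which is one pass of a radix
sort. EvaluatedRow and PartitionByHash are hypothetical names made up
for this example; the patch's ExprValuesBuffer and PartitionedBatch are
not reproduced here, and the 32-bit hash width is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a row whose exprs have already been evaluated.
struct EvaluatedRow {
  uint32_t hash;   // hash of the grouping key, computed before insertion
  int64_t value;   // payload that would later be aggregated
};

// Returns row indices ordered by partition, where a row's partition is the
// top 'radix_bits' bits of its hash. On return, partition p occupies
// [(*partition_starts)[p], (*partition_starts)[p + 1]) in the result.
std::vector<size_t> PartitionByHash(const std::vector<EvaluatedRow>& rows,
    int radix_bits, std::vector<size_t>* partition_starts) {
  assert(radix_bits > 0 && radix_bits < 32);
  const size_t num_partitions = size_t{1} << radix_bits;
  const int shift = 32 - radix_bits;

  // Pass 1: histogram of partition sizes, offset by one slot so that the
  // prefix sum below turns counts into start offsets.
  std::vector<size_t> offsets(num_partitions + 1, 0);
  for (const EvaluatedRow& row : rows) ++offsets[(row.hash >> shift) + 1];
  for (size_t p = 0; p < num_partitions; ++p) offsets[p + 1] += offsets[p];
  *partition_starts = offsets;

  // Pass 2: scatter row indices into their partition's range. The scatter
  // is stable, so rows keep their input order within a partition.
  std::vector<size_t> next(offsets.begin(), offsets.end() - 1);
  std::vector<size_t> order(rows.size());
  for (size_t i = 0; i < rows.size(); ++i) {
    order[next[rows[i].hash >> shift]++] = i;
  }
  return order;
}
```

A caller would then walk one partition's index range at a time and do
the hash table inserts for just those rows, which keeps the working set
for each step small.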

This sets things up to do optimisations like prefetching for
aggregations and joins.

TODO: update comments with changed IR
TODO: improve performance - there is too much indirection with the
  different arrays in ExprValuesBuffer and PartitionedBatch.

NOTE: this depends on the LLVM 3.8 patch

Change-Id: I93adce55e6d9302dc191d95058440c8a9fa48f48
---
M be/src/codegen/gen_ir_descriptions.py
M be/src/codegen/impala-ir.cc
M be/src/exec/CMakeLists.txt
A be/src/exec/hash-partitioning-ir.cc
A be/src/exec/hash-partitioning.cc
A be/src/exec/hash-partitioning.h
M be/src/exec/hash-table-test.cc
M be/src/exec/hash-table.cc
M be/src/exec/hash-table.h
M be/src/exec/hash-table.inline.h
M be/src/exec/partitioned-aggregation-node-ir.cc
M be/src/exec/partitioned-aggregation-node.cc
M be/src/exec/partitioned-aggregation-node.h
M be/src/exec/partitioned-hash-join-node.cc
14 files changed, 969 insertions(+), 349 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/91/2691/2
-- 
To view, visit http://gerrit.cloudera.org:8080/2691
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I93adce55e6d9302dc191d95058440c8a9fa48f48
Gerrit-PatchSet: 2
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: Tim Armstrong <ta...@cloudera.com>