You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2022/03/16 03:03:00 UTC
[jira] [Created] (IMPALA-11185) Reuse orc::ColumnVectorBatch in the scanner life-cycle
Quanlong Huang created IMPALA-11185:
---------------------------------------
Summary: Reuse orc::ColumnVectorBatch in the scanner life-cycle
Key: IMPALA-11185
URL: https://issues.apache.org/jira/browse/IMPALA-11185
Project: IMPALA
Issue Type: Improvement
Components: Backend
Reporter: Quanlong Huang
Assignee: Quanlong Huang
In HdfsOrcScanner::AssembleRows(), we always re-create a orc::ColumnVectorBatch. The ideal pattern is reusing the batch and only destroyed it when the scanner is closed.
In the flame graph of TPC-H Q1 collected by [~drorke] , the createRowBatch and destructors occupies almost half of the scanner time.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org