You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2022/03/16 03:03:00 UTC

[jira] [Created] (IMPALA-11185) Reuse orc::ColumnVectorBatch in the scanner life-cycle

Quanlong Huang created IMPALA-11185:
---------------------------------------

             Summary: Reuse orc::ColumnVectorBatch in the scanner life-cycle
                 Key: IMPALA-11185
                 URL: https://issues.apache.org/jira/browse/IMPALA-11185
             Project: IMPALA
          Issue Type: Improvement
          Components: Backend
            Reporter: Quanlong Huang
            Assignee: Quanlong Huang


In HdfsOrcScanner::AssembleRows(), we always re-create a orc::ColumnVectorBatch. The ideal pattern is reusing the batch and only destroyed it when the scanner is closed.

In the flame graph of TPC-H Q1 collected by [~drorke] , the createRowBatch and destructors occupies almost half of the scanner time.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org