You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@orc.apache.org by "Fang Zheng (JIRA)" <ji...@apache.org> on 2018/11/29 22:48:00 UTC
[jira] [Created] (ORC-442) [C++] Code improvements in Statistics
and Writer
Fang Zheng created ORC-442:
------------------------------
Summary: [C++] Code improvements in Statistics and Writer
Key: ORC-442
URL: https://issues.apache.org/jira/browse/ORC-442
Project: ORC
Issue Type: Improvement
Components: C++
Reporter: Fang Zheng
A few code changes in Statistics and Writer classes:
1. Change StatisticsImpl to use vector instead of list for storing ColumnStatistics. Because the required operations are push_back() in ctor, iteration in dtor, and random element access in getColumnStatistics(), and list does not support random access in constant time, vector would be more appropriate than list.
2. InternalBooleanStatistics is currently typedef-ed as InternalStatisticsImpl<uint64_t>. Since min/max/sum does not apply to BooleanColumnStatistics, we should define InternalBooleanStatistics to be InternalStatisticsImpl<char> to save 21 bytes per instance.
3. Misc. changes to ColumnWriter.hh, Writer.cc, Compression.hh, and Statistics.hh to fix typos in Doxygen and reduce object copies.
Please see PR for details.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)