You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Xin Hao (JIRA)" <ji...@apache.org> on 2015/02/03 08:40:34 UTC

[jira] [Created] (HIVE-9560) When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result in value '0' after running 'analyze table TABLE_NAME compute statistics;'

Xin Hao created HIVE-9560:
-----------------------------

             Summary: When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result in value '0' after running 'analyze table TABLE_NAME compute statistics;'
                 Key: HIVE-9560
                 URL: https://issues.apache.org/jira/browse/HIVE-9560
             Project: Hive
          Issue Type: Bug
            Reporter: Xin Hao


When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result in value '0' after running 'analyze table TABLE_NAME compute statistics;'

Reproduce step:
(1) set hive.stats.collect.rawdatasize=true;
(2) Generate an ORC table in hive, and the value of its 'rawDataSize' is NOT zero.
You can find the value of 'rawDataSize' (NOT zero) by executing  'describe extended TABLE_NAME;' 
(4) Execute 'analyze table TABLE_NAME compute statistics;'
(5) Execute  'describe extended TABLE_NAME;' again, and you will find that  the value of 'rawDataSize' will be changed to '0'.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)