You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Sergey Shelukhin (JIRA)" <ji...@apache.org> on 2018/08/22 00:09:00 UTC
[jira] [Created] (HIVE-20434) analyze on an empty partition doesn't
produce correct stats
Sergey Shelukhin created HIVE-20434:
---------------------------------------
Summary: analyze on an empty partition doesn't produce correct stats
Key: HIVE-20434
URL: https://issues.apache.org/jira/browse/HIVE-20434
Project: Hive
Issue Type: Bug
Reporter: Sergey Shelukhin
Assignee: Ashutosh Chauhan
{noformat}
set hive.stats.dbclass=fs;
set hive.stats.fetch.column.stats=true;
set datanucleus.cache.collections=false;
set hive.merge.mapfiles=false;
set hive.merge.mapredfiles=false;
set hive.mapred.mode=nonstrict;
set hive.stats.autogather=true;
set hive.stats.column.autogather=true;
set hive.compute.query.using.stats=true;
set hive.explain.user=false;
set hive.fetch.task.conversion=none;
set hive.query.results.cache.enabled=false;
create table stats_part(key int,value string) partitioned by (p int) tblproperties ("transactional"="false");
insert into table stats_part partition(p=101) values (1, "foo");
explain select count(key) from stats_part; -- <== stats are correct
ALTER TABLE stats_part CHANGE COLUMN key key2 int;
explain select count(key2) from stats_part; -- <== stats are now incorrect
analyze table stats_part partition(p) compute statistics for columns;
explain select count(key2) from stats_part; -- <== stats are now correct again
alter table stats_part add partition(p=105);
explain select count(key2) from stats_part; -- <== stats are now incorrect
analyze table stats_part partition(p) compute statistics for columns;
explain select count(key2) from stats_part; -- <== stats are still incorrect!
drop table stats_part;
{noformat}
Seems like it's impossible to have correct stats on a table with an empty partition.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)