You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Mostafa Mokhtar (JIRA)" <ji...@apache.org> on 2014/10/20 23:33:34 UTC
[jira] [Created] (HIVE-8524) When table is renamed stats are lost
as changes are not propagated to metastore tables TAB_COL_STATS and
PART_COL_STATS
Mostafa Mokhtar created HIVE-8524:
-------------------------------------
Summary: When table is renamed stats are lost as changes are not propagated to metastore tables TAB_COL_STATS and PART_COL_STATS
Key: HIVE-8524
URL: https://issues.apache.org/jira/browse/HIVE-8524
Project: Hive
Issue Type: Bug
Components: Metastore
Affects Versions: 0.14.0
Reporter: Mostafa Mokhtar
Assignee: Gunther Hagleitner
Fix For: 0.14.0
When a Hive table is renamed that the name is not updated in TAB_COL_STATS and PART_COL_STATS.
Repro
1) Create table
2) insert rows
3) Analyze table t1 compute statistics for columns;
4) set hive.stats.fetch.column.stats=true;
5) Explain select * from t1 where c1 > x
6) ALTER TABLE t1 RENAME TO 2;
7) Explain select * from t2 where c1 > x ; /* stats will be missing */
8) Query the Metastore tables to validate
According to the documentation Metastore should be updated
{code}
This statement lets you change the name of a table to a different name.
As of version 0.6, a rename on a managed table moves its HDFS location as well. (Older Hive versions just renamed the table in the metastore without moving the HDFS location.)
{code}
Another related issue is that the schema of the stats table is not consistent with TBLS and DBS as these two table are normalized while TAB_COL_STATS and PART_COL_STATS have TABLE_NAME and DB_NAME denormalized in them.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)