Posted to dev@hive.apache.org by "Gang Tim Liu (JIRA)" <ji...@apache.org> on 2013/01/19 03:12:13 UTC
[jira] [Created] (HIVE-3917) Support fast operation for analyze command
Gang Tim Liu created HIVE-3917:
----------------------------------
Summary: Support fast operation for analyze command
Key: HIVE-3917
URL: https://issues.apache.org/jira/browse/HIVE-3917
Project: Hive
Issue Type: Improvement
Components: Statistics
Affects Versions: 0.11.0
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Hive supports the analyze command to gather statistics from existing tables and partitions: https://cwiki.apache.org/confluence/display/Hive/StatsDev#StatsDev-ExistingTables
It collects:
1. Number of Rows
2. Number of files
3. Size in Bytes
If the table or partition is large, the operation takes a long time because it opens every file and scans all the data.
It would be nice to support a fast operation that gathers only the statistics that do not require opening the files, such as:
1. Number of files
2. Size in Bytes
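
To illustrate why these two statistics are cheap, here is a minimal sketch in Python. It is not Hive code: it assumes a local directory stands in for a table or partition location, and the function name fast_stats is hypothetical. The file count and total size come from directory listings and file metadata only, so no file is opened or scanned. Counting rows, by contrast, would require reading the data.

```python
import os
import tempfile


def fast_stats(root):
    """Return (num_files, total_bytes) for all regular files under root.

    Hypothetical helper: relies on directory listings and stat metadata
    only, so no file content is ever read.
    """
    num_files = 0
    total_bytes = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            num_files += 1
            # st_size comes from file metadata, not from reading the file.
            total_bytes += os.stat(path).st_size
    return num_files, total_bytes


if __name__ == "__main__":
    # Build a tiny stand-in "partition" directory (an assumption for the demo).
    with tempfile.TemporaryDirectory() as tmp:
        for i, payload in enumerate([b"a" * 10, b"b" * 25]):
            with open(os.path.join(tmp, f"part-{i:05d}"), "wb") as f:
                f.write(payload)
        print(fast_stats(tmp))  # prints (2, 35)
```

On a distributed file system the same idea would presumably use the file system's own listing and status calls rather than os.walk and os.stat; that mapping is an assumption, not something stated in this issue.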
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira