Posted to dev@hive.apache.org by "Ashutosh Chauhan (JIRA)" <ji...@apache.org> on 2012/12/05 00:08:58 UTC

[jira] [Reopened] (HIVE-3750) JDBCStatsPublisher fails when ID length exceeds length of ID column

     [ https://issues.apache.org/jira/browse/HIVE-3750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ashutosh Chauhan reopened HIVE-3750:
------------------------------------


This is causing quite a few test cases in TestParse and elsewhere to fail, because the golden files were not updated for the maxStatsKeyPrefixLength config, which now shows up in the outputs.
Please run all tests and update the golden files (assuming they need to be updated).
                
> JDBCStatsPublisher fails when ID length exceeds length of ID column
> -------------------------------------------------------------------
>
>                 Key: HIVE-3750
>                 URL: https://issues.apache.org/jira/browse/HIVE-3750
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>    Affects Versions: 0.11
>            Reporter: Kevin Wilfong
>            Assignee: Kevin Wilfong
>         Attachments: HIVE-3750.1.patch.txt
>
>
> When the length of the ID field passed to JDBCStatsPublisher exceeds the length of the column in the table (currently 255 characters), stats collection fails.  This causes the entire query to fail when hive.stats.reliable is set to true.
> One way to prevent this would be to calculate a deterministic, very-low-collision hash of the ID prefix used for aggregation and use that when the ID is too long.
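
For illustration only, the hashing idea in the description could be sketched roughly as follows. This is not the code from the attached patch. The class name, method names, and the choice of SHA-256 are assumptions made for the example, and the length threshold is passed in as a hypothetical parameter rather than read from any Hive config.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper, not part of Hive: shortens over-long stats key prefixes.
final class StatsKeySketch {

    // Returns the lowercase hex SHA-256 digest of the input (always 64 characters).
    static String sha256Hex(String input) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] digest = md.digest(input.getBytes(StandardCharsets.UTF_8));
            StringBuilder sb = new StringBuilder(digest.length * 2);
            for (byte b : digest) {
                sb.append(String.format("%02x", b));
            }
            return sb.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required on every Java platform, so this should not happen.
            throw new IllegalStateException(e);
        }
    }

    // Keeps the prefix as-is when it fits; otherwise replaces it with its hash.
    // The decision depends only on the prefix (an assumption of this sketch), so
    // a publisher and an aggregator applying the same rule derive the same key.
    static String shortenPrefix(String prefix, int maxPrefixLength) {
        if (prefix.length() <= maxPrefixLength) {
            return prefix;
        }
        return sha256Hex(prefix);
    }

    // Builds the full ID from the (possibly hashed) prefix and a per-task suffix.
    static String buildId(String prefix, String suffix, int maxPrefixLength) {
        return shortenPrefix(prefix, maxPrefixLength) + suffix;
    }
}
```

Because the hash is deterministic, every ID built from the same long prefix still shares a common leading string, so aggregation by prefix would keep working as long as the aggregator hashes its lookup prefix the same way. The threshold has to leave room for the suffix within the 255-character column.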

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira