You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Ashutosh Chauhan (JIRA)" <ji...@apache.org> on 2014/04/15 01:33:14 UTC

[jira] [Commented] (HIVE-3077) Insert overwrite table doesn't fail for bucketed tables and breaks bucketing

    [ https://issues.apache.org/jira/browse/HIVE-3077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969002#comment-13969002 ] 

Ashutosh Chauhan commented on HIVE-3077:
----------------------------------------

Dupe of HIVE-6867 ?

> Insert overwrite table doesn't fail for bucketed tables and breaks bucketing
> ----------------------------------------------------------------------------
>
>                 Key: HIVE-3077
>                 URL: https://issues.apache.org/jira/browse/HIVE-3077
>             Project: Hive
>          Issue Type: Bug
>          Components: CLI
>    Affects Versions: 0.8.0, 0.8.1, 0.9.0, 0.9.1, 0.10.0
>         Environment: java version "1.6.0_30"
> hive version 0.9.0
> hadoop version 0.20.205.0
>            Reporter: Mark Grover
>            Assignee: Xuefu Zhang
>
> If table my_table is bucketed, the command "insert into table my_table ..." is supposed to give an error stating "Bucketized tables do not support INSERT INTO".
> However, it doesn't seem to do that in all cases.
> Consider the following example on Hive 0.9.0:
> create table src(x string) clustered by( x ) sorted by ( x ) into 32 buckets; 
> create table dest(x string) clustered by( x ) sorted by ( x ) into 32 buckets; 
> Now, put some data into x (after enable hive.enforce.bucketing and hive.enforce.sorting to be true).
> Then, do:
> insert into table dest select * from src; 
> This should fail since dest is a bucketized table. However, this succeeds creating a 33rd file inside the HDFS folder for the table, thereby corrupting it.
> This happens regardless of whether the src table is bucketed or not.



--
This message was sent by Atlassian JIRA
(v6.2#6252)