Posted to dev@hive.apache.org by "Gabriel C Balan (JIRA)" <ji...@apache.org> on 2015/05/02 00:01:11 UTC

[jira] [Created] (HIVE-10577) losing data loading into list bucketing table

Gabriel C Balan created HIVE-10577:
--------------------------------------

             Summary: losing data loading into list bucketing table
                 Key: HIVE-10577
                 URL: https://issues.apache.org/jira/browse/HIVE-10577
             Project: Hive
          Issue Type: Bug
          Components: Hive
    Affects Versions: 1.1.1
         Environment: linux
            Reporter: Gabriel C Balan


Some rows are lost when loading data into a list bucketing table whose skew column is of type string. When the skew column is of type float, all rows are loaded.

{code:title=src.txt}
1
1.1
110
{code}

{code:title=hive DDL/DML|borderStyle=solid}
set hive.input.format=org.apache.hadoop.hive.ql.io.HiveInputFormat;
set hive.mapred.supports.subdirectories=true;
set hive.optimize.listbucketing=true;
set mapred.input.dir.recursive=true;

drop table if exists src;
create table src (c1 string);
load data local inpath 'src.txt' overwrite into table src;

drop table if exists lb_str;
create table lb_str (c1 string)
skewed by (c1) on ('1.1','110','1') STORED AS DIRECTORIES;
insert overwrite table lb_str select * from src;

drop table if exists lb_float;
create table lb_float (c1 float)
skewed by (c1) on ('1.1','110','1') STORED AS DIRECTORIES;
insert overwrite table lb_float select * from src;

select * from lb_str;
select * from lb_float;
{code}

{code:title=hive cli|borderStyle=solid}
hive> select * from lb_str;
OK
1.1
110
Time taken: 0.071 seconds, Fetched: 2 row(s)
hive> select * from lb_float;
OK
1.1
1.0
110.0
{code}

No 'c1=1/' directory is created under 'lb_str/', and the row with value '1' is the one missing from the lb_str query output.
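
A possible way to confirm the loss and inspect the directory layout is sketched below. It is not part of the original report: the warehouse path is an assumption (the default hive.metastore.warehouse.dir and the default database) and should be replaced with the table location reported by "describe formatted". No expected output is shown because it was not captured in the report.

{code:title=verification sketch (assumed warehouse path)|borderStyle=solid}
-- Compare row counts between the source and the two list bucketing tables.
select count(*) from src;
select count(*) from lb_str;
select count(*) from lb_float;

-- List the source rows that have no match in lb_str.
select s.c1
from src s
left outer join lb_str l on (s.c1 = l.c1)
where l.c1 is null;

-- Show the skewed columns, skewed values and table location.
describe formatted lb_str;

-- List the skew directories; the path below is an assumption, use the
-- Location value printed by the previous command.
dfs -ls -R /user/hive/warehouse/lb_str;
{code}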