You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Prasanth J (JIRA)" <ji...@apache.org> on 2014/10/17 20:40:34 UTC

[jira] [Created] (HIVE-8498) Insert into table misses some rows when vectorization is enabled

Prasanth J created HIVE-8498:
--------------------------------

             Summary: Insert into table misses some rows when vectorization is enabled
                 Key: HIVE-8498
                 URL: https://issues.apache.org/jira/browse/HIVE-8498
             Project: Hive
          Issue Type: Bug
          Components: Vectorization
    Affects Versions: 0.13.1, 0.14.0
            Reporter: Prasanth J
            Assignee: Jitendra Nath Pandey
            Priority: Critical


 Following is a small reproducible case for the issue

create table orc1
  stored as orc
  tblproperties("orc.compress"="ZLIB")
  as
    select rn
    from
    (
      select cast(1 as int) as rn from src limit 1
      union all
      select cast(100 as int) as rn from src limit 1
      union all
      select cast(10000 as int) as rn from src limit 1
    ) t;

create table orc_rn1 (rn int);
create table orc_rn2 (rn int);
create table orc_rn3 (rn int);

// These inserts should produce 3 rows but only 1 row is produced
from orc1 a
insert overwrite table orc_rn1 select a.* where a.rn < 100
insert overwrite table orc_rn2 select a.* where a.rn >= 100 and a.rn < 1000
insert overwrite table orc_rn3 select a.* where a.rn >= 1000;

select * from orc_rn1
union all
select * from orc_rn2
union all
select * from orc_rn3;

The expected output of the query is
1
100
10000

But with vectorization enabled we get
1



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)