Posted to issues@trafodion.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/10/06 05:27:02 UTC
[jira] [Commented] (TRAFODION-2766) HIVE: Inserted row into a hive table has gone missing

    [ https://issues.apache.org/jira/browse/TRAFODION-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16194162#comment-16194162 ] 

ASF GitHub Bot commented on TRAFODION-2766:
-------------------------------------------

GitHub user sandhyasun opened a pull request:

    https://github.com/apache/incubator-trafodion/pull/1259

    Fix for missing rows when doing dml to hive tables [TRAFODION-2766]

    Fix that makes the generated names of the underlying files more unique.
    The same problem caused the intermittent failure of seabase/TEST031, so the KNOWN diff file created for that test is removed.
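
To illustrate why file names that are not unique enough can make rows disappear, here is a minimal Python sketch. It is not the Trafodion code: the message does not describe the actual naming scheme, so the one-second timestamp scheme, the UUID suffix, the function names, and the timing of the three inserts are all assumptions chosen for illustration. The directory is modeled as a dict in which writing to an existing name overwrites the earlier file.

```python
import uuid

def coarse_name(table, epoch_seconds):
    # Hypothetical scheme: table name plus a one-second timestamp.
    return f"{table}_{epoch_seconds}"

def unique_name(table, epoch_seconds):
    # Same prefix plus a random UUID suffix, which makes a clash
    # between two writers vanishingly unlikely.
    return f"{table}_{epoch_seconds}_{uuid.uuid4().hex}"

def run_inserts(make_name):
    """Replay three inserts and return how many rows survive."""
    files = {}  # stand-in for the table's directory: file name -> rows
    # (timestamp, rows written); the last insert fails and writes nothing,
    # and is assumed to land in the same second as the one before it.
    batches = [
        (100, ["row1"]),
        (101, ["row2", "row3", "row4"]),
        (101, []),
    ]
    for second, rows in batches:
        files[make_name("t031hive1", second)] = rows  # create or overwrite
    return sum(len(rows) for rows in files.values())

coarse_total = run_inserts(coarse_name)
unique_total = run_inserts(unique_name)
print(coarse_total, unique_total)
```

With the coarse names, the second and third writes share a file name, so the empty third file replaces the three-row file; with the UUID suffix every insert keeps its own file.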

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/sandhyasun/incubator-trafodion traf_misc

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-trafodion/pull/1259.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1259
    
----
commit 749295ad774620d49db79d961b1bbacac8d589c7
Author: Sandhya Sundaresan <sa...@apache.org>
Date:   2017-09-11T17:52:42Z

    Changes to handle errors during drop of lob tables that leave the table inconsistent.

commit f6505ab79a7d0d7d1462b19c19a4133a10c9cdcb
Author: Sandhya Sundaresan <sa...@apache.org>
Date:   2017-10-02T19:57:13Z

    Merge remote branch 'origin/master' into traf_misc

commit f49b86f6c1c0ae17d5d4f3366b421e774f7ce56a
Author: Sandhya Sundaresan <sa...@apache.org>
Date:   2017-10-06T05:22:44Z

    Fix to make hive hdfs filenames more unique

----


> HIVE: Inserted row into a hive table has gone missing
> -----------------------------------------------------
>
>                 Key: TRAFODION-2766
>                 URL: https://issues.apache.org/jira/browse/TRAFODION-2766
>             Project: Apache Trafodion
>          Issue Type: Bug
>          Components: sql-exe
>    Affects Versions: 2.3-incubating
>            Reporter: Sandhya Sundaresan
>            Assignee: Sandhya Sundaresan
>
> A row inserted into a Hive table would go missing in various circumstances. In the following occurrence, the first 2 inserts executed fine, and a select afterwards showed that both rows were there:
> 1st insert: (C_CHAR10, P_CHAR10) = ('STR_1_01', 'STR_1_01')
> 2nd insert: (C_CHAR10, P_CHAR10) = (NULL, 'STR_1_02')
> The 3rd insert then added a new row, which also succeeded:
> 3rd insert: (C_CHAR10, P_CHAR10) = ('STR_1_03', NULL)
> But the select statement afterwards returned only 2 rows. The row from the 2nd insert, (C_CHAR10, P_CHAR10) = (NULL, 'STR_1_02'), went missing after the 3rd insert took place. This doesn't appear to be a select problem, since a final select from the Hive shell doesn't show the row from the 2nd insert either.
> Simpler test case:
> cqd hive_max_string_length_in_bytes '10';
> process hive statement 'drop table t031hive1';
> process hive statement 'create table t031hive1 (a int, b timestamp, c string)';
> insert into hive.hive.t031hive1 values ('1', '2017-01-01 10:10:10', 2);
> --select * from hive.hive.t031hive1;
> insert into hive.hive.t031hive1 values ('2', '2017-01-02 11:11:11', 3),
>                          ('3', '2017-01-03 11:11:11', 4),
>                          (4, timestamp '2017-01-04 11:11:11', '5');
>                                     
> --select * from hive.hive.t031hive1;
> insert into hive.hive.t031hive1 values (2, '2017-01-02 11:11:11', 'a'),
>                           (111111111111, '2017-01-03 11:11:11', 'b');
> select * from hive.hive.t031hive1;
> >>process hive statement 'create table t031hive1 (a int, b timestamp, c string)';
> --- SQL operation complete.
> >>insert into hive.hive.t031hive1 values ('1', '2017-01-01 10:10:10', 2);
> --- 1 row(s) inserted.
> >>insert into hive.hive.t031hive1 values ('2', '2017-01-02 11:11:11', 3),
> +>                          ('3', '2017-01-03 11:11:11', 4),
> +>                          (4, timestamp '2017-01-04 11:11:11', '5');
> --- 3 row(s) inserted.
> >>
> >>-- this insert should return overflow error
> >>insert into hive.hive.t031hive1 values (2, '2017-01-02 11:11:11', 'a'),
> +>                          (111111111111, '2017-01-03 11:11:11', 'b');
> *** ERROR[8411] A numeric overflow occurred during an arithmetic computation or data conversion. Conversion of Source Type:LARGEINT(REC_BIN64_SIGNED) Source Value:111111111111 to Target Type:INTEGER SIGNED(REC_BIN32_SIGNED).
> --- 0 row(s) inserted.
> >>
> >>select * from hive.hive.t031hive1;
> A            B                           C         
> -----------  --------------------------  ----------
>           1  2017-01-01 10:10:10.000000  2         
> --- 1 row(s) selected.
> >>
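
The ERROR[8411] in the transcript above is expected: the value 111111111111 does not fit in a 32-bit signed integer, which is the target type named in the error text (REC_BIN32_SIGNED). A small Python sketch of that range check follows; the helper name `fits_int32` is made up for illustration and is not part of Trafodion.

```python
# Bounds of a 32-bit signed integer.
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1

def fits_int32(value):
    # True when the value can be stored in a 32-bit signed column.
    return INT32_MIN <= value <= INT32_MAX

# Values supplied for column "a" in the last insert of the test case.
values = [2, 111111111111]
overflowing = [v for v in values if not fits_int32(v)]
print(overflowing)
```

The overflow itself is therefore correct behavior. The bug is that the final select returns 1 row even though the two earlier inserts reported 1 and 3 rows inserted.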



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)