Posted to issues@hive.apache.org by "Ayush Saxena (Jira)" <ji...@apache.org> on 2022/02/22 19:06:00 UTC

[jira] [Commented] (HIVE-25918) Invalid stats after multi inserting into the same partition

    [ https://issues.apache.org/jira/browse/HIVE-25918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17496278#comment-17496278 ] 

Ayush Saxena commented on HIVE-25918:
-------------------------------------

[~kkasa] / [~kgyrtkirk] The introduced test seems flaky; it is the only test failing in my PR. I also ran the flaky-check job for it.

Could you please take a look?

http://ci.hive.apache.org/job/hive-flaky-check/519/console

> Invalid stats after multi inserting into the same partition
> -----------------------------------------------------------
>
>                 Key: HIVE-25918
>                 URL: https://issues.apache.org/jira/browse/HIVE-25918
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Statistics
>            Reporter: Krisztian Kasa
>            Assignee: Krisztian Kasa
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 3h 40m
>  Remaining Estimate: 0h
>
> {code}
> create table source(p int, key int,value string);
> insert into source(p, key, value) values (101,42,'string42');
> create table stats_part(key int,value string) partitioned by (p int);
> from source
> insert into stats_part select key, value, p
> insert into stats_part select key, value, p;
> select count(*) from stats_part;
> {code}
> In this case {{StatsOptimizer}} helps serve this query, because the result should be the {{rowNum}} of the partition {{p=101}}. The result is
> {code}
> 1
> {code}
> however it should be
> {code}
> 2
> {code}
> because each insert branch inserts one record.
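
The expected value in the description above follows from simple accounting: each insert branch writes one row into partition p=101, so the row count recorded for that partition should be the sum over both branches. Below is a minimal Python sketch of that arithmetic. It is not Hive code; the function names are hypothetical, and the "last writer wins" failure mode is only an assumption for illustration, since the ticket text does not state the root cause.

```python
from collections import defaultdict

# Each insert branch reports (partition value, rows written).
# In the reproduction both branches write one row into p=101.
branch_reports = [(101, 1), (101, 1)]


def merge_by_sum(reports):
    """Expected accounting: add up the rows of every branch per partition."""
    stats = defaultdict(int)
    for partition, rows in reports:
        stats[partition] += rows
    return dict(stats)


def merge_last_writer_wins(reports):
    """Hypothetical faulty accounting: a later branch overwrites the earlier one."""
    stats = {}
    for partition, rows in reports:
        stats[partition] = rows
    return stats


expected = merge_by_sum(branch_reports)
faulty = merge_last_writer_wins(branch_reports)
print(expected[101])  # row count when branch counts are summed
print(faulty[101])    # row count when one branch overwrites the other
```

With the two reports above, summing gives 2 for p=101, which matches the expected count(*), while the overwrite variant gives 1, which matches the reported wrong result.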



--
This message was sent by Atlassian Jira
(v8.20.1#820001)