You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Mostafa Mokhtar (JIRA)" <ji...@apache.org> on 2015/04/14 22:06:58 UTC

[jira] [Updated] (HIVE-10331) ORC : Is null SARG filters out all row groups written in old ORC format

     [ https://issues.apache.org/jira/browse/HIVE-10331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mostafa Mokhtar updated HIVE-10331:
-----------------------------------
    Description: 
Queries are returning wrong results as all row groups gets filtered out and no rows get scanned.

{code}
SELECT 
  count(*)
    FROM
        store_sales
    WHERE
        ss_addr_sk IS NULL
{code}

With hive.optimize.index.filter disabled we get the correct results

In pickRowGroups stats show that hasNull_ is fales, while the rowgroup actually has null.
Same query runs fine for newly loaded ORC tables.

  was:
Queries are returning wrong results as all row groups gets filtered out and no rows get scanned.

{code}
SELECT 
  count(*)
    FROM
        store_sales
    WHERE
        ss_addr_sk IS NULL
{code}

With hive.optimize.index.filter disabled we get the correct results


> ORC : Is null SARG filters out all row groups written in old ORC format
> -----------------------------------------------------------------------
>
>                 Key: HIVE-10331
>                 URL: https://issues.apache.org/jira/browse/HIVE-10331
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>    Affects Versions: 1.1.0
>            Reporter: Mostafa Mokhtar
>            Assignee: Prasanth Jayachandran
>             Fix For: 1.2.0
>
>
> Queries are returning wrong results as all row groups gets filtered out and no rows get scanned.
> {code}
> SELECT 
>   count(*)
>     FROM
>         store_sales
>     WHERE
>         ss_addr_sk IS NULL
> {code}
> With hive.optimize.index.filter disabled we get the correct results
> In pickRowGroups stats show that hasNull_ is fales, while the rowgroup actually has null.
> Same query runs fine for newly loaded ORC tables.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)