You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@hive.apache.org by "Vineet Garg (JIRA)" <ji...@apache.org> on 2018/04/24 22:59:00 UTC

[jira] [Commented] (HIVE-18448) Drop Support For Indexes From Apache Hive

    [ https://issues.apache.org/jira/browse/HIVE-18448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16451376#comment-16451376 ] 

Vineet Garg commented on HIVE-18448:
------------------------------------

[~kgyrtkirk] this patch turned {{hive.optimize.index.filter}} to false. This has disabled ORC ppd by default. We should turn if on again as it might cause performance regression

> Drop Support For Indexes From Apache Hive
> -----------------------------------------
>
>                 Key: HIVE-18448
>                 URL: https://issues.apache.org/jira/browse/HIVE-18448
>             Project: Hive
>          Issue Type: Improvement
>          Components: Indexing
>            Reporter: BELUGA BEHR
>            Assignee: Zoltan Haindrich
>            Priority: Minor
>              Labels: TODOC3.0
>             Fix For: 3.0.0
>
>         Attachments: HIVE-18448.01.patch, HIVE-18448.01wip02.patch, HIVE-18448.01wip03.patch, HIVE-18448.01wip04.patch, HIVE-18448.01wip05.patch
>
>
> If a user needs to look up a small subset of records quickly, they can use Apache HBase, if they need fast retrieval of larger sets of data, or fast joins, aggregations, they can use Apache Impala.  It seems to me that Hive indexes do not serve much of a role in the future of Hive.
> Even without moving workloads to other products, columnar file formats with their statistics achieve similar goals as Hive indexes.
> Please consider dropping Indexes from the Apache Hive project.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)