You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Allan Yan (JIRA)" <ji...@apache.org> on 2015/06/17 04:00:09 UTC

[jira] [Created] (HIVE-11033) BloomFilter index is not honored by ORC reader

Allan Yan created HIVE-11033:
--------------------------------

             Summary: BloomFilter index is not honored by ORC reader
                 Key: HIVE-11033
                 URL: https://issues.apache.org/jira/browse/HIVE-11033
             Project: Hive
          Issue Type: Bug
    Affects Versions: 1.2.0
            Reporter: Allan Yan


There is a bug in the org.apache.hadoop.hive.ql.io.orc.ReaderImpl class which caused the bloom filter index saved in the ORC file not being used. The reason is because the bloomFilterIndices variable defined in the SargApplier class superseded from its parent class.

Here is one way to fix it
{noformat}
18:46 $ diff src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java.original
174d173
<     bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()];
178c177
<           sarg, options.getColumnNames(), strideRate, types, included.length, bloomFilterIndices);
---
>           sarg, options.getColumnNames(), strideRate, types, included.length);
204a204
>     bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()];
673c673
<         List<OrcProto.Type> types, int includedCount, OrcProto.BloomFilterIndex[] bloomFilterIndices) {
---
>         List<OrcProto.Type> types, int includedCount) {
677c677
<       this.bloomFilterIndices = bloomFilterIndices;
---
>       bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()];
{noformat}





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)