You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Rajesh Balamohan (JIRA)" <ji...@apache.org> on 2017/08/02 02:24:00 UTC

[jira] [Updated] (HIVE-17209) ObjectCacheFactory should return null when tez shared object registry is not setup

     [ https://issues.apache.org/jira/browse/HIVE-17209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rajesh Balamohan updated HIVE-17209:
------------------------------------
       Resolution: Fixed
     Hadoop Flags: Reviewed
    Fix Version/s: 3.0.0
           Status: Resolved  (was: Patch Available)

Created ORC-221 for orc related change and it got committed as well.

Thanks [~sershe]. Committed this patch to master.

> ObjectCacheFactory should return null when tez shared object registry is not setup
> ----------------------------------------------------------------------------------
>
>                 Key: HIVE-17209
>                 URL: https://issues.apache.org/jira/browse/HIVE-17209
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>            Priority: Minor
>             Fix For: 3.0.0
>
>         Attachments: HIVE-17209.1.patch
>
>
> HIVE-15269 introduced dynamic min/max bloom filter ("hive.tez.dynamic.semijoin.reduction=true"). This needs to access ObjectCache and in tez, ObjectCache can only be created by {{TezProcessor}}.
> In the following case {{AM --> splits --> OrcInputFormat.pickStripes::evaluatePredicateMinMax --> DynamicValue.getLiteral --> objectCache access}}, AM ends up throwing lots of NPE since AM has not created ObjectCache.  
> Orc reader catches these exceptions, skips PPD and proceeds further. For e.g, in Q95 it ends up throwing ~30,000 NPE before completing split information.
> ObjectCacheFactory should return null when tez shared object registry is not setup. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)