You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@oozie.apache.org by "Ryota Egashira (JIRA)" <ji...@apache.org> on 2015/07/10 20:33:04 UTC

[jira] [Updated] (OOZIE-2135) Partition existence check by HCatClient.getPartitions causes high load on HCatalog

     [ https://issues.apache.org/jira/browse/OOZIE-2135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ryota Egashira updated OOZIE-2135:
----------------------------------
    Description: 
HCatClient.getPartitions is used to check partition existence in Oozie, but it is known to be inefficient for large table with many partitions. Hive team working on new method, partitionExists, which has performance optimization, and Oozie should switch to this once committed.
[~mithun]  could you please share Hive JIRA for reference? 

> Partition existence check by HCatClient.getPartitions causes high load on HCatalog
> ----------------------------------------------------------------------------------
>
>                 Key: OOZIE-2135
>                 URL: https://issues.apache.org/jira/browse/OOZIE-2135
>             Project: Oozie
>          Issue Type: Bug
>            Reporter: Ryota Egashira
>            Assignee: Ryota Egashira
>
> HCatClient.getPartitions is used to check partition existence in Oozie, but it is known to be inefficient for large table with many partitions. Hive team working on new method, partitionExists, which has performance optimization, and Oozie should switch to this once committed.
> [~mithun]  could you please share Hive JIRA for reference? 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)