Posted to dev@hive.apache.org by "Carl Steinbach (JIRA)" <ji...@apache.org> on 2011/06/30 20:16:28 UTC

[jira] [Updated] (HIVE-2030) isEmptyPath() to use ContentSummary cache

     [ https://issues.apache.org/jira/browse/HIVE-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Carl Steinbach updated HIVE-2030:
---------------------------------

      Component/s: Query Processor
    Fix Version/s: 0.8.0

> isEmptyPath() to use ContentSummary cache
> -----------------------------------------
>
>                 Key: HIVE-2030
>                 URL: https://issues.apache.org/jira/browse/HIVE-2030
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Siying Dong
>            Assignee: Siying Dong
>            Priority: Minor
>             Fix For: 0.8.0
>
>         Attachments: HIVE-2030.1.patch, HIVE-2030.2.patch, HIVE-2030.3.patch
>
>
> addInputPaths() calls isEmptyPath() for every input path. Currently, every such call is a DFS namenode call. By making isEmptyPath() use the cached ContentSummary, we should be able to avoid some namenode calls and reduce latency when a query reads multiple partitions.
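
As a rough illustration of the idea in the description above, the following is a minimal sketch, not the code from the attached patches. It assumes a per-query cache keyed by path string and treats a path as empty when its cached total length is zero. The names SummaryCache, put(), remoteCalls() and the namenodeLookup callback are hypothetical stand-ins; a real implementation would work with Hadoop's Path and ContentSummary types rather than plain strings and longs.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.ToLongFunction;

// Hypothetical sketch: a cache of per-path content lengths that lets
// isEmptyPath() skip the remote lookup when a summary is already known.
final class SummaryCache {
    // Path string -> total length in bytes (stand-in for a ContentSummary).
    private final Map<String, Long> lengthByPath = new HashMap<>();
    // Stand-in for the namenode round trip; assumed to return the total length.
    private final ToLongFunction<String> namenodeLookup;
    // Counts how many times the remote lookup was actually used.
    private int remoteCalls = 0;

    SummaryCache(ToLongFunction<String> namenodeLookup) {
        this.namenodeLookup = namenodeLookup;
    }

    // Record a summary that some earlier step already fetched.
    void put(String path, long length) {
        lengthByPath.put(path, length);
    }

    // Answer from the cache when possible; otherwise do one remote lookup
    // and remember the result for later calls on the same path.
    boolean isEmptyPath(String path) {
        Long cached = lengthByPath.get(path);
        if (cached == null) {
            remoteCalls++;
            cached = namenodeLookup.applyAsLong(path);
            lengthByPath.put(path, cached);
        }
        return cached == 0L;
    }

    int remoteCalls() {
        return remoteCalls;
    }
}
```

With a cache like this, paths whose summaries were already collected earlier in the query cost no additional namenode call, and any other path costs at most one, which is where the latency saving for many partitions would come from.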

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira