You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Mithun Radhakrishnan (JIRA)" <ji...@apache.org> on 2015/02/20 03:57:11 UTC

[jira] [Created] (HIVE-9736) StorageBasedAuthProvider should batch namenode-calls where possible.

Mithun Radhakrishnan created HIVE-9736:
------------------------------------------

             Summary: StorageBasedAuthProvider should batch namenode-calls where possible.
                 Key: HIVE-9736
                 URL: https://issues.apache.org/jira/browse/HIVE-9736
             Project: Hive
          Issue Type: Bug
          Components: Metastore, Security
            Reporter: Mithun Radhakrishnan
            Assignee: Mithun Radhakrishnan


Consider a table partitioned by 2 keys (dt, region). Say a dt partition could have 10000 associated regions. Consider that the user does:
{code:sql}
ALTER TABLE my_table DROP PARTITION (dt='20150101');
{code}

As things stand now, {{StorageBasedAuthProvider}} will make individual {{DistributedFileSystem.listStatus()}} calls for each partition-directory, and authorize each one separately. It'd be faster to batch the calls, and examine multiple FileStatus objects at once.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)