Posted to dev@hive.apache.org by "Uma Maheswara Rao G (Jira)" <ji...@apache.org> on 2020/10/31 21:26:00 UTC

[jira] [Created] (HIVE-24342) isPathEncrypted should make sure resolved path also from HDFS

Uma Maheswara Rao G created HIVE-24342:
------------------------------------------

             Summary: isPathEncrypted should make sure resolved path also from HDFS
                 Key: HIVE-24342
                 URL: https://issues.apache.org/jira/browse/HIVE-24342
             Project: Hive
          Issue Type: Bug
          Components: HiveServer2, Shims
    Affects Versions: 3.1.2
            Reporter: Uma Maheswara Rao G
            Assignee: Uma Maheswara Rao G


Currently, isPathEncrypted verifies that a path is on HDFS only by checking that the path scheme is "hdfs".

With mounted ViewFileSystem-based file systems such as ViewFSOverloadScheme or ViewHDFS (HDFS-15289), we may also need to check that the resolved path is really on HDFS.

In the ViewHDFS case, we can mount hdfs://ns1/test ---> o3fs://b.v.ozone1/test

When a user runs a query against the path hdfs://ns1/test, isPathEncrypted treats the path as an HDFS path because it checks only the path scheme.

 
{code:java}
0: jdbc:hive2://umag-1.umag.root.xxx.site:218> select * from test30;
Error: Error while compiling statement: FAILED: SemanticException Unable to determine if hdfs://ns1/test is encrypted: java.lang.UnsupportedOperationException: This API:getEZForPath is specific to DFS. Can't run on other fs:o3fs://bucket.volume.ozone1 (state=42000,code=40000)
0: jdbc:hive2://umag-1.umag.root.xxx.site:218> cd Closing: 0: jdbc:hive2://umag-1.umag.root.xxx.site:2181,umag-2.umag.root.xxx.site:2181,umag-5.umag.root.xxx.site:2181/default;password=root;principal=hive/umag-5.umag.root.xxx.site@ROOT.HWX.SITE;retries=5;serviceDiscoveryMode=zooKeeper;user=root;zooKeeperNamespace=hiveserver2
{code}
 

So here we should use resolvePath to make sure the resolved path is really on HDFS. If the resolved path is not on HDFS (in the above case, it is an o3fs path), isPathEncrypted will return false.
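
The proposed check can be sketched as follows. This is only an illustrative model and not the actual Hive patch: MountTableSketch, mount, resolve, and shouldCheckEncryption are hypothetical names, and the map-based mount table stands in for what Hadoop's FileSystem#resolvePath would do against a real ViewFileSystem mount table.

```java
import java.net.URI;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for a ViewFileSystem-style mount table; not a Hadoop or Hive class.
class MountTableSketch {
    // Maps a mounted source prefix to its target prefix.
    private final Map<String, String> mounts = new LinkedHashMap<>();

    void mount(String source, String target) {
        mounts.put(source, target);
    }

    // Models the idea behind resolvePath: rewrite the path through the first
    // matching mount point, or return it unchanged when nothing matches.
    URI resolve(URI path) {
        String p = path.toString();
        for (Map.Entry<String, String> e : mounts.entrySet()) {
            String source = e.getKey();
            if (p.equals(source) || p.startsWith(source + "/")) {
                return URI.create(e.getValue() + p.substring(source.length()));
            }
        }
        return path;
    }

    static boolean isHdfs(URI uri) {
        return "hdfs".equalsIgnoreCase(uri.getScheme());
    }

    // The encryption-zone lookup is attempted only when both the given path
    // and the resolved path are on HDFS; otherwise the path is reported as
    // not encrypted without calling any DFS-specific API.
    boolean shouldCheckEncryption(URI path) {
        return isHdfs(path) && isHdfs(resolve(path));
    }

    public static void main(String[] args) {
        MountTableSketch table = new MountTableSketch();
        table.mount("hdfs://ns1/test", "o3fs://b.v.ozone1/test");
        System.out.println(table.shouldCheckEncryption(URI.create("hdfs://ns1/test")));
        System.out.println(table.shouldCheckEncryption(URI.create("hdfs://ns1/warehouse")));
    }
}
```

With the mount from the example above, hdfs://ns1/test resolves to an o3fs path, so the sketch skips the encryption-zone lookup, while an unmounted path such as hdfs://ns1/warehouse still goes through it.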

After this fix, the query passes:

 
{code:java}
0: jdbc:hive2://umag-1.umag.root.xxx.site:218> select * from test30;
INFO  : Compiling command(queryId=hive_20201031002253_1691548f-6fa8-4ea9-9cd4-87b70fe8f6bb): select * from test30
INFO  : No Stats for default@test30, Columns: item, user_id, state, order_id
INFO  : Semantic Analysis Completed (retrial = false)
INFO  : Created Hive schema: Schema(fieldSchemas:[FieldSchema(name:test30.order_id, type:bigint, comment:null), FieldSchema(name:test30.user_id, type:string, comment:null), FieldSchema(name:test30.item, type:string, comment:null), FieldSchema(name:test30.state, type:string, comment:null)], properties:null)
INFO  : Completed compiling command(queryId=hive_20201031002253_1691548f-6fa8-4ea9-9cd4-87b70fe8f6bb); Time taken: 4.47 seconds
INFO  : Executing command(queryId=hive_20201031002253_1691548f-6fa8-4ea9-9cd4-87b70fe8f6bb): select * from test30
INFO  : Completed executing command(queryId=hive_20201031002253_1691548f-6fa8-4ea9-9cd4-87b70fe8f6bb); Time taken: 0.09 seconds
INFO  : OK
+------------------+-----------------+--------------+---------------+
| test30.order_id  | test30.user_id  | test30.item  | test30.state  |
+------------------+-----------------+--------------+---------------+
| 1234             | u1              | iphone7      | CA            |
| 2345             | u1              | ipad         | CA            |
| 3456             | u2              | desktop      | NY            |
 
 
+------------------+-----------------+--------------+---------------+
11 rows selected (6.975 seconds)
{code}
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)