You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2020/02/03 23:18:00 UTC

[jira] [Created] (HIVE-22823) Support column masking policy on nested columns

Quanlong Huang created HIVE-22823:
-------------------------------------

             Summary: Support column masking policy on nested columns
                 Key: HIVE-22823
                 URL: https://issues.apache.org/jira/browse/HIVE-22823
             Project: Hive
          Issue Type: New Feature
            Reporter: Quanlong Huang


Hive currently ignores column masking policies on nested columns. Only masking policies on top level columns are recognized.

For example, create a table with nested columns as:
{code:sql}
CREATE TABLE customers(
  id int,   name string, 
  email_preferences struct<email_format:string,frequency:string,categories:struct<promos:boolean,surveys:boolean>>, 
 addresses map<string,struct<street_1:string,street_2:string,city:string,state:string,zip_code:string>>, 
  orders array<struct<order_id:string,order_date:string,items:array<struct<product_id:int,sku:string,name:string,price:double,qty:int>>>>
) stored as ORC;
{code}

In Ranger, create a column masking policy on a nested column "email_preferences.email_format" to nullify the values. The policy won't be applied in queries  on this table, neither query like "select email_preferences.email_format from customers" nor "select email_preferences from customers".

It's good to support column masking policies on nested columns.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)