You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2020/02/03 23:18:00 UTC
[jira] [Created] (HIVE-22823) Support column masking policy on
nested columns
Quanlong Huang created HIVE-22823:
-------------------------------------
Summary: Support column masking policy on nested columns
Key: HIVE-22823
URL: https://issues.apache.org/jira/browse/HIVE-22823
Project: Hive
Issue Type: New Feature
Reporter: Quanlong Huang
Hive currently ignores column masking policies on nested columns. Only masking policies on top level columns are recognized.
For example, create a table with nested columns as:
{code:sql}
CREATE TABLE customers(
id int, name string,
email_preferences struct<email_format:string,frequency:string,categories:struct<promos:boolean,surveys:boolean>>,
addresses map<string,struct<street_1:string,street_2:string,city:string,state:string,zip_code:string>>,
orders array<struct<order_id:string,order_date:string,items:array<struct<product_id:int,sku:string,name:string,price:double,qty:int>>>>
) stored as ORC;
{code}
In Ranger, create a column masking policy on a nested column "email_preferences.email_format" to nullify the values. The policy won't be applied in queries on this table, neither query like "select email_preferences.email_format from customers" nor "select email_preferences from customers".
It's good to support column masking policies on nested columns.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)