You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Gidon Gershinsky (Jira)" <ji...@apache.org> on 2020/08/04 08:20:00 UTC

[jira] [Updated] (PARQUET-1376) Data obfuscation layer for encryption

     [ https://issues.apache.org/jira/browse/PARQUET-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gidon Gershinsky updated PARQUET-1376:
--------------------------------------
    Description: 
Data obfuscation in sensitive columns - for users without access to column encryption keys.
 # Implement on top of [basic Parquet encryption|https://github.com/apache/parquet-format/blob/encryption/Encryption.md] 
 # Built-in support for multiple masking mechanisms, with different trade-off between data utility, leakage, and size/throughput overhead
 # Provide interface for plug-in custom masking mechanism
 # Enable storing multiple masked versions of the same column in a file
 # Provide readers with explicit list of column’s masked versions in a file
 # Enable readers to select a masked version of a column
 # Stretch: Implement tools for analysis of file data privacy properties and information leakage
 # Stretch: Leverage privacy analysis tools for tuning file data anonymity
 # Optional: Support aggregated obfuscation

  was:
Anonymity layer for hidden columns
 # Different data masking options
 ** per-cell
 ** aggregated (average, etc)
 # Reader notification on data access status
 # Providing readers with a choice of masking options (if available)


> Data obfuscation layer for encryption
> -------------------------------------
>
>                 Key: PARQUET-1376
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1376
>             Project: Parquet
>          Issue Type: New Feature
>            Reporter: Gidon Gershinsky
>            Assignee: Gidon Gershinsky
>            Priority: Major
>
> Data obfuscation in sensitive columns - for users without access to column encryption keys.
>  # Implement on top of [basic Parquet encryption|https://github.com/apache/parquet-format/blob/encryption/Encryption.md] 
>  # Built-in support for multiple masking mechanisms, with different trade-off between data utility, leakage, and size/throughput overhead
>  # Provide interface for plug-in custom masking mechanism
>  # Enable storing multiple masked versions of the same column in a file
>  # Provide readers with explicit list of column’s masked versions in a file
>  # Enable readers to select a masked version of a column
>  # Stretch: Implement tools for analysis of file data privacy properties and information leakage
>  # Stretch: Leverage privacy analysis tools for tuning file data anonymity
>  # Optional: Support aggregated obfuscation



--
This message was sent by Atlassian Jira
(v8.3.4#803005)