Posted to dev@madlib.apache.org by Frank McQuillan <fm...@pivotal.io> on 2018/10/12 00:31:54 UTC

Data anonymization feature

Hi,

Given the recent GDPR policy
https://eugdpr.org/
some community members might be wondering about anonymization and data
privacy features that we could add to MADlib.

There is a JIRA here
https://issues.apache.org/jira/browse/MADLIB-911
and it would be great if folks could add comments or thoughts on which
direction to take.

Porting the PDL Tools library function
http://pivotalsoftware.github.io/PDLTools/group__grp__anonymization.html
seems like it might be useful, but it is probably just a minimum.  For one
thing, I do not know what kind of hashing it does.
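
To illustrate why the kind of hashing matters, here is a minimal Python
sketch. It is not the PDL Tools implementation, and it is not a proposed
MADlib API; the function names, the key, and the sample values are all
made up for the example. It contrasts a plain SHA-256 hash, which can be
reversed by hashing every candidate value when the identifier space is
small, with a keyed hash (HMAC-SHA256), which cannot be matched that way
without the secret key.

```python
import hashlib
import hmac
from typing import Iterable, Optional


def plain_hash(value: str) -> str:
    """Unkeyed SHA-256 hex digest of value (hypothetical helper)."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()


def keyed_hash(value: str, key: bytes) -> str:
    """HMAC-SHA256 hex digest of value under a secret key (hypothetical helper)."""
    return hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()


def guess_from_plain_hash(digest: str, candidates: Iterable[str]) -> Optional[str]:
    """Try to recover the original value by hashing each candidate.

    Returns the matching candidate, or None if no candidate hashes to digest.
    """
    for candidate in candidates:
        if plain_hash(candidate) == digest:
            return candidate
    return None


if __name__ == "__main__":
    # Assumed sample data: a small, guessable identifier space.
    candidates = ["555-0100", "555-0101", "555-0102"]
    secret_key = b"example-key-kept-outside-the-database"  # assumption

    # The plain hash is recovered by simple enumeration.
    print(guess_from_plain_hash(plain_hash("555-0101"), candidates))

    # The keyed hash does not match any plain hash of the candidates.
    print(guess_from_plain_hash(keyed_hash("555-0101", secret_key), candidates))
```

Both variants map the same input to the same output, so joins and group-bys
on the hashed column still work. The keyed variant only helps as long as the
key is stored separately from the data.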

More comprehensive integrations like
https://arx.deidentifier.org/
as suggested in the JIRA would be great but a lot of work.

Frank