You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@sentry.apache.org by "Alexander Kolbasov (JIRA)" <ji...@apache.org> on 2017/09/02 00:50:00 UTC

[jira] [Created] (SENTRY-1915) Sentry should use old PathDump structures to send full snapshots to HDFS

Alexander Kolbasov created SENTRY-1915:
------------------------------------------

             Summary: Sentry should use old PathDump structures to send full snapshots to HDFS
                 Key: SENTRY-1915
                 URL: https://issues.apache.org/jira/browse/SENTRY-1915
             Project: Sentry
          Issue Type: Bug
          Components: Sentry
    Affects Versions: 2.0.0
            Reporter: Alexander Kolbasov


It turns out that in 2.0 we changed the way full snapshots are sent from Sentry to HDFS. Before they were using {{HMSPaths}} which used tree structure and eliminated some duplication. Also SENTRY-1827 helped to compressed this on the serialization side.

Now we are using {{TPathChanges}} structure that is not tree-based and contains very non-efficient way of representing paths: {{required list<list<string>> addPaths;}} so we split each paths on slashes and store list of elements instead of sorting a tree





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)