You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Thomas Mueller (JIRA)" <ji...@apache.org> on 2015/03/19 11:23:38 UTC

[jira] [Updated] (OAK-2466) DataStoreBlobStore: chunk ids should not contain the size

     [ https://issues.apache.org/jira/browse/OAK-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Mueller updated OAK-2466:
--------------------------------
    Fix Version/s:     (was: 1.2)
                   1.3.0

> DataStoreBlobStore: chunk ids should not contain the size
> ---------------------------------------------------------
>
>                 Key: OAK-2466
>                 URL: https://issues.apache.org/jira/browse/OAK-2466
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: core
>            Reporter: Thomas Mueller
>            Assignee: Thomas Mueller
>             Fix For: 1.3.0
>
>
> The blob store garbage collection (data store garbage collection) uses the chunk ids to identify binaries to be deleted. The blob ids contain the size now (<contentHash>#<size>), and the blob id is currently equal to the chunk id.
> It would be more efficient to _not_ use the size, and instead just use the content hash, for the chunk ids. That way, enumerating the entries that are in the store is potentially faster. Also, it allows us to change the blob id in the future, for example add more information to it (for example the creation time, or the first few bytes of the content) if we ever want to.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)