You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Dheeraj Khanna (JIRA)" <ji...@apache.org> on 2016/06/21 23:25:58 UTC

[jira] [Commented] (OAK-3140) DataStore / BlobStore: add a method to pass a "type" when writing

    [ https://issues.apache.org/jira/browse/OAK-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15343021#comment-15343021 ] 

Dheeraj Khanna commented on OAK-3140:
-------------------------------------

[~tmueller]
{quote}Disable calculating the content hash (de-duplication) for some binaries. {quote}
Would it make sense to calculate the hash before uploading the binary itself, may be using some kind of preprocessing. 
Use case: in case of large assets (>5GB) the file gets uploaded first and then the user gets to know that this is a duplicate file, if this can be done in advance, the user will not have to wait for a large upload to finish (which could take many minutes) to get this information.

> DataStore / BlobStore: add a method to pass a "type" when writing
> -----------------------------------------------------------------
>
>                 Key: OAK-3140
>                 URL: https://issues.apache.org/jira/browse/OAK-3140
>             Project: Jackrabbit Oak
>          Issue Type: New Feature
>          Components: blob
>            Reporter: Thomas Mueller
>            Assignee: Thomas Mueller
>              Labels: performance
>
> Currently, the BlobStore interface has a method "String writeBlob(InputStream in)". This issue is about adding a new method "String writeBlob(String type, InputStream in)", for the following reasons (in no particular order):
> * Store some binaries (for example Lucene index files) in a different place, in order to safely and quickly run garbage collection just on those files.
> * Store some binaries in a slow, some in a fast storage or location.
> * Disable calculating the content hash (de-duplication) for some binaries.
> * Store some binaries in a shared storage (for fast cross-repository copying), and some in local storage.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)