You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/03/03 15:20:18 UTC

[jira] [Comment Edited] (TIKA-1663) Add a DigestingParser to add MD5/SHA-X hashes as fields in Metadata

    [ https://issues.apache.org/jira/browse/TIKA-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15177851#comment-15177851 ] 

Tim Allison edited comment on TIKA-1663 at 3/3/16 2:19 PM:
-----------------------------------------------------------

Thank you, Nick.  I somewhat prefer the first option (once we add the parameter setting).  I'm hesitant to promote the DigestingParser to a special place, but I'm game if the community is.

Oh, the other thing...I think I want to add options for encoding the digest bytes.  CommonCrawl is using Base32 of sha1...for example.



was (Author: tallison@mitre.org):
Thank you, Nick.  I somewhat prefer the first option (once we add the parameter setting).  I'm hesitant to promote the DigestingParser (wrapper) to a special place, but I'm game if the community is.


> Add a DigestingParser to add MD5/SHA-X hashes as fields in Metadata
> -------------------------------------------------------------------
>
>                 Key: TIKA-1663
>                 URL: https://issues.apache.org/jira/browse/TIKA-1663
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: digesting_parser_v1.patch
>
>
> It might be useful to integrate commons' DigestUtils and allow users to easily add the MD5 or other supported hashes to the Metadata object.
> Anyone else find this of use?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)