Posted to issues@metron.apache.org by "Jon Zeolla (JIRA)" <ji...@apache.org> on 2016/11/03 17:26:58 UTC

[jira] [Updated] (METRON-544) Add an integrity checking field for large values

     [ https://issues.apache.org/jira/browse/METRON-544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jon Zeolla updated METRON-544:
------------------------------
    Description: 
If a value is longer than 32766, add an integrity checking field using a hash such as SHA3-256.  This provides a way to both validate the data and compare it against another store without a 32k limitation.  For instance, if the field is "uri", add a key of "uri_hash" and a value of the hash of the original field's value (pre-truncation).

Where this should be added is up for debate - I would initially suggest putting it upstream of the indexingBolt in the indexing topology (as a splitter), but I could see some arguments for putting it in the indexingBolt, or somewhere in the enrichments topology.

  was:
If a value is > 32766 long, add an integrity checking field using a hash such as SHA3-256.  This provides a method to both validate and compare the data against another store without a 32k limitation.  For instance, if the field is "uri", add a key of "uri_hash" and a value of the hash of the original field's value.  

Where this should be added is up for debate - I would initially suggest putting it upstream of the indexingBolt in the indexing topology (as a splitter), but I could see some arguments for putting it in the indexingBolt, or somewhere in the enrichments topology.


> Add an integrity checking field for large values
> ------------------------------------------------
>
>                 Key: METRON-544
>                 URL: https://issues.apache.org/jira/browse/METRON-544
>             Project: Metron
>          Issue Type: Sub-task
>            Reporter: Jon Zeolla
>            Priority: Minor
>
> If a value is longer than 32766, add an integrity checking field using a hash such as SHA3-256.  This provides a way to both validate the data and compare it against another store without a 32k limitation.  For instance, if the field is "uri", add a key of "uri_hash" and a value of the hash of the original field's value (pre-truncation).
> Where this should be added is up for debate - I would initially suggest putting it upstream of the indexingBolt in the indexing topology (as a splitter), but I could see some arguments for putting it in the indexingBolt, or somewhere in the enrichments topology.
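
To make the proposal concrete, here is a rough sketch of the hashing step described above. It is only an illustration under several assumptions, not Metron code: it is written in Python for brevity, the function name add_integrity_fields and the constants are hypothetical, the message is modeled as a flat dict, the 32766 limit is assumed to be measured in UTF-8 bytes, and the hash is assumed to be stored as a hex string. It does not address where in the topologies the step should live.

```python
import hashlib

# Assumption: the limit from the issue is counted in UTF-8 bytes.
MAX_VALUE_BYTES = 32766
# Suffix taken from the "uri" -> "uri_hash" example in the description.
HASH_SUFFIX = "_hash"


def add_integrity_fields(message: dict) -> dict:
    """Return a copy of message with a '<field>_hash' key for each oversized string value.

    Hypothetical helper: the hash is computed over the full value,
    before any truncation happens downstream.
    """
    result = dict(message)
    for key, value in message.items():
        if not isinstance(value, str):
            continue  # only string values are considered in this sketch
        raw = value.encode("utf-8")
        if len(raw) > MAX_VALUE_BYTES:
            # Hex digest is an assumption; the issue does not specify an encoding.
            result[key + HASH_SUFFIX] = hashlib.sha3_256(raw).hexdigest()
    return result


if __name__ == "__main__":
    msg = {"uri": "a" * 40000, "ip_src_addr": "10.0.0.1"}
    out = add_integrity_fields(msg)
    print(sorted(out))  # the oversized "uri" gains a "uri_hash" companion
```

Comparing against another store would then amount to recomputing the SHA3-256 of that store's full value and checking it against the "uri_hash" field.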


