Posted to dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2015/05/07 20:26:00 UTC
[jira] [Commented] (SOLR-7510) UpdateProcessor to compute a murmur3
hash of a field at index time
[ https://issues.apache.org/jira/browse/SOLR-7510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533141#comment-14533141 ]
Hoss Man commented on SOLR-7510:
--------------------------------
My basic thinking so far...
* FieldMutatingUpdateProcessor
* by default mutates no fields
* typical usage would be after CloneFieldUpdateProcessor
* looks at each field value given, and uses instanceof to pick the best method to call on the HashFunction
** if not a simple primitive, defaults to toString() then hash
** so for optimal hashing of numerics, users should put this after the appropriate Parse(Numeric)UpdateProcessor
*** slightly cumbersome, but mainly targeted at string fields anyway, since that's where pre-computing the hash values matters most
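
The instanceof dispatch described above could look roughly like the sketch below. Everything in it is an assumption for illustration: SketchHashFunction is a hypothetical stand-in for whatever HashFunction API ends up being used, FieldValueHasher is a made-up helper name, and FNV-1a is used only so the example runs without dependencies (it is not murmur3).

```java
import java.nio.charset.StandardCharsets;

// Hypothetical stand-in for a hash function with one method per input type.
// Not a Solr or Guava class; the names here are illustrative assumptions.
interface SketchHashFunction {
    long hashInt(int value);
    long hashLong(long value);
    long hashString(String value);
}

// FNV-1a (64 bit), used only so the sketch runs without dependencies.
// It is NOT murmur3, which is what the issue actually proposes.
final class Fnv1a64 implements SketchHashFunction {
    private static final long OFFSET_BASIS = 0xcbf29ce484222325L;
    private static final long PRIME = 0x100000001b3L;

    private static long hashBytes(byte[] bytes) {
        long h = OFFSET_BASIS;
        for (byte b : bytes) {
            h ^= (b & 0xffL);
            h *= PRIME;
        }
        return h;
    }

    // Serialize the low `width` bytes of value, least significant byte first.
    private static byte[] littleEndian(long value, int width) {
        byte[] out = new byte[width];
        for (int i = 0; i < width; i++) {
            out[i] = (byte) (value >>> (8 * i));
        }
        return out;
    }

    @Override
    public long hashInt(int value) {
        return hashBytes(littleEndian(value, 4));
    }

    @Override
    public long hashLong(long value) {
        return hashBytes(littleEndian(value, 8));
    }

    @Override
    public long hashString(String value) {
        return hashBytes(value.getBytes(StandardCharsets.UTF_8));
    }
}

// Hypothetical helper: picks the most specific hash method for a field value.
final class FieldValueHasher {
    private FieldValueHasher() {}

    static long hashValue(SketchHashFunction hf, Object value) {
        if (value instanceof Integer) {
            return hf.hashInt((Integer) value);
        }
        if (value instanceof Long) {
            return hf.hashLong((Long) value);
        }
        if (value instanceof Float) {
            return hf.hashInt(Float.floatToIntBits((Float) value));
        }
        if (value instanceof Double) {
            return hf.hashLong(Double.doubleToLongBits((Double) value));
        }
        // Not a simple primitive: fall back to toString() and hash that.
        return hf.hashString(String.valueOf(value));
    }
}
```

With this shape, an Integer 42 goes through hashInt while the unparsed string "42" goes through hashString, which is why the ordering relative to the numeric parsing processors matters.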
> UpdateProcessor to compute a murmur3 hash of a field at index time
> ------------------------------------------------------------------
>
> Key: SOLR-7510
> URL: https://issues.apache.org/jira/browse/SOLR-7510
> Project: Solr
> Issue Type: Sub-task
> Reporter: Hoss Man
>
> SOLR-6968 is adding HyperLogLog support to stats component. HLL accuracy depends on having good (long) hash values -- these can be computed at query time, but we should give users a simple option to compute them at index time for efficiency (especially with things like String hashing)