You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2008/09/21 22:57:46 UTC

[jira] Updated: (HBASE-875) Use MurmurHash instead of JenkinsHash

     [ https://issues.apache.org/jira/browse/HBASE-875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrzej Bialecki  updated HBASE-875:
------------------------------------

    Attachment: murmur.patch

This patch adds MurmurHash as an option (used by default) alongside with JenkinsHash, and provides for backwards-compatibility with data created before this patch.

> Use MurmurHash instead of JenkinsHash
> -------------------------------------
>
>                 Key: HBASE-875
>                 URL: https://issues.apache.org/jira/browse/HBASE-875
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: util
>    Affects Versions: 0.19.0
>            Reporter: Andrzej Bialecki 
>         Attachments: murmur.patch
>
>
> I recently ported the MurmurHash (http://murmurhash.googlepages.com/) to Java, and according to my tests it's roughly 5 times faster than the current version of JenkinsHash in the trunk/ . According to the author (and other analysts at comp.sci.crypt) this hash has an excellent avalanche behavior, and low collision rate. I propose to either replace the JenkinsHash or add this hash as an option to be used in BloomFilter-s and related classes.
> If your opinion is positive, I'll prepare a patch. The Java implementation of the hash can be found here: http://www.getopt.org/murmur/MurmurHash.java

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.