You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Sorabh Hamirwasia (JIRA)" <ji...@apache.org> on 2017/09/29 20:15:03 UTC
[jira] [Created] (DRILL-5827) Create tests for detecting skew in
hash codes generated by Hash Functions
Sorabh Hamirwasia created DRILL-5827:
----------------------------------------
Summary: Create tests for detecting skew in hash codes generated by Hash Functions
Key: DRILL-5827
URL: https://issues.apache.org/jira/browse/DRILL-5827
Project: Apache Drill
Issue Type: Task
Reporter: Sorabh Hamirwasia
There have been few instances where the hash function used by Drill has produced skewed results on different data sets. It would be good to create some tests which can detect skew in hash codes produced by Hash Functions. This will help to avoid any regression based on changes for hash function usage or implementations. Creating data on fly in the tests like:
1) Set of random numbers.
2) Set of randomly generated strings
3) Set of random string with same prefix
4) Set of random string with same suffix
5) Set of continuous numbers.
And also adding Issue Data sets found during investigations in [DRILL-4237|https://issues.apache.org/jira/browse/DRILL-4237] / [DRILL-5816|https://issues.apache.org/jira/browse/DRILL-5816] / [DRILL-4119|https://issues.apache.org/jira/browse/DRILL-4119]
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)