You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Sergey (JIRA)" <ji...@apache.org> on 2013/08/02 12:47:48 UTC
[jira] [Created] (PIG-3409) org.apache.pig.data.DefaultTuple
hashcode perfomance issue
Sergey created PIG-3409:
---------------------------
Summary: org.apache.pig.data.DefaultTuple hashcode perfomance issue
Key: PIG-3409
URL: https://issues.apache.org/jira/browse/PIG-3409
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.11
Reporter: Sergey
Priority: Critical
I've met serious perfomance issue.
please see visualvm screenshot.
Here is hashCode implementation from the class:
{code}
@Override
public int hashCode() {
int hash = 17;
for (Iterator<Object> it = mFields.iterator(); it.hasNext();) {
Object o = it.next();
if (o != null) {
hash = 31 * hash + o.hashCode();
}
}
return hash;
}
{code}
I don't see any reason here to iterate over the whole tuple, aggregate hash value and then return it.
I can fix it, if it's possible to take part in dev process. I'm new to it :(
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira