You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Gopal V (JIRA)" <ji...@apache.org> on 2013/03/05 20:24:13 UTC

[jira] [Updated] (HIVE-4078) Delay the serialize-deserialize pair in CommonJoinResolver

     [ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gopal V updated HIVE-4078:
--------------------------

    Summary: Delay the serialize-deserialize pair in CommonJoinResolver  (was: Remove the serialize-deserialize pair in CommonJoinResolver)
    
> Delay the serialize-deserialize pair in CommonJoinResolver
> ----------------------------------------------------------
>
>                 Key: HIVE-4078
>                 URL: https://issues.apache.org/jira/browse/HIVE-4078
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Gopal V
>            Assignee: Gopal V
>         Attachments: HIVE-4078-20130227.patch, HIVE-4078.patch
>
>
> CommonJoinProcessor tries to clone a MapredWork while attempting a conversion to a map-join
> {code}
>   // deep copy a new mapred work from xml
>   InputStream in = new ByteArrayInputStream(xml.getBytes("UTF-8"));
>   MapredWork newWork = Utilities.deserializeMapRedWork(in, physicalContext.getConf());
> {code}
> which is a very heavy operation memory wise & cpu-wise.
> Instead of cloning via XMLEncoder, it is faster to use BeanUtils.cloneBean() which is following same data paths (get/set bean methods) instead.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira