Posted to issues@hive.apache.org by "zgl (JIRA)" <ji...@apache.org> on 2018/02/01 18:53:01 UTC

[jira] [Assigned] (HIVE-4078) Delay the serialize-deserialize pair in CommonJoinTaskDispatcher

     [ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

zgl reassigned HIVE-4078:
-------------------------

    Assignee: zgl  (was: Gopal V)

> Delay the serialize-deserialize pair in CommonJoinTaskDispatcher
> ----------------------------------------------------------------
>
>                 Key: HIVE-4078
>                 URL: https://issues.apache.org/jira/browse/HIVE-4078
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Gopal V
>            Assignee: zgl
>            Priority: Major
>              Labels: client, perfomance
>             Fix For: 0.11.0
>
>         Attachments: HIVE-4078-20130305.2.patch, HIVE-4078-20130305.patch, HIVE-4078-20130406.patch, HIVE-4078-trunk-rebase.patch
>
>
> CommonJoinProcessor tries to clone a MapredWork while attempting a conversion to a map-join
> {code}
>   // deep copy a new mapred work from xml
>   InputStream in = new ByteArrayInputStream(xml.getBytes("UTF-8"));
>   MapredWork newWork = Utilities.deserializeMapRedWork(in, physicalContext.getConf());
> {code}
> which is a very expensive operation in terms of both memory and CPU.
> It would be better to defer the copy until a conditional task is actually required, since only then is a separate copy of the task needed.
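
The idea of delaying the serialize-deserialize pair can be illustrated with a minimal, self-contained sketch. It is not the Hive patch: `Work`, `planJoin`, `deepCopy` and `copiesMade` are hypothetical names, and plain Java serialization stands in for Hive's XML round trip. It only shows the shape of the change, assuming the caller already knows whether a conditional task is needed.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.ArrayList;
import java.util.List;

public class DeferredCopySketch {

    /** Hypothetical stand-in for a plan object such as MapredWork. */
    static class Work implements Serializable {
        private static final long serialVersionUID = 1L;
        final List<String> aliases = new ArrayList<>();
    }

    /** Counts deep copies so the effect of deferring them is observable. */
    static int copiesMade = 0;

    /** Deep copy via a serialize-deserialize round trip (the expensive step). */
    @SuppressWarnings("unchecked")
    static <T extends Serializable> T deepCopy(T original) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(original);
            }
            try (ObjectInputStream in =
                     new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                copiesMade++;
                return (T) in.readObject();
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException("deep copy failed", e);
        }
    }

    /**
     * Returns the work to run. The copy is made only on the branch that
     * actually needs a second, independent plan (the conditional task).
     */
    static Work planJoin(Work current, boolean needsConditionalTask) {
        if (!needsConditionalTask) {
            return current; // no conversion candidate: skip the round trip
        }
        Work copy = deepCopy(current);
        copy.aliases.add("map-join-variant"); // mutate the copy, not the original
        return copy;
    }

    public static void main(String[] args) {
        Work work = new Work();
        work.aliases.add("a");

        Work same = planJoin(work, false);
        Work copy = planJoin(work, true);

        System.out.println("same instance reused: " + (same == work));
        System.out.println("separate copy made:   " + (copy != work));
        System.out.println("deep copies made:     " + copiesMade);
    }
}
```

In this sketch the eager version would call `deepCopy` before the `needsConditionalTask` check, paying the cost on every call; moving it after the check is the whole change.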


