You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Rajesh Balamohan (Jira)" <ji...@apache.org> on 2020/04/27 06:39:00 UTC

[jira] [Updated] (TEZ-4156) Fix Tez to reuse IPC connections

     [ https://issues.apache.org/jira/browse/TEZ-4156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rajesh Balamohan updated TEZ-4156:
----------------------------------
    Attachment: TEZ-4156.1.patch

> Fix Tez to reuse IPC connections
> --------------------------------
>
>                 Key: TEZ-4156
>                 URL: https://issues.apache.org/jira/browse/TEZ-4156
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Rajesh Balamohan
>            Priority: Major
>         Attachments: TEZ-4156.1.patch
>
>
> When tracking DAG progress, TezClientUtils ends up creating new remote user. Because of this new UGI creation, IPC connections are not reused internally.
> https://github.com/apache/tez/blob/master/tez-api/src/main/java/org/apache/tez/client/TezClientUtils.java#L965
> More info from Hadoop side:
> In hadoop's IPC layer, connectionIds are checked based on UserGroupInformation.
> https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Client.java#L1600
> However, UserGroupInformation comparison is based on ==
> https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/security/UserGroupInformation.java#L1789



--
This message was sent by Atlassian Jira
(v8.3.4#803005)