You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Jonathan Turner Eagles (Jira)" <ji...@apache.org> on 2020/07/21 19:24:00 UTC

[jira] [Commented] (TEZ-4087) Shuffle: Fix shuffle cleanup to prevent thread leaks

    [ https://issues.apache.org/jira/browse/TEZ-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17162299#comment-17162299 ] 

Jonathan Turner Eagles commented on TEZ-4087:
---------------------------------------------

[~rajesh.balamohan], [~ashutoshc]. This commit has caused findbugs warning filed in TEZ-4203. Can either of you put up a patch to address?

> Shuffle: Fix shuffle cleanup to prevent thread leaks
> ----------------------------------------------------
>
>                 Key: TEZ-4087
>                 URL: https://issues.apache.org/jira/browse/TEZ-4087
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>            Priority: Major
>         Attachments: TEZ-4087.1.patch
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> In certain cases, Shuffle's cleanupIgnoreErrors() is not called. This leaves 4 threads (inmem, diskmerger, Referee, ShuffleAndMergeRunner) run forever.
> When these are run in long running processes (e.g LLAP in Hive), they reach the thread limits over time.
> Note: Root cause why cleanupIgnoreErrors() is not invoked is not yet known. I will share the details when i get more details on this. Creating this ticket to add additional safety knobs to ensure that thread leaks do not happen.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)