You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "László Bodor (Jira)" <ji...@apache.org> on 2020/09/07 08:04:00 UTC

[jira] [Commented] (TEZ-4230) LocalContainerLauncher can kill task future too early, causing app hang

    [ https://issues.apache.org/jira/browse/TEZ-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17191554#comment-17191554 ] 

László Bodor commented on TEZ-4230:
-----------------------------------

[~jeagles], [~jlowe]: could you please review this small patch? this is probably the last 0.10.0 blocker

long story short, I identified this with a flaky hive test, analyzed the logs, and with the fix, the test passed 500 times without an issue (details in previous comment)

> LocalContainerLauncher can kill task future too early, causing app hang
> -----------------------------------------------------------------------
>
>                 Key: TEZ-4230
>                 URL: https://issues.apache.org/jira/browse/TEZ-4230
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.9.2
>            Reporter: László Bodor
>            Assignee: László Bodor
>            Priority: Major
>             Fix For: 0.10.0, 0.9.3
>
>         Attachments: TEZ-4230.01.patch, TestCrudCompactorOnTez.log, TestCrudCompactorOnTez2.log, jstack.log, org.apache.hadoop.hive.ql.txn.compactor.TestCrudCompactorOnTez-output.txt
>
>
> Reproduced issue in ptest run which I made to run against tez staging artifacts (https://repository.apache.org/content/repositories/orgapachetez-1068/)
> http://ci.hive.apache.org/blue/organizations/jenkins/hive-precommit/detail/PR-1311/14/pipeline/417
> I'm about to investigate this. I think Tez 0.10.0 cannot be released until we won't confirm if it's a hive or tez bug.
> {code}
> mvn test -Pitests,hadoop-2 -Dtest=TestMmCompactorOnTez -pl ./itests/hive-unit
> {code}
> tez setup:
> https://github.com/apache/hive/commit/92516631ab39f39df5d0692f98ac32c2cd320997#diff-a22bcc9ba13b310c7abfee4a57c4b130R83-R97



--
This message was sent by Atlassian Jira
(v8.3.4#803005)