You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Bikas Saha (JIRA)" <ji...@apache.org> on 2015/09/19 22:49:04 UTC

[jira] [Issue Comment Deleted] (TEZ-2834) tez app hangs at large scale (~30TB)

     [ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bikas Saha updated TEZ-2834:
----------------------------
    Comment: was deleted

(was: {color:red}-1 overall{color}.  Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12761258/TEZ-2834.2.patch
  against master revision 7ed7025.

    {color:green}+1 @author{color}.  The patch does not contain any @author tags.

    {color:green}+1 tests included{color}.  The patch appears to include 3 new or modified test files.

    {color:green}+1 javac{color}.  The applied patch does not increase the total number of javac compiler warnings.

    {color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

    {color:green}+1 findbugs{color}.  The patch does not introduce any new Findbugs (version 3.0.1) warnings.

    {color:green}+1 release audit{color}.  The applied patch does not increase the total number of release audit warnings.

    {color:red}-1 core tests{color}.  The patch failed these unit tests in :
                   org.apache.tez.test.TestFaultTolerance

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/1158//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/1158//console

This message is automatically generated.)

> tez app hangs at large scale (~30TB)
> ------------------------------------
>
>                 Key: TEZ-2834
>                 URL: https://issues.apache.org/jira/browse/TEZ-2834
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.7.1
>            Reporter: Rajesh Balamohan
>            Assignee: Bikas Saha
>         Attachments: DAG_view.png, TEZ-2834.1.patch, TEZ-2834.2.patch, application_1442254312093_0095.1.log.gz, application_1442254312093_0095.2.log.gz, hive_view.png
>
>
> Will attach the DAG.
> Repro for reference: TPC-DS q_70 @ 30 TB scale.
> "Map 7" completes in 2 waves. Output is very tiny, so reducer 8 gets launched slightly late.  But before "Reducer 9" can get scheduled, slots are taken up by "Map 1", which is not preempted for running "Reducer 9".
> This is with 0.7.1 codebase.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)