You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Rajesh Balamohan (JIRA)" <ji...@apache.org> on 2015/04/21 01:13:59 UTC
[jira] [Commented] (TEZ-2279) Exception with pipelined shuffle in UnorderedKV case

    [ https://issues.apache.org/jira/browse/TEZ-2279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14503908#comment-14503908 ] 

Rajesh Balamohan commented on TEZ-2279:
---------------------------------------

Tried with the latest version of Hive (20-Apr) and it worked fine without issues. Will revisit this if the reported exception is encountered again.

> Exception with pipelined shuffle in UnorderedKV case
> ----------------------------------------------------
>
>                 Key: TEZ-2279
>                 URL: https://issues.apache.org/jira/browse/TEZ-2279
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>
> {noformat}
> Caused by: java.io.IOException: Reached EOF. Completed reading 1438635
>         at org.apache.tez.runtime.library.common.sort.impl.IFile.checkState(IFile.java:818)
>         at org.apache.tez.runtime.library.common.sort.impl.IFile$Reader.positionToNextRecord(IFile.java:699)
>         at org.apache.tez.runtime.library.common.sort.impl.IFile$Reader.readRawKey(IFile.java:732)
>         at org.apache.tez.runtime.library.common.sort.impl.IFile$Reader.nextRawKey(IFile.java:728)
>         at org.apache.tez.runtime.library.common.readers.UnorderedKVReader.readNextFromCurrentReader(UnorderedKVReader.java:151)
>         at org.apache.tez.runtime.library.common.readers.UnorderedKVReader.next(UnorderedKVReader.java:112)
>         at org.apache.hadoop.hive.ql.exec.tez.HashTableLoader.load(HashTableLoader.java:111)
>         ... 26 more
> ], TaskAttempt 1 failed, info=[Error: Failure while running task:java.lang.RuntimeException: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: java.io.IOException: Reached EOF. Completed reading 1605374
>         at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:186)
>         at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:138)
>         at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:330)
>         at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:180)
>         at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:172)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:422)
>         at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
>         at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:172)
>         at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:168)
>         at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>         at java.lang.Thread.run(Thread.java:745)
> {noformat}
> This happens only when there are spills happen in Unordered case.  Doesn't happen when the ratios are adjusted tez.task.scale.memory.ratios="PARTITIONED_UNSORTED_OUTPUT:3,UNSORTED_INPUT:3,UNSORTED_OUTPUT:3,SORTED_OUTPUT:12,SORTED_MERGED_INPUT:12,PROCESSOR:1,OTHER:4"
> This needs more investigation as it is hard to say if there is issue in hive/tez side for hashtable loading.  Creating this as a placeholder for further investigation.
> Scenario:
> =======
> 1. hive trunk (mar 13 build)
> 2. Tez 0.7 master
> 3. TPCDS Query 17 @ 10 TB scale



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)