Posted to issues@tez.apache.org by "Bikas Saha (JIRA)" <ji...@apache.org> on 2013/12/20 13:18:12 UTC

[jira] [Resolved] (TEZ-682) TezGroupedSplits fails with empty (zero length) file

     [ https://issues.apache.org/jira/browse/TEZ-682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bikas Saha resolved TEZ-682.
----------------------------

       Resolution: Fixed
    Fix Version/s: 0.3.0
         Assignee: Bikas Saha

commit 27fd81e4fe1575bff6c32ad148f5cb9a71d0df20
Author: Bikas Saha <bi...@apache.org>
Date:   Fri Dec 20 04:17:05 2013 -0800

    TEZ-682. TezGroupedSplits fails with empty (zero length) file (bikas)


> TezGroupedSplits fails with empty (zero length) file
> ----------------------------------------------------
>
>                 Key: TEZ-682
>                 URL: https://issues.apache.org/jira/browse/TEZ-682
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.2.0, 0.3.0
>            Reporter: Gunther Hagleitner
>            Assignee: Bikas Saha
>             Fix For: 0.3.0
>
>         Attachments: TEZ-682.1.patch
>
>
> Running Hive on a directory that contains some zero-length files fails with:
> 2013-12-17 23:01:35,868 ERROR [AsyncDispatcher event handler] org.apache.tez.dag.app.dag.impl.VertexImpl: Vertex Input: bucket1_1 initializer failed
> java.lang.NullPointerException
>   at org.apache.hadoop.io.Text.encode(Text.java:443)
>   at org.apache.hadoop.io.Text.encode(Text.java:424)
>   at org.apache.hadoop.io.Text.writeString(Text.java:476)
>   at org.apache.hadoop.mapred.split.TezGroupedSplit.write(TezGroupedSplit.java:87)
>   at org.apache.tez.mapreduce.hadoop.MRHelpers.createSplitProto(MRHelpers.java:446)
>   at org.apache.tez.mapreduce.common.MRInputAMSplitGenerator.initialize(MRInputAMSplitGenerator.java:129)
>   at org.apache.tez.dag.app.dag.RootInputInitializerRunner$InputInitializerCallable.call(RootInputInitializerRunner.java:121)
>   at org.apache.tez.dag.app.dag.RootInputInitializerRunner$InputInitializerCallable.call(RootInputInitializerRunner.java:97)
>   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
>   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
>   at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
>   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
>   at java.lang.Thread.run(Thread.java:695)
> Deleting the empty files makes the error disappear.
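
The trace above shows the NullPointerException being thrown inside Text.writeString, called from TezGroupedSplit.write. That suggests a null string reached the serializer, although this message does not say which field was null or how the committed patch addresses it. The sketch below is not the TEZ-682 fix. It is a minimal illustration, using only java.io, of that failure mode and of one common guard: writing a presence flag before an optional string. The class NullSafeStringField and its methods are hypothetical names, and DataOutputStream.writeUTF stands in for Text.writeString as an assumption.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical helper for illustration only; not part of Tez or Hadoop.
class NullSafeStringField {

    // Writes a presence flag first, then the string only when it is non-null.
    static void write(DataOutputStream out, String value) throws IOException {
        out.writeBoolean(value != null);
        if (value != null) {
            out.writeUTF(value);
        }
    }

    // Mirrors write(): reads the flag, then the string if one was written.
    static String read(DataInputStream in) throws IOException {
        return in.readBoolean() ? in.readUTF() : null;
    }

    // Serializes and deserializes a value in memory so the guard can be checked.
    static String roundTrip(String value) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            try (DataOutputStream out = new DataOutputStream(buffer)) {
                write(out, value);
            }
            try (DataInputStream in =
                     new DataInputStream(new ByteArrayInputStream(buffer.toByteArray()))) {
                return read(in);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Reproduces the failure mode: an unguarded write of a null string
    // throws NullPointerException from inside the serializer.
    static boolean unguardedWriteFails(String value) {
        try (DataOutputStream out = new DataOutputStream(new ByteArrayOutputStream())) {
            out.writeUTF(value);
            return false;
        } catch (NullPointerException e) {
            return true;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

With this helper, NullSafeStringField.roundTrip(null) returns null instead of throwing, while unguardedWriteFails(null) reports the same kind of NullPointerException seen in the trace. Whether a guard at write time or avoiding the null value at split-grouping time is the better remedy depends on code that this message does not show.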


