Posted to issues@tez.apache.org by "Rajesh Balamohan (JIRA)" <ji...@apache.org> on 2015/04/28 11:47:06 UTC

[jira] [Updated] (TEZ-2377) RandomWriter ends up using TextOutputFormat instead of SequenceFileOutputFormat

     [ https://issues.apache.org/jira/browse/TEZ-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rajesh Balamohan updated TEZ-2377:
----------------------------------
    Attachment: TEZ-2377.1.patch

[~bikassaha], [~sseth], [~zjffdu] - Please review. The patch does not include a test. Tez previously checked "jobConf.getUseNewMapper()", which is why this worked in earlier releases; TEZ-2162 changed the check to "jobConf.getUseNewReducer()".
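
To illustrate why the choice of flag matters, here is a minimal, self-contained sketch. It is not Tez or Hadoop code: the class, the method pickOutputFormat, and the two configuration keys are hypothetical stand-ins, and the assumption that a map-only job written against the new API has the new-mapper flag set but not the new-reducer flag is an inference from the comment above, not something the patch is quoted as saying.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified model of the lookup; NOT actual Tez or Hadoop code.
class OutputFormatSelectionSketch {
    // Made-up keys standing in for the old-API and new-API output format settings.
    static final String OLD_API_KEY = "sketch.old.api.output.format";
    static final String NEW_API_KEY = "sketch.new.api.output.format";
    // Assumed default when the consulted key is unset.
    static final String DEFAULT_FORMAT = "TextOutputFormat";

    // Picks the key to read based on the "new API" flag, then falls back to the default.
    static String pickOutputFormat(Map<String, String> conf, boolean useNewApi) {
        String key = useNewApi ? NEW_API_KEY : OLD_API_KEY;
        return conf.getOrDefault(key, DEFAULT_FORMAT);
    }

    public static void main(String[] args) {
        // A map-only job configured through the new API sets only the new-API key.
        Map<String, String> conf = new HashMap<>();
        conf.put(NEW_API_KEY, "SequenceFileOutputFormat");

        boolean useNewMapper = true;   // assumed: the job's mapper uses the new API
        boolean useNewReducer = false; // assumed: no reducer, so this flag stays false

        // Checking the mapper flag finds the configured format.
        System.out.println(pickOutputFormat(conf, useNewMapper));
        // Checking the reducer flag reads the unset old-API key and falls back to the default.
        System.out.println(pickOutputFormat(conf, useNewReducer));
    }
}
```

Under these assumptions the first call returns SequenceFileOutputFormat and the second returns TextOutputFormat, which matches the symptom reported in the issue.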

> RandomWriter ends up using TextOutputFormat instead of SequenceFileOutputFormat
> -------------------------------------------------------------------------------
>
>                 Key: TEZ-2377
>                 URL: https://issues.apache.org/jira/browse/TEZ-2377
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>         Attachments: TEZ-2377.1.patch
>
>
> {code}
> yarn jar ./dist/tez/tez-tests-0.7.0-SNAPSHOT.jar randomwriter "-Dmapreduce.randomwriter.totalbytes=10737418" /tmp/test1
> {code}
> This writes the output using TextOutputFormat instead of SequenceFileOutputFormat.
> {code}
> yarn jar ./dist/tez/tez-tests-0.7.0-SNAPSHOT.jar sort "-Dmapreduce.framework.name=yarn-tez" -r 5 /tmp/test1 /tmp/test_sorted
> {code}
> This fails with the following error:
> {noformat}
> Failure while running task:java.io.IOException: hdfs://tez-vm:56565/tmp/test1/part-00000 not a SequenceFile
>         at org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1851)
>         at org.apache.hadoop.io.SequenceFile$Reader.initialize(SequenceFile.java:1811)
>         at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1760)
>         at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1774)
>         at org.apache.hadoop.mapreduce.lib.input.SequenceFileRecordReader.initialize(SequenceFileRecordReader.java:54)
>         at org.apache.tez.mapreduce.lib.MRReaderMapReduce.setupNewRecordReader(MRReaderMapReduce.java:149)
>         at org.apache.tez.mapreduce.lib.MRReaderMapReduce.<init>(MRReaderMapReduce.java:78)
>         at org.apache.tez.mapreduce.input.MRInput.initializeInternal(MRInput.java:475)
>         at org.apache.tez.mapreduce.input.MRInputLegacy.init(MRInputLegacy.java:108)
> {noformat}
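
The "not a SequenceFile" error above is raised when the file header does not match the expected format. As a rough diagnostic, the sketch below checks whether a local copy of an output file starts with the bytes 'S', 'E', 'Q', which is how a SequenceFile header begins. The class and method names are hypothetical, the check only inspects the first three bytes, and it assumes Java 11 or later for InputStream.readNBytes(int).

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper for illustration; not part of Hadoop or Tez.
class SequenceFileMagicCheck {
    // True if the given bytes begin with the 'S','E','Q' magic.
    static boolean looksLikeSequenceFile(byte[] header) {
        return header.length >= 3
                && header[0] == 'S' && header[1] == 'E' && header[2] == 'Q';
    }

    // Reads up to the first three bytes of a local file and checks them.
    static boolean looksLikeSequenceFile(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            return looksLikeSequenceFile(in.readNBytes(3));
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java SequenceFileMagicCheck.java <local-file>");
            return;
        }
        // Prints true only if the file starts with the SEQ magic.
        System.out.println(looksLikeSequenceFile(Path.of(args[0])));
    }
}
```

A plain text part file, such as one written by TextOutputFormat, would normally fail this check unless its first record happens to begin with "SEQ".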
