Posted to issues@tez.apache.org by "Hitesh Shah (JIRA)" <ji...@apache.org> on 2015/05/14 22:39:01 UTC

[jira] [Commented] (TEZ-2339) randomwriter generates sequence file which does not have "SEQ6" in the beginning (causes exceptions while reading)

    [ https://issues.apache.org/jira/browse/TEZ-2339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14544327#comment-14544327 ] 

Hitesh Shah commented on TEZ-2339:
----------------------------------

Is this still an issue? 

> randomwriter generates sequence file which does not have "SEQ6" in the beginning (causes exceptions while reading)
> ------------------------------------------------------------------------------------------------------------------
>
>                 Key: TEZ-2339
>                 URL: https://issues.apache.org/jira/browse/TEZ-2339
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>
> Scenario:
> =======
> yarn jar ./dist/tez/tez-tests-0.7.0-SNAPSHOT.jar randomwriter "-Dmapreduce.randomwriter.totalbytes=10000000" "-Dmapreduce.framework.name=yarn-tez"  /user/rajesh/tez-test-rw/
> The job finishes, but the data generated by the above command does not begin with the "SEQ6" header that is needed to read the data back. Instead, it starts with garbage values such as "99,51,32,102" rather than "83,69,81,6" (SEQ6), which causes exceptions when the data is read back.
> For example,
> yarn jar ./dist/tez/tez-tests-0.7.0-SNAPSHOT.jar sort "-Dmapreduce.framework.name=yarn-tez"  -r 10 /user/rajesh/tez-test-rw/ /user/rajesh/tez-test-rw-sort/
> would fail with an exception at org.apache.hadoop.io.SequenceFile$Reader.init while reading the data from /user/rajesh/tez-test-rw/.
> This is also observed in the 0.5.4 branch. Branch 0.4 has not been checked.
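
The header check described in the report can be sketched as follows. This is only an illustration, not part of Tez or Hadoop: it assumes one of the generated output files has already been copied to the local filesystem (for instance with `hdfs dfs -get`), and the helper names `read_header_bytes`, `has_seq6_header`, and `describe_header` are hypothetical.

```python
# Expected start of a version-6 SequenceFile, as quoted in the report:
# ASCII "SEQ" (83, 69, 81) followed by the version byte 6.
SEQ6_MAGIC = bytes([83, 69, 81, 6])


def read_header_bytes(path, count=4):
    """Return the first `count` bytes of a local file (hypothetical helper)."""
    with open(path, "rb") as f:
        return f.read(count)


def has_seq6_header(header):
    """Return True if the given bytes start with 'S', 'E', 'Q', 6."""
    return header[:4] == SEQ6_MAGIC


def describe_header(header):
    """Render the first four bytes as comma-separated decimals, as in the report."""
    return ",".join(str(b) for b in header[:4])
```

Calling `describe_header(read_header_bytes(local_path))` on a local copy prints the leading bytes in the same form used above, so a file affected by this issue would show something other than `83,69,81,6`.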



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)