You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Robert Joseph Evans (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/08 23:15:40 UTC

[jira] [Resolved] (MAPREDUCE-3388) Streaming task with special char gets wrong output

     [ https://issues.apache.org/jira/browse/MAPREDUCE-3388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Joseph Evans resolved MAPREDUCE-3388.
--------------------------------------------

    Resolution: Invalid

As it turns out this was an issue with our tests which was masked by a small bug in streaming.  I have filed MAPREDUCE-3521 to address this issue in streaming.
                
> Streaming task with special char gets wrong output 
> ---------------------------------------------------
>
>                 Key: MAPREDUCE-3388
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3388
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/streaming, mrv2
>    Affects Versions: 0.23.0, 0.24.0
>            Reporter: Robert Joseph Evans
>            Assignee: Robert Joseph Evans
>            Priority: Critical
>
> In 0.20.204:
> {noformat}
> hadoop jar $HADOOP_HOME/hadoop-streaming.jar '-Dmapreduce.job.acl-view-job=*' -files 'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt -mapper 'xargs cat' -reducer cat -output output -jobconf mapred.job.name=streamingTest-1000 -jobconf 'mapreduce.job.acl-view-job=*'
> {noformat}
> Output:
> {noformat}
> hadoop fs -cat output/*
> and-so-is-the-c#
> c++-also-supports-object-oriented-concepts
> hadoop-apache-org-core  
> hadoop-streaming
> hod-is-the-part-of-hadoop
> it-is-residing-on-apache-under-repos-asf
> java-an object-oriented-language
> smalltalk-is-also-object-oriented-language
> streaming-is-also-part-of-it
> {noformat}
> For 0.23:
> {noformat}
> hadoop jar $HADOOP_MAPRED_HOME/hadoop-streaming.jar -Dmapreduce.job.acl-view-job=* -files 'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt  -mapper 'xargs cat' -reducer cat -output output -jobconf mapred.job.name=streamingTest-1000 -jobconf 'mapreduce.job.acl-view-job=*'
> {noformat}
> Output:
> {noformat}
> testlink!@$&*()-_+=/input1.txt testlink!@$&*()-_+=/input2.txt
> {noformat}
> The contents of input.txt are as follows:
> {noformat}
> hadoop fs -cat Streaming/streaming-1000/input.txt
> testlink!@$&*()-_+=/input1.txt
> testlink!@$&*()-_+=/input2.txt
> {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira