You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Arun C Murthy (JIRA)" <ji...@apache.org> on 2007/05/02 23:42:15 UTC
[jira] Created: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Rewrite 'random-writer' to use '-reducer NONE'
----------------------------------------------
Key: HADOOP-1320
URL: https://issues.apache.org/jira/browse/HADOOP-1320
Project: Hadoop
Issue Type: Improvement
Components: examples
Affects Versions: 0.12.3
Reporter: Arun C Murthy
Assigned To: Arun C Murthy
Priority: Minor
Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12505458 ]
Hadoop QA commented on HADOOP-1320:
-----------------------------------
Integrated in Hadoop-Nightly #123 (See [http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/123/])
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy updated HADOOP-1320:
----------------------------------
Attachment: HADOOP-1320_1_20070611.patch
RandomWriter now uses 'reducer {{NONE}}', and also extends {{ToolBase}} and hence options such as {{test.randomwriter.bytes_per_map}} and {{map.output.compress}} can be set via command-line... (phew! relief! :) )
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy updated HADOOP-1320:
----------------------------------
Attachment: HADOOP-1320_2_20070611.patch
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Doug Cutting (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Doug Cutting updated HADOOP-1320:
---------------------------------
Resolution: Fixed
Status: Resolved (was: Patch Available)
I just committed this. Thanks, Arun!
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12504050 ]
Hadoop QA commented on HADOOP-1320:
-----------------------------------
+1
http://issues.apache.org/jira/secure/attachment/12359427/HADOOP-1320_2_20070611.patch applied and successfully tested against trunk revision r546635.
Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/275/testReport/
Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/275/console
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy updated HADOOP-1320:
----------------------------------
Attachment: (was: HADOOP-1320_2_20070611.patch)
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12503427 ]
Arun C Murthy commented on HADOOP-1320:
---------------------------------------
I realise, rather belatedly, that changing the filename of the output of {{MapTask.DirectMapOutputCollector}} to make it compatible with that of {{ReduceTask}} results in changes to output of streaming's *-reducer NONE* too (e.g. I had to fix {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a big concern for streaming users? Runping? Arkday?
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
Re: [jira] Issue Comment Edited: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'
Posted by Arkady Borkovsky <ar...@yahoo-inc.com>.
I do not think that anyone relies on part names produced by reducer -
NONE.
On Jun 11, 2007, at 7:59 AM, Arun C Murthy (JIRA) wrote:
>
> [ https://issues.apache.org/jira/browse/HADOOP-1320?
> page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
> tabpanel#action_12503427 ]
>
> Arun C Murthy edited comment on HADOOP-1320 at 6/11/07 7:58 AM:
> ----------------------------------------------------------------
>
> I realise, rather belatedly, that changing the filename of the
> output of {{MapTask.DirectMapOutputCollector}} to make it
> compatible with that of {{ReduceTask}} results in changes to output
> of streaming's *-reducer NONE* too (e.g. I had to fix
> {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a
> big concern for streaming users? Runping/Arkady?
>
>
> was:
> I realise, rather belatedly, that changing the filename of the
> output of {{MapTask.DirectMapOutputCollector}} to make it
> compatible with that of {{ReduceTask}} results in changes to output
> of streaming's *-reducer NONE* too (e.g. I had to fix
> {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a
> big concern for streaming users? Runping? Arkday?
>
>> Rewrite 'random-writer' to use '-reducer NONE'
>> ----------------------------------------------
>>
>> Key: HADOOP-1320
>> URL: https://issues.apache.org/jira/browse/
>> HADOOP-1320
>> Project: Hadoop
>> Issue Type: Improvement
>> Components: examples
>> Affects Versions: 0.12.3
>> Reporter: Arun C Murthy
>> Assignee: Arun C Murthy
>> Priority: Minor
>> Fix For: 0.14.0
>>
>> Attachments: HADOOP-1320_1_20070611.patch,
>> HADOOP-1320_2_20070611.patch
>>
>>
>> Post HADOOP-1216 the 'random-writer' is archaic in the sense that
>> it uses custom code to write data to dfs, this is now supported by
>> the framework and doesn't make sense to reinvent the wheel here.
>
> --
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>
[jira] Issue Comment Edited: (HADOOP-1320) Rewrite 'random-writer'
to use '-reducer NONE'
Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12503427 ]
Arun C Murthy edited comment on HADOOP-1320 at 6/11/07 7:58 AM:
----------------------------------------------------------------
I realise, rather belatedly, that changing the filename of the output of {{MapTask.DirectMapOutputCollector}} to make it compatible with that of {{ReduceTask}} results in changes to output of streaming's *-reducer NONE* too (e.g. I had to fix {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a big concern for streaming users? Runping/Arkady?
was:
I realise, rather belatedly, that changing the filename of the output of {{MapTask.DirectMapOutputCollector}} to make it compatible with that of {{ReduceTask}} results in changes to output of streaming's *-reducer NONE* too (e.g. I had to fix {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a big concern for streaming users? Runping? Arkday?
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Runping Qi (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12503472 ]
Runping Qi commented on HADOOP-1320:
------------------------------------
The changes are OK with streaming.
HADOOP-1440 may need to further change the way of constrcting the final output file name for reduce NONE case.
I'll mark 1440 related to this JIRA.
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy updated HADOOP-1320:
----------------------------------
Attachment: HADOOP-1320_2_20070611.patch
Reworked patch to ensure output of {{MapTask.DirectMapOutputCollector}} is compatible with that of the {{ReduceTask}}... currently it just uses the {{taskId}} as the final path}.
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use
'-reducer NONE'
Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy updated HADOOP-1320:
----------------------------------
Fix Version/s: 0.14.0
Status: Patch Available (was: Open)
> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
> Key: HADOOP-1320
> URL: https://issues.apache.org/jira/browse/HADOOP-1320
> Project: Hadoop
> Issue Type: Improvement
> Components: examples
> Affects Versions: 0.12.3
> Reporter: Arun C Murthy
> Assignee: Arun C Murthy
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HADOOP-1320_1_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.