You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Arun C Murthy (JIRA)" <ji...@apache.org> on 2007/05/02 23:42:15 UTC

[jira] Created: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Rewrite 'random-writer' to use '-reducer NONE'
----------------------------------------------

                 Key: HADOOP-1320
                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
             Project: Hadoop
          Issue Type: Improvement
          Components: examples
    Affects Versions: 0.12.3
            Reporter: Arun C Murthy
         Assigned To: Arun C Murthy
            Priority: Minor


Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12505458 ] 

Hadoop QA commented on HADOOP-1320:
-----------------------------------

Integrated in Hadoop-Nightly #123 (See [http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/123/])

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated HADOOP-1320:
----------------------------------

    Attachment: HADOOP-1320_1_20070611.patch

RandomWriter now uses 'reducer {{NONE}}', and also extends {{ToolBase}} and hence options such as {{test.randomwriter.bytes_per_map}} and {{map.output.compress}} can be set via command-line... (phew! relief! :) )

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated HADOOP-1320:
----------------------------------

    Attachment: HADOOP-1320_2_20070611.patch

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Doug Cutting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Cutting updated HADOOP-1320:
---------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

I just committed this.  Thanks, Arun!

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12504050 ] 

Hadoop QA commented on HADOOP-1320:
-----------------------------------

+1

http://issues.apache.org/jira/secure/attachment/12359427/HADOOP-1320_2_20070611.patch applied and successfully tested against trunk revision r546635.

Test results:   http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/275/testReport/
Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/275/console

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated HADOOP-1320:
----------------------------------

    Attachment:     (was: HADOOP-1320_2_20070611.patch)

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12503427 ] 

Arun C Murthy commented on HADOOP-1320:
---------------------------------------

I realise, rather belatedly, that changing the filename of the output of {{MapTask.DirectMapOutputCollector}} to make it compatible with that of {{ReduceTask}} results in changes to output of streaming's *-reducer NONE* too (e.g. I had to fix {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a big concern for streaming users? Runping? Arkday?

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Re: [jira] Issue Comment Edited: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by Arkady Borkovsky <ar...@yahoo-inc.com>.
I do not think that anyone relies on part names produced by reducer - 
NONE.

On Jun 11, 2007, at 7:59 AM, Arun C Murthy (JIRA) wrote:

>
>     [ https://issues.apache.org/jira/browse/HADOOP-1320? 
> page=com.atlassian.jira.plugin.system.issuetabpanels:comment- 
> tabpanel#action_12503427 ]
>
> Arun C Murthy edited comment on HADOOP-1320 at 6/11/07 7:58 AM:
> ----------------------------------------------------------------
>
> I realise, rather belatedly, that changing the filename of the  
> output of {{MapTask.DirectMapOutputCollector}} to make it  
> compatible with that of {{ReduceTask}} results in changes to output  
> of streaming's *-reducer NONE* too (e.g. I had to fix  
> {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a  
> big concern for streaming users? Runping/Arkady?
>
>
>  was:
> I realise, rather belatedly, that changing the filename of the  
> output of {{MapTask.DirectMapOutputCollector}} to make it  
> compatible with that of {{ReduceTask}} results in changes to output  
> of streaming's *-reducer NONE* too (e.g. I had to fix  
> {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a  
> big concern for streaming users? Runping? Arkday?
>
>> Rewrite 'random-writer' to use '-reducer NONE'
>> ----------------------------------------------
>>
>>                 Key: HADOOP-1320
>>                 URL: https://issues.apache.org/jira/browse/ 
>> HADOOP-1320
>>             Project: Hadoop
>>          Issue Type: Improvement
>>          Components: examples
>>    Affects Versions: 0.12.3
>>            Reporter: Arun C Murthy
>>            Assignee: Arun C Murthy
>>            Priority: Minor
>>             Fix For: 0.14.0
>>
>>         Attachments: HADOOP-1320_1_20070611.patch,  
>> HADOOP-1320_2_20070611.patch
>>
>>
>> Post HADOOP-1216 the 'random-writer' is archaic in the sense that  
>> it uses custom code to write data to dfs, this is now supported by  
>> the framework and doesn't make sense to reinvent the wheel here.
>
> -- 
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>


[jira] Issue Comment Edited: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12503427 ] 

Arun C Murthy edited comment on HADOOP-1320 at 6/11/07 7:58 AM:
----------------------------------------------------------------

I realise, rather belatedly, that changing the filename of the output of {{MapTask.DirectMapOutputCollector}} to make it compatible with that of {{ReduceTask}} results in changes to output of streaming's *-reducer NONE* too (e.g. I had to fix {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a big concern for streaming users? Runping/Arkady?


 was:
I realise, rather belatedly, that changing the filename of the output of {{MapTask.DirectMapOutputCollector}} to make it compatible with that of {{ReduceTask}} results in changes to output of streaming's *-reducer NONE* too (e.g. I had to fix {{org.apache.hadoop.streaming.TestStreamReduceNone}}) - is this a big concern for streaming users? Runping? Arkday?

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Runping Qi (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12503472 ] 

Runping Qi commented on HADOOP-1320:
------------------------------------


The changes are OK with streaming.

HADOOP-1440 may need to further change the way of constrcting  the final output file name for reduce NONE case.
I'll mark 1440 related to this JIRA.


> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated HADOOP-1320:
----------------------------------

    Attachment: HADOOP-1320_2_20070611.patch

Reworked patch to ensure output of {{MapTask.DirectMapOutputCollector}} is compatible with that of the {{ReduceTask}}... currently it just uses the {{taskId}} as the final path}.

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch, HADOOP-1320_2_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1320) Rewrite 'random-writer' to use '-reducer NONE'

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated HADOOP-1320:
----------------------------------

    Fix Version/s: 0.14.0
           Status: Patch Available  (was: Open)

> Rewrite 'random-writer' to use '-reducer NONE'
> ----------------------------------------------
>
>                 Key: HADOOP-1320
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1320
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 0.12.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>            Priority: Minor
>             Fix For: 0.14.0
>
>         Attachments: HADOOP-1320_1_20070611.patch
>
>
> Post HADOOP-1216 the 'random-writer' is archaic in the sense that it uses custom code to write data to dfs, this is now supported by the framework and doesn't make sense to reinvent the wheel here.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.