You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Devaraj Das (JIRA)" <ji...@apache.org> on 2007/05/16 11:25:16 UTC

[jira] Created: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

RandomWriter should be tweaked to generate input data for terasort
------------------------------------------------------------------

                 Key: HADOOP-1376
                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
             Project: Hadoop
          Issue Type: Improvement
          Components: mapred
            Reporter: Devaraj Das
         Assigned To: Devaraj Das
             Fix For: 0.14.0


We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-1376:
--------------------------------

    Attachment: randomwriter.patch

Sorry Doug for this javadoc goof-up yet again. I have now moved the javadoc comments to what it was in the earlier patch but this one has the <xmp> tags and the author tag has been removed.

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498324 ] 

Hadoop QA commented on HADOOP-1376:
-----------------------------------

+1

http://issues.apache.org/jira/secure/attachment/12357950/randomwriter.patch applied and successfully tested against trunk revision r540973.

Test results:   http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/190/testReport/
Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/190/console

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498131 ] 

Devaraj Das commented on HADOOP-1376:
-------------------------------------

The hudson build failure with the latest patch is due to HADOOP-1423.

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-1376:
--------------------------------

    Status: Patch Available  (was: Open)

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-1376:
--------------------------------

    Attachment:     (was: randomwriter.patch)

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-1376:
--------------------------------

    Attachment: randomwriter.patch

Attached is a patch that adds a config item called test.randomwrite.total_bytes. This can be set by the user to a number like '1099511627776' (1 TB). 

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12497764 ] 

Hadoop QA commented on HADOOP-1376:
-----------------------------------

+1

http://issues.apache.org/jira/secure/attachment/12357847/randomwriter.patch applied and successfully tested against trunk revision r540469.

Test results:   http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/179/testReport/
Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/179/console

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Doug Cutting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Cutting updated HADOOP-1376:
---------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

I just committed this.  Thanks, Devaraj!

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Doug Cutting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Cutting updated HADOOP-1376:
---------------------------------

    Status: Open  (was: Patch Available)

Embedding XML in a javadoc comment doesn't work well.  The javadoc output does not look like what you'd hope it might.

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12499306 ] 

Hadoop QA commented on HADOOP-1376:
-----------------------------------

Integrated in Hadoop-Nightly #101 (See http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/101/)

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Owen O'Malley (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Owen O'Malley updated HADOOP-1376:
----------------------------------

    Status: Open  (was: Patch Available)

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Doug Cutting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Cutting updated HADOOP-1376:
---------------------------------

    Status: Open  (was: Patch Available)

The javadoc still looks terrible.  Much the same, in fact.  This config stuff should either be rendered as a table or as verbatim text.  The top-level overview.html uses <xmp></xmp> to provide configuration examples as verbatim text.

It should also be before the @author tag, as currently it's included in the author.  (Didn't we agree to get rid of author tags?  Ah, HADOOP-1147.)

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-1376:
--------------------------------

    Status: Patch Available  (was: Open)

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-1376:
--------------------------------

    Attachment: randomwriter.patch

Had uploaded the wrong patch earlier. Here is the right one.

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Owen O'Malley (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Owen O'Malley updated HADOOP-1376:
----------------------------------

    Status: Patch Available  (was: Open)

to get the qa auto-blessing

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498517 ] 

Hadoop QA commented on HADOOP-1376:
-----------------------------------

+1

http://issues.apache.org/jira/secure/attachment/12358060/randomwriter.patch applied and successfully tested against trunk revision r541096.

Test results:   http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/195/testReport/
Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/195/console

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-1376:
--------------------------------

    Status: Patch Available  (was: Open)

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Owen O'Malley (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498298 ] 

Owen O'Malley commented on HADOOP-1376:
---------------------------------------

+1

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498129 ] 

Hadoop QA commented on HADOOP-1376:
-----------------------------------

-1, build or testing failed

2 attempts failed to build and test the latest attachment http://issues.apache.org/jira/secure/attachment/12357950/randomwriter.patch against trunk revision r540737.

Test results:   http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/186/testReport/
Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/186/console

Please note that this message is automatically generated and may represent a problem with the automation system and not the patch.

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1376) RandomWriter should be tweaked to generate input data for terasort

Posted by "Devaraj Das (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Devaraj Das updated HADOOP-1376:
--------------------------------

    Attachment: randomwriter.patch

Thanks Doug for pointing out the xml-javadoc problem. Attached is the updated patch.

> RandomWriter should be tweaked to generate input data for terasort
> ------------------------------------------------------------------
>
>                 Key: HADOOP-1376
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1376
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>             Fix For: 0.14.0
>
>         Attachments: randomwriter.patch, randomwriter.patch
>
>
> We need input data to run the terasort benchmark. The RandomWriter can do that - it needs to generate - 10(power)10 number of 100 byte records (10-byte keys and 90-byte values).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.