You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-dev@hadoop.apache.org by "Khaled Elmeleegy (JIRA)" <ji...@apache.org> on 2009/07/06 22:39:14 UTC
[jira] Created: (MAPREDUCE-712) TextWritter example is CPU bound!!
TextWritter example is CPU bound!!
----------------------------------
Key: MAPREDUCE-712
URL: https://issues.apache.org/jira/browse/MAPREDUCE-712
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: task
Affects Versions: 0.20.1, 0.21.0
Environment: ~200 nodes cluster
Each node has the following configuration:
Processors: 2 x Xeon L5420 2.50GHz (8 cores) - Harpertown C0, 64-bit, quad-core (8 CPUs)
4 Disks
16 GB RAM
Linux 2.6
Hadoop version: trunk
Reporter: Khaled Elmeleegy
Running the RandomTextWritter example job ( from the examples jar) pegs the machiens' CPUs.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Resolved: (MAPREDUCE-712) TextWritter example is CPU bound!!
Posted by "Owen O'Malley (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Owen O'Malley resolved MAPREDUCE-712.
-------------------------------------
Resolution: Invalid
16 maps on 8 cpus running gzip is expected to completely saturate cpu. This is not a bug!!!
Also check to see if you were using the native codec. If you are using the Java codec, it will be very slow and cpu bound.
> TextWritter example is CPU bound!!
> ----------------------------------
>
> Key: MAPREDUCE-712
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-712
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: task
> Affects Versions: 0.20.1, 0.21.0
> Environment: ~200 nodes cluster
> Each node has the following configuration:
> Processors: 2 x Xeon L5420 2.50GHz (8 cores) - Harpertown C0, 64-bit, quad-core (8 CPUs)
> 4 Disks
> 16 GB RAM
> Linux 2.6
> Hadoop version: trunk
> Reporter: Khaled Elmeleegy
>
> Running the RandomTextWritter example job ( from the examples jar) pegs the machiens' CPUs.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Reopened: (MAPREDUCE-712) TextWritter example is CPU bound!!
Posted by "Owen O'Malley (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Owen O'Malley reopened MAPREDUCE-712:
-------------------------------------
I notice now that you didn't have compression. I wonder how much time you were spending in gc with such small heaps. That might explain the cpu load.
> TextWritter example is CPU bound!!
> ----------------------------------
>
> Key: MAPREDUCE-712
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-712
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: task
> Affects Versions: 0.20.1, 0.21.0
> Environment: ~200 nodes cluster
> Each node has the following configuration:
> Processors: 2 x Xeon L5420 2.50GHz (8 cores) - Harpertown C0, 64-bit, quad-core (8 CPUs)
> 4 Disks
> 16 GB RAM
> Linux 2.6
> Hadoop version: trunk
> Reporter: Khaled Elmeleegy
>
> Running the RandomTextWritter example job ( from the examples jar) pegs the machiens' CPUs.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.