Posted to mapreduce-issues@hadoop.apache.org by "Karam Singh (Created) (JIRA)" <ji...@apache.org> on 2012/04/03 17:08:23 UTC
[jira] [Created] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
----------------------------------------------------------------------------------------
Key: MAPREDUCE-4100
URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: contrib/gridmix
Affects Versions: 1.1.0
Reporter: Karam Singh
Priority: Minor
While running a trace of 1400+ jobs, I encountered this issue.
For map-only jobs, I observed that some maps generate around 9 GB of data (per HDFS_BYTES_WRITTEN), whereas the actual value in the trace is around 5 GB.
This can sometimes also cause jobs to fail intermittently.
Other GridMix versions, in Hadoop-1.1.X and above, might also be affected.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257495#comment-13257495 ]
Hudson commented on MAPREDUCE-4100:
-----------------------------------
Integrated in Hadoop-Mapreduce-trunk #1054 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1054/])
MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)
Result = SUCCESS
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java
[jira] [Resolved] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Posted by "Amar Kamat (Resolved) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Amar Kamat resolved MAPREDUCE-4100.
-----------------------------------
Tags: gridmix compression-emulation map-only-jobs
Resolution: Fixed
Fix Version/s: 0.24.0
Assignee: Amar Kamat
Release Note: Bug fixed in compression emulation feature for map only jobs.
Hadoop Flags: Reviewed
I just committed this to trunk. Thanks Ravi for the review.
[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257243#comment-13257243 ]
Hudson commented on MAPREDUCE-4100:
-----------------------------------
Integrated in Hadoop-Mapreduce-trunk-Commit #2121 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2121/])
MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)
Result = ABORTED
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java
[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Posted by "Ravi Gummadi (Commented) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257220#comment-13257220 ]
Ravi Gummadi commented on MAPREDUCE-4100:
-----------------------------------------
Patch looks fine to me. +1
[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257465#comment-13257465 ]
Hudson commented on MAPREDUCE-4100:
-----------------------------------
Integrated in Hadoop-Hdfs-trunk #1019 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1019/])
MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)
Result = FAILURE
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java
[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257228#comment-13257228 ]
Hudson commented on MAPREDUCE-4100:
-----------------------------------
Integrated in Hadoop-Hdfs-trunk-Commit #2178 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2178/])
MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)
Result = SUCCESS
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java
[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257229#comment-13257229 ]
Hudson commented on MAPREDUCE-4100:
-----------------------------------
Integrated in Hadoop-Common-trunk-Commit #2104 (See [https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2104/])
MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)
Result = SUCCESS
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java
[jira] [Updated] (MAPREDUCE-4100) Sometimes gridmix emulates data
larger much larger then acutal counter for map only jobs
Posted by "Amar Kamat (Updated) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Amar Kamat updated MAPREDUCE-4100:
----------------------------------
Attachment: mapreduce-4100-v1.2.patch
Attaching a patch that fixes the issue. test-patch and ant-test passed.