You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Karam Singh (Created) (JIRA)" <ji...@apache.org> on 2012/04/03 17:08:23 UTC

[jira] [Created] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
----------------------------------------------------------------------------------------

                 Key: MAPREDUCE-4100
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: contrib/gridmix
    Affects Versions: 1.1.0
            Reporter: Karam Singh
            Priority: Minor


While running 1400+ jobs trace I encountered this issue.
For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
This can sometimes also cause jobs to fail intermittently.

Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257495#comment-13257495 ] 

Hudson commented on MAPREDUCE-4100:
-----------------------------------

Integrated in Hadoop-Mapreduce-trunk #1054 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1054/])
    MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)

     Result = SUCCESS
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java

                
> Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4100
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>    Affects Versions: 1.1.0
>            Reporter: Karam Singh
>            Assignee: Amar Kamat
>            Priority: Minor
>             Fix For: 0.24.0
>
>         Attachments: mapreduce-4100-v1.2.patch
>
>
> While running 1400+ jobs trace I encountered this issue.
> For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
> This can sometimes also cause jobs to fail intermittently.
> Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Posted by "Amar Kamat (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Amar Kamat resolved MAPREDUCE-4100.
-----------------------------------

             Tags: gridmix compression-emulation map-only-jobs
       Resolution: Fixed
    Fix Version/s: 0.24.0
         Assignee: Amar Kamat
     Release Note: Bug fixed in compression emulation feature for map only jobs.
     Hadoop Flags: Reviewed

I just committed this to trunk. Thanks Ravi for the review.
                
> Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4100
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>    Affects Versions: 1.1.0
>            Reporter: Karam Singh
>            Assignee: Amar Kamat
>            Priority: Minor
>             Fix For: 0.24.0
>
>         Attachments: mapreduce-4100-v1.2.patch
>
>
> While running 1400+ jobs trace I encountered this issue.
> For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
> This can sometimes also cause jobs to fail intermittently.
> Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257243#comment-13257243 ] 

Hudson commented on MAPREDUCE-4100:
-----------------------------------

Integrated in Hadoop-Mapreduce-trunk-Commit #2121 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2121/])
    MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)

     Result = ABORTED
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java

                
> Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4100
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>    Affects Versions: 1.1.0
>            Reporter: Karam Singh
>            Assignee: Amar Kamat
>            Priority: Minor
>             Fix For: 0.24.0
>
>         Attachments: mapreduce-4100-v1.2.patch
>
>
> While running 1400+ jobs trace I encountered this issue.
> For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
> This can sometimes also cause jobs to fail intermittently.
> Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Posted by "Ravi Gummadi (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257220#comment-13257220 ] 

Ravi Gummadi commented on MAPREDUCE-4100:
-----------------------------------------

Patch looks fine to me. +1
                
> Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4100
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>    Affects Versions: 1.1.0
>            Reporter: Karam Singh
>            Priority: Minor
>         Attachments: mapreduce-4100-v1.2.patch
>
>
> While running 1400+ jobs trace I encountered this issue.
> For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
> This can sometimes also cause jobs to fail intermittently.
> Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257465#comment-13257465 ] 

Hudson commented on MAPREDUCE-4100:
-----------------------------------

Integrated in Hadoop-Hdfs-trunk #1019 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1019/])
    MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)

     Result = FAILURE
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java

                
> Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4100
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>    Affects Versions: 1.1.0
>            Reporter: Karam Singh
>            Assignee: Amar Kamat
>            Priority: Minor
>             Fix For: 0.24.0
>
>         Attachments: mapreduce-4100-v1.2.patch
>
>
> While running 1400+ jobs trace I encountered this issue.
> For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
> This can sometimes also cause jobs to fail intermittently.
> Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257228#comment-13257228 ] 

Hudson commented on MAPREDUCE-4100:
-----------------------------------

Integrated in Hadoop-Hdfs-trunk-Commit #2178 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2178/])
    MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)

     Result = SUCCESS
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java

                
> Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4100
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>    Affects Versions: 1.1.0
>            Reporter: Karam Singh
>            Assignee: Amar Kamat
>            Priority: Minor
>             Fix For: 0.24.0
>
>         Attachments: mapreduce-4100-v1.2.patch
>
>
> While running 1400+ jobs trace I encountered this issue.
> For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
> This can sometimes also cause jobs to fail intermittently.
> Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257229#comment-13257229 ] 

Hudson commented on MAPREDUCE-4100:
-----------------------------------

Integrated in Hadoop-Common-trunk-Commit #2104 (See [https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2104/])
    MAPREDUCE-4100. [Gridmix] Bug fixed in compression emulation feature for map only jobs. (amarrk) (Revision 1327816)

     Result = SUCCESS
amarrk : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1327816
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/CompressionEmulationUtil.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/java/org/apache/hadoop/mapred/gridmix/LoadJob.java
* /hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/gridmix/src/test/org/apache/hadoop/mapred/gridmix/TestCompressionEmulationUtils.java

                
> Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4100
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>    Affects Versions: 1.1.0
>            Reporter: Karam Singh
>            Assignee: Amar Kamat
>            Priority: Minor
>             Fix For: 0.24.0
>
>         Attachments: mapreduce-4100-v1.2.patch
>
>
> While running 1400+ jobs trace I encountered this issue.
> For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
> This can sometimes also cause jobs to fail intermittently.
> Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAPREDUCE-4100) Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs

Posted by "Amar Kamat (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Amar Kamat updated MAPREDUCE-4100:
----------------------------------

    Attachment: mapreduce-4100-v1.2.patch

Attaching a patch that fixes the issue. test-patch and ant-test passed.
                
> Sometimes gridmix emulates data larger much larger then acutal counter for map only jobs
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4100
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4100
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>    Affects Versions: 1.1.0
>            Reporter: Karam Singh
>            Priority: Minor
>         Attachments: mapreduce-4100-v1.2.patch
>
>
> While running 1400+ jobs trace I encountered this issue.
> For map-only jobs, observed that some Maps generating data of around 9 GB (From HDFS_BYTES_WRITTEN) whereas actual value is around 5GB in trace.
> This can sometimes also cause jobs to fail intermittently.
> Other GridMix version coming be Hadoop-1.1.X and above might also effected 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira