Posted to common-issues@hadoop.apache.org by "Jimmy Xiang (JIRA)" <ji...@apache.org> on 2012/11/21 18:23:58 UTC

[jira] [Created] (HADOOP-9079) LocalDirAllocator throws ArithmeticException

Jimmy Xiang created HADOOP-9079:
-----------------------------------

             Summary: LocalDirAllocator throws ArithmeticException
                 Key: HADOOP-9079
                 URL: https://issues.apache.org/jira/browse/HADOOP-9079
             Project: Hadoop Common
          Issue Type: Bug
            Reporter: Jimmy Xiang
            Priority: Minor


2012-11-19 22:07:41,709 WARN  [IPC Server handler 0 on 38671] nodemanager.NMAuditLogger(150): USER=UnknownUser	IP=****	OPERATION=Stop Container Request	TARGET=ContainerManagerImpl	RESULT=FAILURE	DESCRIPTION=Trying to stop unknown container!	APPID=application_1353391620476_0001	CONTAINERID=container_1353391620476_0001_01_000010
java.lang.ArithmeticException: / by zero
	at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:368)
	at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:150)
	at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:131)
	at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:115)
	at org.apache.hadoop.yarn.server.nodemanager.LocalDirsHandlerService.getLocalPathForWrite(LocalDirsHandlerService.java:263)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:849)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HADOOP-9079) LocalDirAllocator throws ArithmeticException

Posted by "Jimmy Xiang (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-9079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jimmy Xiang updated HADOOP-9079:
--------------------------------

    Attachment: trunk-9079.patch
    

[jira] [Commented] (HADOOP-9079) LocalDirAllocator throws ArithmeticException

Posted by "Eli Collins (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-9079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13502150#comment-13502150 ] 

Eli Collins commented on HADOOP-9079:
-------------------------------------

I assume this happens when the local dirs are out of space?  I don't think just checking totalAvailable > 0 fixes this: the following loop doesn't bounds-check dir, so it needs to be rewritten, with a test.

{code}
          long randomPosition = Math.abs(r.nextLong()) % totalAvailable;
          int dir = 0;
          while (randomPosition > availableOnDisk[dir]) {
            randomPosition -= availableOnDisk[dir];
            dir++;
          }
{code}
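
For illustration only, a minimal sketch of how the selection could be written with an explicit bounds check and a guard against a zero total. This is not the attached patch; the class and method names (WeightedDirPicker, pickDir) and the -1 return value are assumptions made for the example.

```java
import java.util.Random;

// Hypothetical helper, not part of LocalDirAllocator.
class WeightedDirPicker {
  /**
   * Returns an index chosen with probability roughly proportional to
   * availableOnDisk[i], or -1 when no directory has any available space.
   */
  static int pickDir(long[] availableOnDisk, Random r) {
    long totalAvailable = 0;
    for (long a : availableOnDisk) {
      totalAvailable += a;
    }
    if (totalAvailable <= 0) {
      return -1; // nothing to choose from; avoids "% 0"
    }
    // floorMod keeps the result in [0, totalAvailable), even for Long.MIN_VALUE
    long randomPosition = Math.floorMod(r.nextLong(), totalAvailable);
    int dir = 0;
    // Explicit bound on dir; ">=" also skips dirs with zero available space
    while (dir < availableOnDisk.length - 1
        && randomPosition >= availableOnDisk[dir]) {
      randomPosition -= availableOnDisk[dir];
      dir++;
    }
    return dir;
  }
}
```

Here `>=` replaces the `>` of the quoted loop so that a directory with zero available space is never selected; that is a choice of this sketch, not something settled in the discussion.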
                

[jira] [Commented] (HADOOP-9079) LocalDirAllocator throws ArithmeticException

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-9079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13502160#comment-13502160 ] 

Hadoop QA commented on HADOOP-9079:
-----------------------------------

{color:red}-1 overall{color}.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12554534/trunk-9079.patch
  against trunk revision .

    {color:green}+1 @author{color}.  The patch does not contain any @author tags.

    {color:red}-1 tests included{color}.  The patch doesn't appear to include any new or modified tests.
                        Please justify why no new tests are needed for this patch.
                        Also please list what manual steps were performed to verify this patch.

    {color:green}+1 javac{color}.  The applied patch does not increase the total number of javac compiler warnings.

    {color:green}+1 javadoc{color}.  The javadoc tool did not generate any warning messages.

    {color:green}+1 eclipse:eclipse{color}.  The patch built with eclipse:eclipse.

    {color:green}+1 findbugs{color}.  The patch does not introduce any new Findbugs (version 1.3.9) warnings.

    {color:green}+1 release audit{color}.  The applied patch does not increase the total number of release audit warnings.

    {color:green}+1 core tests{color}.  The patch passed unit tests in hadoop-common-project/hadoop-common.

    {color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-HADOOP-Build/1790//testReport/
Console output: https://builds.apache.org/job/PreCommit-HADOOP-Build/1790//console

This message is automatically generated.
                

[jira] [Commented] (HADOOP-9079) LocalDirAllocator throws ArithmeticException

Posted by "Jimmy Xiang (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-9079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13502386#comment-13502386 ] 

Jimmy Xiang commented on HADOOP-9079:
-------------------------------------

Yes, reproducing this issue requires some local dir to be out of space.  Is there a good way to simulate a disk being out of space?
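
One possible approach, sketched here as an assumption and not as a description of how the Hadoop tests work: the failing arithmetic can be exercised without a real full disk by feeding it an all-zero availability array. The class and method names (ZeroCapacityRepro, select, throwsOnAllZero) are hypothetical; the loop body is the one quoted in Eli's comment.

```java
import java.util.Random;

// Hypothetical reproduction, isolated from any real disk or Hadoop class.
class ZeroCapacityRepro {
  // Same arithmetic as the loop quoted in the thread.
  static int select(long[] availableOnDisk, Random r) {
    long totalAvailable = 0;
    for (long a : availableOnDisk) {
      totalAvailable += a;
    }
    // Integer remainder by zero throws ArithmeticException when the total is 0
    long randomPosition = Math.abs(r.nextLong()) % totalAvailable;
    int dir = 0;
    while (randomPosition > availableOnDisk[dir]) {
      randomPosition -= availableOnDisk[dir];
      dir++;
    }
    return dir;
  }

  // True if an all-zero array (every dir "full") triggers the exception.
  static boolean throwsOnAllZero() {
    try {
      select(new long[] {0L, 0L}, new Random());
      return false;
    } catch (ArithmeticException e) {
      return true;
    }
  }
}
```

Whether the real allocator can be driven into the same state from a test depends on how it obtains the available space, which this sketch does not model.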
                

[jira] [Commented] (HADOOP-9079) LocalDirAllocator throws ArithmeticException

Posted by "Jimmy Xiang (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-9079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13502171#comment-13502171 ] 

Jimmy Xiang commented on HADOOP-9079:
-------------------------------------

@Eli,  could you add me as a Hadoop contributor?

This could also happen when the local dirs are not writable.

The second while loop seems to be bounded implicitly, since randomPosition < totalAvailable and the sum of availableOnDisk[dir] over all dirs equals totalAvailable.

Let me think about it more.

Sure, will add a test.
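
The implicit bound can be spelled out as a short derivation. The symbols are introduced here for illustration: a_k stands for availableOnDisk[k], T for totalAvailable, p for the initial randomPosition and n for the number of dirs. It assumes T > 0 and that the sum does not overflow.

```latex
\begin{align*}
&\text{Given } a_0,\dots,a_{n-1} \ge 0,\quad T=\sum_{i=0}^{n-1} a_i > 0,\quad p < T. \\
&\text{The loop leaves index } k \text{ only if } p-\sum_{i<k} a_i > a_k
  \iff p > \sum_{i\le k} a_i. \\
&\text{Leaving } k=n-1 \text{ would need } p > T, \text{ contradicting } p<T,
  \text{ so } \mathit{dir} \le n-1.
\end{align*}
```

The argument does not cover T = 0, where `% totalAvailable` divides by zero before the loop is reached.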
                

[jira] [Updated] (HADOOP-9079) LocalDirAllocator throws ArithmeticException

Posted by "Jimmy Xiang (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-9079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jimmy Xiang updated HADOOP-9079:
--------------------------------

    Status: Patch Available  (was: Open)
    

[jira] [Assigned] (HADOOP-9079) LocalDirAllocator throws ArithmeticException

Posted by "Eli Collins (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-9079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eli Collins reassigned HADOOP-9079:
-----------------------------------

    Assignee: Jimmy Xiang
    