Posted to dev@hama.apache.org by "Edward J. Yoon (JIRA)" <ji...@apache.org> on 2011/05/09 05:00:03 UTC

[jira] [Created] (HAMA-385) Trunk doesn't work on distributed environment

Trunk doesn't work on distributed environment
---------------------------------------------

                 Key: HAMA-385
                 URL: https://issues.apache.org/jira/browse/HAMA-385
             Project: Hama
          Issue Type: Bug
          Components: bsp
    Affects Versions: 0.3.0
            Reporter: Edward J. Yoon
            Assignee: ChiaHung Lin
            Priority: Critical
             Fix For: 0.3.0


I tried to run the Pi example with Hama trunk, but it fails with the error message below.

{code}
2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
java.io.EOFException
        at java.io.DataInputStream.readByte(DataInputStream.java:250)
        at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
        at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
        at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
        at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
        at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
        at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
        at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
        at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
        at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
{code}
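The top frames of the trace (readVInt, readVLong, readByte) suggest that the stream had no bytes left when readFields asked for a variable-length integer. The snippet below is only a minimal sketch of that failure mode using plain java.io; the class name EmptyPayloadDemo and the helper failsWithEof are hypothetical and are not part of Hama or Hadoop.

```java
import java.io.ByteArrayInputStream;
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;

public class EmptyPayloadDemo {
    // Hypothetical helper: returns true if reading one byte hits end-of-stream.
    static boolean failsWithEof(byte[] payload) throws IOException {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(payload))) {
            in.readByte();          // same JDK call as the top frame of the trace
            return false;
        } catch (EOFException e) {
            return true;            // nothing left to read
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(failsWithEof(new byte[0]));     // empty payload
        System.out.println(failsWithEof(new byte[] {3}));  // one byte available
    }
}
```

An empty payload reproduces the exception type seen in the log, while a payload with at least one byte does not. This says nothing about why the payload was empty in the reported run.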

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed environment

Posted by "ChiaHung Lin (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030615#comment-13030615 ] 

ChiaHung Lin commented on HAMA-385:
-----------------------------------

Can we re-run the test several times to see whether it happens again? For example, start/stop the servers and re-run the Pi example several times. If it is a flaw in the code, the error may occur again.


[jira] [Resolved] (HAMA-385) Trunk doesn't work on distributed environment

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Edward J. Yoon resolved HAMA-385.
---------------------------------

    Resolution: Invalid
      Assignee: Edward J. Yoon  (was: ChiaHung Lin)

The problem was a configuration error on my side.


[jira] [Issue Comment Edited] (HAMA-385) Trunk doesn't work on distributed environment

Posted by "ChiaHung Lin (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030607#comment-13030607 ] 

ChiaHung Lin edited comment on HAMA-385 at 5/9/11 5:02 AM:
-----------------------------------------------------------

My test with the code in trunk on my 5 VMs works fine, but that is probably because they are all single-core. So my first thought is a race condition. Is there any chance we can look at the directive value before and after it is serialized and deserialized? Alternatively, we could synchronize the methods (e.g. write/readFields) to see whether that is the case.

      was (Author: chl501):
    My test with the code in trunk on my 5 VMs works fine, but that is probably because they are all single-core. So my first thought is a race condition. Is there any chance we can look at the directive value before it is serialized and deserialized? Alternatively, we could synchronize the methods (e.g. write/readFields) to see whether that is the case.
  

[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed environment

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030617#comment-13030617 ] 

Edward J. Yoon commented on HAMA-385:
-------------------------------------

Okay, I'll test again.


[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed environment

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030608#comment-13030608 ] 

Edward J. Yoon commented on HAMA-385:
-------------------------------------

Hmm. It works again. :/


[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed environment

Posted by "ChiaHung Lin (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030607#comment-13030607 ] 

ChiaHung Lin commented on HAMA-385:
-----------------------------------

My test with the code in trunk on my 5 VMs works fine, but that is probably because they are all single-core. So my first thought is a race condition. Is there any chance we can look at the directive value before it is serialized and deserialized? Alternatively, we could synchronize the methods (e.g. write/readFields) to see whether that is the case.
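The check suggested above could look roughly like the sketch below. It is an illustration under assumptions, not Hama code: DemoDirective is a hypothetical stand-in for a directive, its write/readFields methods only mirror the shape of Hadoop's Writable contract using plain java.io, and the wire format (an int count followed by UTF strings) is made up for the example.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.Arrays;

public class DirectiveRoundTrip {
    // Hypothetical stand-in for a directive; NOT Hama's DispatchTasksDirective.
    static final class DemoDirective {
        private String[] taskIds = new String[0];

        DemoDirective() {}
        DemoDirective(String... taskIds) { this.taskIds = taskIds.clone(); }

        // synchronized so another thread cannot change the fields mid-serialization
        synchronized void write(DataOutput out) throws IOException {
            out.writeInt(taskIds.length);
            for (String id : taskIds) out.writeUTF(id);
        }

        synchronized void readFields(DataInput in) throws IOException {
            int n = in.readInt();
            String[] ids = new String[n];
            for (int i = 0; i < n; i++) ids[i] = in.readUTF();
            taskIds = ids;
        }

        @Override public synchronized String toString() {
            return "DemoDirective" + Arrays.toString(taskIds);
        }
    }

    // Serializes to an in-memory buffer, reads it back, and returns the result as text.
    static String roundTrip(DemoDirective before) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        before.write(new DataOutputStream(buf));
        DemoDirective after = new DemoDirective();
        after.readFields(new DataInputStream(new ByteArrayInputStream(buf.toByteArray())));
        return after.toString();
    }

    public static void main(String[] args) throws IOException {
        DemoDirective before = new DemoDirective("task_0", "task_1");
        System.out.println("before: " + before);
        System.out.println("after:  " + roundTrip(before));
    }
}
```

If the two printed values differ, or the read side throws, the serialization path is suspect. If they match in isolation but the failure still appears under load, a race between threads becomes the more likely explanation.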
