Posted to dev@hama.apache.org by "ChiaHung Lin (JIRA)" <ji...@apache.org> on 2011/05/09 07:02:03 UTC

[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed environment

    [ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030607#comment-13030607 ] 

ChiaHung Lin commented on HAMA-385:
-----------------------------------

My test with the trunk code on my 5 VMs works fine, but that is probably because they are all single-core. So my first thought is that this is a race condition. Is there any chance we can look at the directive value before it is serialized and after it is deserialized? Or we could synchronize the methods, e.g. write/readFields, to see if that is the case.
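
A minimal sketch of that check, not the actual Hama code: SketchDirective is a hypothetical stand-in for DispatchTasksDirective, and it uses plain writeInt/writeUTF instead of Hadoop's WritableUtils so that it runs without Hadoop on the classpath. It synchronizes write/readFields and compares the value before serialization with the value after deserialization.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

class DirectiveRoundTrip {

  /** Hypothetical stand-in for a directive; NOT the real DispatchTasksDirective. */
  static class SketchDirective {
    private final List<String> actions = new ArrayList<>();

    synchronized void add(String action) {
      actions.add(action);
    }

    // synchronized: another thread cannot change the list between
    // writing the element count and writing the elements themselves.
    synchronized void write(DataOutput out) throws IOException {
      out.writeInt(actions.size());
      for (String action : actions) {
        out.writeUTF(action);
      }
    }

    synchronized void readFields(DataInput in) throws IOException {
      actions.clear();
      int count = in.readInt();
      for (int i = 0; i < count; i++) {
        actions.add(in.readUTF());
      }
    }

    // Copy of the current state, safe to log or compare.
    synchronized List<String> snapshot() {
      return new ArrayList<>(actions);
    }
  }

  static byte[] serialize(SketchDirective directive) throws IOException {
    ByteArrayOutputStream buffer = new ByteArrayOutputStream();
    try (DataOutputStream out = new DataOutputStream(buffer)) {
      directive.write(out);
    }
    return buffer.toByteArray();
  }

  static SketchDirective deserialize(byte[] bytes) throws IOException {
    SketchDirective directive = new SketchDirective();
    directive.readFields(new DataInputStream(new ByteArrayInputStream(bytes)));
    return directive;
  }

  public static void main(String[] args) throws IOException {
    SketchDirective before = new SketchDirective();
    before.add("launch-task-1"); // illustrative values only
    before.add("launch-task-2");

    byte[] bytes = serialize(before);
    SketchDirective after = deserialize(bytes);

    // Log both sides of the round trip so a mismatch is visible.
    System.out.println("before: " + before.snapshot() + " (" + bytes.length + " bytes)");
    System.out.println("after:  " + after.snapshot());
    System.out.println("match:  " + before.snapshot().equals(after.snapshot()));
  }
}
```

If the two values match here but not on a multi-core groom, that would point towards a concurrency problem rather than the serialization format itself.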

> Trunk doesn't work on distributed environment
> ---------------------------------------------
>
>                 Key: HAMA-385
>                 URL: https://issues.apache.org/jira/browse/HAMA-385
>             Project: Hama
>          Issue Type: Bug
>          Components: bsp
>    Affects Versions: 0.3.0
>            Reporter: Edward J. Yoon
>            Assignee: ChiaHung Lin
>            Priority: Critical
>             Fix For: 0.3.0
>
>
> I tried to run the Pi example with Hama trunk, but it doesn't work and fails with the error message below.
> {code}
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
> 2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
> java.io.EOFException
>         at java.io.DataInputStream.readByte(DataInputStream.java:250)
>         at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
>         at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
>         at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
>         at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
>         at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
>         at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
>         at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
>         at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
>         at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira