You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hama.apache.org by "Edward J. Yoon (JIRA)" <ji...@apache.org> on 2011/05/09 05:00:03 UTC
[jira] [Created] (HAMA-385) Trunk doesn't work on distributed
environment
Trunk doesn't work on distributed environment
---------------------------------------------
Key: HAMA-385
URL: https://issues.apache.org/jira/browse/HAMA-385
Project: Hama
Issue Type: Bug
Components: bsp
Affects Versions: 0.3.0
Reporter: Edward J. Yoon
Assignee: ChiaHung Lin
Priority: Critical
Fix For: 0.3.0
I tried to run a Pi example with Hama Trunk but, doesn't work with below error message.
{code}
2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
java.io.EOFException
at java.io.DataInputStream.readByte(DataInputStream.java:250)
at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
{code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed
environment
Posted by "ChiaHung Lin (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030615#comment-13030615 ]
ChiaHung Lin commented on HAMA-385:
-----------------------------------
Can we re-run the test several times to see if that happens again? For example, start/ stop servers and re-run pi example several times. If it is the flaw in the code, then the error may occur again.
> Trunk doesn't work on distributed environment
> ---------------------------------------------
>
> Key: HAMA-385
> URL: https://issues.apache.org/jira/browse/HAMA-385
> Project: Hama
> Issue Type: Bug
> Components: bsp
> Affects Versions: 0.3.0
> Reporter: Edward J. Yoon
> Assignee: ChiaHung Lin
> Priority: Critical
> Fix For: 0.3.0
>
>
> I tried to run a Pi example with Hama Trunk but, doesn't work with below error message.
> {code}
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
> 2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
> java.io.EOFException
> at java.io.DataInputStream.readByte(DataInputStream.java:250)
> at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
> at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
> at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
> at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
> at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
> at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
> at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
> at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
> at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
> {code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HAMA-385) Trunk doesn't work on distributed
environment
Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Edward J. Yoon resolved HAMA-385.
---------------------------------
Resolution: Invalid
Assignee: Edward J. Yoon (was: ChiaHung Lin)
The problem was my configuration error.
> Trunk doesn't work on distributed environment
> ---------------------------------------------
>
> Key: HAMA-385
> URL: https://issues.apache.org/jira/browse/HAMA-385
> Project: Hama
> Issue Type: Bug
> Components: bsp
> Affects Versions: 0.3.0
> Reporter: Edward J. Yoon
> Assignee: Edward J. Yoon
> Priority: Critical
> Fix For: 0.3.0
>
>
> I tried to run a Pi example with Hama Trunk but, doesn't work with below error message.
> {code}
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
> 2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
> java.io.EOFException
> at java.io.DataInputStream.readByte(DataInputStream.java:250)
> at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
> at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
> at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
> at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
> at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
> at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
> at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
> at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
> at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
> {code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Issue Comment Edited] (HAMA-385) Trunk doesn't work on
distributed environment
Posted by "ChiaHung Lin (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030607#comment-13030607 ]
ChiaHung Lin edited comment on HAMA-385 at 5/9/11 5:02 AM:
-----------------------------------------------------------
My test with code in trunk on my 5 vms works ok, but probably due to they are all single core. So my first thought is race condition issue. Is there any chance that we can have a look at the directive value before/ after serialized and de-serialized? Or synchronize the methods e.g. write/ readFields to see if that is the case.
was (Author: chl501):
My test with code in trunk on my 5 vms works ok, but probably due to they are all single core. So my first thought is race condition issue. Is there any chance that we can have a look at the directive value before serialized and de-serialized? Or synchronize the methods e.g. write/ readFields to see if that is the case.
> Trunk doesn't work on distributed environment
> ---------------------------------------------
>
> Key: HAMA-385
> URL: https://issues.apache.org/jira/browse/HAMA-385
> Project: Hama
> Issue Type: Bug
> Components: bsp
> Affects Versions: 0.3.0
> Reporter: Edward J. Yoon
> Assignee: ChiaHung Lin
> Priority: Critical
> Fix For: 0.3.0
>
>
> I tried to run a Pi example with Hama Trunk but, doesn't work with below error message.
> {code}
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
> 2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
> java.io.EOFException
> at java.io.DataInputStream.readByte(DataInputStream.java:250)
> at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
> at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
> at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
> at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
> at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
> at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
> at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
> at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
> at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
> {code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed
environment
Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030617#comment-13030617 ]
Edward J. Yoon commented on HAMA-385:
-------------------------------------
Okay, I'll test again.
> Trunk doesn't work on distributed environment
> ---------------------------------------------
>
> Key: HAMA-385
> URL: https://issues.apache.org/jira/browse/HAMA-385
> Project: Hama
> Issue Type: Bug
> Components: bsp
> Affects Versions: 0.3.0
> Reporter: Edward J. Yoon
> Assignee: ChiaHung Lin
> Priority: Critical
> Fix For: 0.3.0
>
>
> I tried to run a Pi example with Hama Trunk but, doesn't work with below error message.
> {code}
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
> 2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
> java.io.EOFException
> at java.io.DataInputStream.readByte(DataInputStream.java:250)
> at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
> at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
> at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
> at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
> at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
> at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
> at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
> at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
> at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
> {code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed
environment
Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030608#comment-13030608 ]
Edward J. Yoon commented on HAMA-385:
-------------------------------------
Hmm. It works again. :/
> Trunk doesn't work on distributed environment
> ---------------------------------------------
>
> Key: HAMA-385
> URL: https://issues.apache.org/jira/browse/HAMA-385
> Project: Hama
> Issue Type: Bug
> Components: bsp
> Affects Versions: 0.3.0
> Reporter: Edward J. Yoon
> Assignee: ChiaHung Lin
> Priority: Critical
> Fix For: 0.3.0
>
>
> I tried to run a Pi example with Hama Trunk but, doesn't work with below error message.
> {code}
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
> 2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
> java.io.EOFException
> at java.io.DataInputStream.readByte(DataInputStream.java:250)
> at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
> at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
> at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
> at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
> at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
> at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
> at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
> at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
> at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
> {code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HAMA-385) Trunk doesn't work on distributed
environment
Posted by "ChiaHung Lin (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030607#comment-13030607 ]
ChiaHung Lin commented on HAMA-385:
-----------------------------------
My test with code in trunk on my 5 vms works ok, but probably due to they are all single core. So my first thought is race condition issue. Is there any chance that we can have a look at the directive value before serialized and de-serialized? Or synchronize the methods e.g. write/ readFields to see if that is the case.
> Trunk doesn't work on distributed environment
> ---------------------------------------------
>
> Key: HAMA-385
> URL: https://issues.apache.org/jira/browse/HAMA-385
> Project: Hama
> Issue Type: Bug
> Components: bsp
> Affects Versions: 0.3.0
> Reporter: Edward J. Yoon
> Assignee: ChiaHung Lin
> Priority: Critical
> Fix For: 0.3.0
>
>
> I tried to run a Pi example with Hama Trunk but, doesn't work with below error message.
> {code}
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 44510: starting
> 2011-05-09 12:02:07,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 44510: starting
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: GroomServer up at: localhost/127.0.0.1:44510
> 2011-05-09 12:02:07,763 INFO org.apache.hama.bsp.GroomServer: Starting groom: groomd_cnode4.cloud_61000
> 2011-05-09 12:03:14,242 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 50000: readAndProcess threw exception java.io.EOFException. Count of bytes read: 0
> java.io.EOFException
> at java.io.DataInputStream.readByte(DataInputStream.java:250)
> at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:298)
> at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:319)
> at org.apache.hama.bsp.DispatchTasksDirective.readFields(DispatchTasksDirective.java:86)
> at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:237)
> at org.apache.hadoop.ipc.RPC$Invocation.readFields(RPC.java:103)
> at org.apache.hadoop.ipc.Server$Connection.processData(Server.java:906)
> at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:845)
> at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:419)
> at org.apache.hadoop.ipc.Server$Listener.run(Server.java:328)
> {code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira