Posted to user@hama.apache.org by Lyu Xuedong <lx...@gmail.com> on 2013/04/16 15:55:21 UTC

Why my distributed mode does not work?

My project works well in Eclipse, but when I export it as a jar file
and submit it to a two-node Hama cluster, ERROR and FATAL messages
appear if the input file is larger than 64 MB.
I suspect that something in my Hadoop or Hama configuration files is
wrong, but the pi estimator example runs normally on my cluster and
GroomServer$BSPPeerChild processes can be seen on each node. I have been
debugging for a whole day without any improvement.
My input file contains a large number of RDF triples of the form
"<subject> <predicate> <object> ." My task is to create vertices for the
subjects and objects; the predicates become the subjects' outgoing edges.
What should I do?
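
To make the input format described above concrete, here is a minimal
sketch of splitting one "<subject> <predicate> <object> ." line into its
three terms. The class TripleLine is hypothetical and is not part of
Hama or of the poster's project; it assumes a simplified triple syntax
in which the subject and predicate contain no whitespace. Malformed
lines raise an exception with a message instead of silently yielding
null values.

```java
/** Hypothetical helper (not part of Hama): one parsed "<s> <p> <o> ." line. */
final class TripleLine {
    final String subject;
    final String predicate;
    final String object;

    private TripleLine(String subject, String predicate, String object) {
        this.subject = subject;
        this.predicate = predicate;
        this.object = object;
    }

    /**
     * Parses a single triple line. Assumes the subject and predicate contain
     * no whitespace; the object may contain spaces (for example a literal).
     */
    static TripleLine parse(String line) {
        if (line == null) {
            throw new IllegalArgumentException("line is null");
        }
        String text = line.trim();
        if (!text.endsWith(".")) {
            throw new IllegalArgumentException("missing terminating '.': " + line);
        }
        // Drop the trailing '.' and split into at most three terms.
        text = text.substring(0, text.length() - 1).trim();
        String[] parts = text.split("\\s+", 3);
        if (parts.length != 3) {
            throw new IllegalArgumentException("expected 3 terms: " + line);
        }
        return new TripleLine(parts[0], parts[1], parts[2]);
    }

    public static void main(String[] args) {
        TripleLine t = parse(
            "<http://example.org/a> <http://example.org/knows> <http://example.org/b> .");
        // The subject and the object each become a vertex; the predicate labels the edge.
        System.out.println("vertex " + t.subject + " has edge " + t.predicate
            + " to vertex " + t.object);
    }
}
```

Failing early on a bad line is a deliberate choice here: a descriptive
exception is easier to trace in a task log than a bare
java.lang.NullPointerException.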

Terminal output:

13/04/16 21:13:36 INFO bgp.HamaBgpComplete: Job begain.
13/04/16 21:13:37 INFO bsp.FileInputFormat: Total input paths to process : 2
13/04/16 21:13:38 INFO bsp.BSPJobClient: Running job: job_201304161357_0015
13/04/16 21:13:41 INFO bsp.BSPJobClient: Current supersteps number: 0
13/04/16 21:13:47 INFO bsp.BSPJobClient: Current supersteps number: 2
13/04/16 21:13:53 INFO bsp.BSPJobClient: Current supersteps number: 3
13/04/16 21:13:59 INFO bsp.BSPJobClient: Current supersteps number: 4
13/04/16 21:14:05 INFO bsp.BSPJobClient: Current supersteps number: 5
attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO sync.ZKSyncClient: Initializing ZK Sync Client
attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO sync.ZooKeeperSyncClientImpl: Start connecting to Zookeeper! At hadoop1/1.2.3.4:61002
attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server: Starting SocketReader
attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server: IPC Server Responder: starting
attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server: IPC Server handler 0 on 61002: starting
attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO message.HadoopMessageManagerImpl:  BSPPeer address:hadoop1 port:61002
attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server: IPC Server listener on 61002: starting
attempt_201304161357_0015_000000_0: 13/04/16 21:14:03 ERROR bsp.BSPTask: Error running bsp setup and bsp function.
attempt_201304161357_0015_000000_0: java.lang.NullPointerException
attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server: Stopping server on 61002
attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server: IPC Server handler 0 on 61002: exiting
attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server: Stopping IPC Server listener on 61002
attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server: Stopping IPC Server Responder
attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO metrics.RpcInstrumentation: shut down
attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 ERROR bsp.BSPTask: Shutting down ping service.
attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 FATAL bsp.GroomServer: Error running child
attempt_201304161357_0015_000000_0: java.lang.NullPointerException
attempt_201304161357_0015_000000_0: java.lang.NullPointerException
13/04/16 21:15:11 INFO bsp.BSPJobClient: Job failed.

tasklogs:
attempt_201304161357_0015_000000_0.log
13/04/16 21:13:43 INFO sync.ZKSyncClient: Initializing ZK Sync Client
13/04/16 21:13:43 INFO sync.ZooKeeperSyncClientImpl: Start connecting to Zookeeper! At hadoop1/1.2.3.4:61002
13/04/16 21:13:43 INFO ipc.Server: Starting SocketReader
13/04/16 21:13:43 INFO ipc.Server: IPC Server Responder: starting
13/04/16 21:13:43 INFO ipc.Server: IPC Server handler 0 on 61002: starting
13/04/16 21:13:43 INFO message.HadoopMessageManagerImpl:  BSPPeer address:hadoop1 port:61002
13/04/16 21:13:43 INFO ipc.Server: IPC Server listener on 61002: starting
13/04/16 21:14:03 ERROR bsp.BSPTask: Error running bsp setup and bsp function.
java.lang.NullPointerException
13/04/16 21:14:04 INFO ipc.Server: Stopping server on 61002
13/04/16 21:14:04 INFO ipc.Server: IPC Server handler 0 on 61002: exiting
13/04/16 21:14:04 INFO ipc.Server: Stopping IPC Server listener on 61002
13/04/16 21:14:04 INFO ipc.Server: Stopping IPC Server Responder
13/04/16 21:14:04 INFO metrics.RpcInstrumentation: shut down
13/04/16 21:14:04 ERROR bsp.BSPTask: Shutting down ping service.
13/04/16 21:14:04 FATAL bsp.GroomServer: Error running child
java.lang.NullPointerException
java.lang.NullPointerException

attempt_201304161357_0015_000001_0.log
13/04/16 21:13:42 INFO sync.ZKSyncClient: Initializing ZK Sync Client
13/04/16 21:13:42 INFO sync.ZooKeeperSyncClientImpl: Start connecting to Zookeeper! At hadoop1/1.2.3.4:61001
13/04/16 21:13:42 ERROR sync.ZooKeeperSyncClientImpl: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /bsp/job_201304161357_0015/peers
13/04/16 21:13:42 INFO ipc.Server: Starting SocketReader
13/04/16 21:13:42 INFO ipc.Server: IPC Server Responder: starting
13/04/16 21:13:42 INFO message.HadoopMessageManagerImpl:  BSPPeer address:hadoop1 port:61001
13/04/16 21:13:42 INFO ipc.Server: IPC Server listener on 61001: starting
13/04/16 21:13:42 INFO ipc.Server: IPC Server handler 0 on 61001: starting
13/04/16 21:14:06 ERROR bsp.BSPPeerImpl: Error while sending messages
java.io.IOException: Call to hadoop1/1.2.3.4:61002 failed on local exception: java.io.EOFException
     at org.apache.hadoop.ipc.Client.wrapException(Client.java:1103)
     at org.apache.hadoop.ipc.Client.call(Client.java:1071)
     at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225)
     at $Proxy3.put(Unknown Source)
     at org.apache.hama.bsp.message.HadoopMessageManagerImpl.transfer(HadoopMessageManagerImpl.java:108)
     at org.apache.hama.bsp.BSPPeerImpl.sync(BSPPeerImpl.java:410)
     at org.apache.hama.graph.GraphJobRunner.bsp(GraphJobRunner.java:118)
     at org.apache.hama.bsp.BSPTask.runBSP(BSPTask.java:166)
     at org.apache.hama.bsp.BSPTask.run(BSPTask.java:143)
     at org.apache.hama.bsp.GroomServer$BSPPeerChild.main(GroomServer.java:1266)
Caused by: java.io.EOFException
     at java.io.DataInputStream.readInt(DataInputStream.java:375)
     at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:800)
     at org.apache.hadoop.ipc.Client$Connection.run(Client.java:745)

Re: Why my distributed mode does not work?

Posted by "Edward J. Yoon" <ed...@apache.org>.
P.S., please see our official wiki.


-- 
Best Regards, Edward J. Yoon
@eddieyoon

Re: Why my distributed mode does not work?

Posted by "Edward J. Yoon" <ed...@apache.org>.
Yes, the repair function has been removed.



-- 
Best Regards, Edward J. Yoon
@eddieyoon

Re: Why my distributed mode does not work?

Posted by Lyu Xuedong <lx...@gmail.com>.
Hi, Edward,

Thank you.
I updated the version, and now there is a new problem.

The Apache Hama BSP Programming Model document
(http://people.apache.org/~tjungblut/downloads/hamadocs/ApacheHamaBSPProgrammingmodel_06.pdf)
has a paragraph that describes 'Graph repair': "Hama requires a graph 
to be completed before feeding it to an algorithm. By complete we mean 
that every vertex that is referenced by an edge must somewhere be a 
vertex in the graph. In many cases of leafs this is not always the case, 
therefore we have added a repair functionality which is traversing the 
whole graph for leafs and adding them to the vertex structure to prevent 
algorithms from breaking with NullPointerExceptions when it does not 
find a referenced vertex. You can turn this feature on by setting it in 
your configuration like this: 
conf.setBoolean(GraphJobRunner.GRAPH_REPAIR, true);"

I followed this guidance, but Eclipse reports 'GRAPH_REPAIR cannot be
resolved or is not a field'. I then read the source code, and there
really is no variable named GRAPH_REPAIR in GraphJobRunner.java, although
it exists in version 0.5.0. Is this function no longer supported? How
can I make a 'repaired graph'?
Thank you.
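
One possible way to obtain a 'repaired graph' without the removed option
is to complete the graph in a pre-processing step, before the input is
handed to the job. The sketch below only illustrates the idea from the
quoted paragraph: every vertex referenced by an edge also gets its own
vertex entry. GraphCompleter is a hypothetical class, not a Hama API; it
assumes the triples are already parsed into {subject, predicate, object}
arrays and fit in memory on one machine, which may not hold for large
inputs. How the completed graph is written back out for the job's vertex
reader is left open.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical pre-processing step (not a Hama API): completes a graph. */
final class GraphCompleter {

    /**
     * Builds adjacency lists from {subject, predicate, object} triples.
     * Each map value holds the {predicate, object} pairs leaving that vertex.
     * Objects that never occur as a subject get an empty edge list, so every
     * edge target exists as a vertex.
     */
    static Map<String, List<String[]>> complete(List<String[]> triples) {
        Map<String, List<String[]>> adjacency =
            new LinkedHashMap<String, List<String[]>>();
        for (String[] triple : triples) {
            String subject = triple[0];
            String predicate = triple[1];
            String object = triple[2];
            vertex(adjacency, subject).add(new String[] {predicate, object});
            // Make sure the edge target exists, even if it has no outgoing edges.
            vertex(adjacency, object);
        }
        return adjacency;
    }

    /** Returns the edge list of the vertex, creating an empty one if needed. */
    private static List<String[]> vertex(Map<String, List<String[]>> adjacency,
                                         String id) {
        List<String[]> edges = adjacency.get(id);
        if (edges == null) {
            edges = new ArrayList<String[]>();
            adjacency.put(id, edges);
        }
        return edges;
    }

    public static void main(String[] args) {
        List<String[]> triples = new ArrayList<String[]>();
        triples.add(new String[] {"<a>", "<knows>", "<b>"});
        triples.add(new String[] {"<b>", "<knows>", "<c>"});
        // <c> is only ever an object, yet it still appears as a vertex below.
        for (Map.Entry<String, List<String[]>> e : complete(triples).entrySet()) {
            System.out.println(e.getKey() + " outgoing edges: " + e.getValue().size());
        }
    }
}
```

Whether a missing leaf vertex is the actual cause of the
NullPointerException in this thread is not confirmed by the logs, which
show no stack trace for it.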

On 04/17/2013 12:51 PM, Edward J. Yoon wrote:
> Please use 0.6.1 and try your application with small data again.
>
> See also http://hama.apache.org/run_examples.html
>
> On Wed, Apr 17, 2013 at 10:27 AM, Lyu Xuedong <lx...@gmail.com> wrote:
>> hama: 0.6.0
>> hadoop : 1.0.4
>> JDK : 1.6
>> OS : ubuntu 12.04
>>
>>
>> On 04/17/2013 05:47 AM, Edward J. Yoon wrote:
>>> Your version?
>>>
>>> On Wed, Apr 17, 2013 at 12:07 AM, Lvxuedong <lx...@gmail.com> wrote:
>>>> Hi, Edward, thank you. But your suggestion does not seem to work. Do you
>>>> have any other advice?
>>>>
>>>> Is java.lang.NullPointerException related to heap size?
>>>>
>>>>
>>>>
>>>> On 2013-4-16, at 22:13, "Edward J. Yoon" <ed...@apache.org> wrote:
>>>>
>>>>> I guess you need to increase the child process JVM heap size.
>>>>>
>>>>> - conf/hama-site.xml:
>>>>>
>>>>>    <property>
>>>>>      <name>bsp.child.java.opts</name>
>>>>>      <value>-Xmx2048m</value>
>>>>>    </property>
>>>>>
>>>>> On Tue, Apr 16, 2013 at 10:55 PM, Lyu Xuedong <lx...@gmail.com>
>>>>> wrote:
>>>>>> My project works well in Eclipse, but when I export it as a jar file
>>>>>> and submit it to a two-node Hama cluster, ERROR and FATAL messages
>>>>>> appear if the input file is larger than 64 MB.
>>>>>> I suspect that something in my Hadoop or Hama configuration files is
>>>>>> wrong, but the pi estimator example runs normally on my cluster and
>>>>>> GroomServer$BSPPeerChild processes can be seen on each node. I have
>>>>>> been debugging for a whole day without any improvement.
>>>>>> My input file contains a large number of RDF triples of the form
>>>>>> "<subject> <predicate> <object> ." My task is to create vertices for
>>>>>> the subjects and objects; the predicates become the subjects'
>>>>>> outgoing edges.
>>>>>> What should I do?
>>>>>>
>>>>>> Terminal output:
>>>>>>
>>>>>> 13/04/16 21:13:36 INFO bgp.HamaBgpComplete: Job begain.
>>>>>> 13/04/16 21:13:37 INFO bsp.FileInputFormat: Total input paths to
>>>>>> process : 2
>>>>>> 13/04/16 21:13:38 INFO bsp.BSPJobClient: Running job:
>>>>>> job_201304161357_0015
>>>>>> 13/04/16 21:13:41 INFO bsp.BSPJobClient: Current supersteps number: 0
>>>>>> 13/04/16 21:13:47 INFO bsp.BSPJobClient: Current supersteps number: 2
>>>>>> 13/04/16 21:13:53 INFO bsp.BSPJobClient: Current supersteps number: 3
>>>>>> 13/04/16 21:13:59 INFO bsp.BSPJobClient: Current supersteps number: 4
>>>>>> 13/04/16 21:14:05 INFO bsp.BSPJobClient: Current supersteps number: 5
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO
>>>>>> sync.ZKSyncClient: Initializing ZK Sync Client
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO
>>>>>> sync.ZooKeeperSyncClientImpl: Start connecting to Zookeeper! At
>>>>>> hadoop1/1.2.3.4:61002
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server:
>>>>>> Starting SocketReader
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server:
>>>>>> IPC
>>>>>> Server Responder: starting
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server:
>>>>>> IPC
>>>>>> Server handler 0 on 61002: starting
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO
>>>>>> message.HadoopMessageManagerImpl:  BSPPeer address:hadoop1 port:61002
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server:
>>>>>> IPC
>>>>>> Server listener on 61002: starting
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:03 ERROR
>>>>>> bsp.BSPTask:
>>>>>> Error running bsp setup and bsp function.
>>>>>> attempt_201304161357_0015_000000_0: java.lang.NullPointerException
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server:
>>>>>> Stopping server on 61002
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server:
>>>>>> IPC
>>>>>> Server handler 0 on 61002: exiting
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server:
>>>>>> Stopping IPC Server listener on 61002
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server:
>>>>>> Stopping IPC Server Responder
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO
>>>>>> metrics.RpcInstrumentation: shut down
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 ERROR
>>>>>> bsp.BSPTask:
>>>>>> Shutting down ping service.
>>>>>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 FATAL
>>>>>> bsp.GroomServer:
>>>>>> Error running child
>>>>>> attempt_201304161357_0015_000000_0: java.lang.NullPointerException
>>>>>> attempt_201304161357_0015_000000_0: java.lang.NullPointerException
>>>>>> 13/04/16 21:15:11 INFO bsp.BSPJobClient: Job failed.
>>>>>>
>>>>>> tasklogs:
>>>>>> attempt_201304161357_0015_000000_0.log
>>>>>> 13/04/16 21:13:43 INFO sync.ZKSyncClient: Initializing ZK Sync Client
>>>>>> 13/04/16 21:13:43 INFO sync.ZooKeeperSyncClientImpl: Start connecting
>>>>>> to
>>>>>> Zookeeper! At hadoop1/1.2.3.4:61002
>>>>>> 13/04/16 21:13:43 INFO ipc.Server: Starting SocketReader
>>>>>> 13/04/16 21:13:43 INFO ipc.Server: IPC Server Responder: starting
>>>>>> 13/04/16 21:13:43 INFO ipc.Server: IPC Server handler 0 on 61002:
>>>>>> starting
>>>>>> 13/04/16 21:13:43 INFO message.HadoopMessageManagerImpl:  BSPPeer
>>>>>> address:hadoop1 port:61002
>>>>>> 13/04/16 21:13:43 INFO ipc.Server: IPC Server listener on 61002:
>>>>>> starting
>>>>>> 13/04/16 21:14:03 ERROR bsp.BSPTask: Error running bsp setup and bsp
>>>>>> function.
>>>>>> java.lang.NullPointerException
>>>>>> 13/04/16 21:14:04 INFO ipc.Server: Stopping server on 61002
>>>>>> 13/04/16 21:14:04 INFO ipc.Server: IPC Server handler 0 on 61002:
>>>>>> exiting
>>>>>> 13/04/16 21:14:04 INFO ipc.Server: Stopping IPC Server listener on
>>>>>> 61002
>>>>>> 13/04/16 21:14:04 INFO ipc.Server: Stopping IPC Server Responder
>>>>>> 13/04/16 21:14:04 INFO metrics.RpcInstrumentation: shut down
>>>>>> 13/04/16 21:14:04 ERROR bsp.BSPTask: Shutting down ping service.
>>>>>> 13/04/16 21:14:04 FATAL bsp.GroomServer: Error running child
>>>>>> java.lang.NullPointerException
>>>>>> java.lang.NullPointerException
>>>>>>
>>>>>> attempt_201304161357_0015_000001_0.log
>>>>>> 13/04/16 21:13:42 INFO sync.ZKSyncClient: Initializing ZK Sync Client
>>>>>> 13/04/16 21:13:42 INFO sync.ZooKeeperSyncClientImpl: Start connecting
>>>>>> to
>>>>>> Zookeeper! At hadoop1/1.2.3.4:61001
>>>>>> 13/04/16 21:13:42 ERROR sync.ZooKeeperSyncClientImpl:
>>>>>> org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode =
>>>>>> NoNode for /bsp/job_201304161357_0015/peers
>>>>>> 13/04/16 21:13:42 INFO ipc.Server: Starting SocketReader
>>>>>> 13/04/16 21:13:42 INFO ipc.Server: IPC Server Responder: starting
>>>>>> 13/04/16 21:13:42 INFO message.HadoopMessageManagerImpl:  BSPPeer
>>>>>> address:hadoop1 port:61001
>>>>>> 13/04/16 21:13:42 INFO ipc.Server: IPC Server listener on 61001:
>>>>>> starting
>>>>>> 13/04/16 21:13:42 INFO ipc.Server: IPC Server handler 0 on 61001:
>>>>>> starting
>>>>>> 13/04/16 21:14:06 ERROR bsp.BSPPeerImpl: Error while sending messages
>>>>>> java.io.IOException: Call to hadoop1/1.2.3.4:61002 failed on local
>>>>>> exception: java.io.EOFException
>>>>>>      at org.apache.hadoop.ipc.Client.wrapException(Client.java:1103)
>>>>>>      at org.apache.hadoop.ipc.Client.call(Client.java:1071)
>>>>>>      at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225)
>>>>>>      at $Proxy3.put(Unknown Source)
>>>>>>      at
>>>>>>
>>>>>> org.apache.hama.bsp.message.HadoopMessageManagerImpl.transfer(HadoopMessageManagerImpl.java:108)
>>>>>>      at org.apache.hama.bsp.BSPPeerImpl.sync(BSPPeerImpl.java:410)
>>>>>>      at
>>>>>> org.apache.hama.graph.GraphJobRunner.bsp(GraphJobRunner.java:118)
>>>>>>      at org.apache.hama.bsp.BSPTask.runBSP(BSPTask.java:166)
>>>>>>      at org.apache.hama.bsp.BSPTask.run(BSPTask.java:143)
>>>>>>      at
>>>>>>
>>>>>> org.apache.hama.bsp.GroomServer$BSPPeerChild.main(GroomServer.java:1266)
>>>>>> Caused by: java.io.EOFException
>>>>>>      at java.io.DataInputStream.readInt(DataInputStream.java:375)
>>>>>>      at
>>>>>>
>>>>>> org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:800)
>>>>>>      at org.apache.hadoop.ipc.Client$Connection.run(Client.java:745)
>>>>>
>>>>>
>>>>> --
>>>>> Best Regards, Edward J. Yoon
>>>>> @eddieyoon
>>>
>>>
>
>


Re: Why my distributed mode does not work?

Posted by "Edward J. Yoon" <ed...@apache.org>.
Please upgrade to Hama 0.6.1 and try your application again with a small data set.

See also http://hama.apache.org/run_examples.html
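
One way to get a small data set is to cut the first lines out of the existing input file. The following is only a minimal sketch, assuming the input is a plain text file with one triple per line, as described in the original post. The class name SampleInput and the method copyFirstLines are hypothetical helpers, not part of Hama or Hadoop, and the code uses only java.io so that it also compiles on JDK 1.6.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.FileReader;
import java.io.FileWriter;
import java.io.IOException;

/** Hypothetical helper (not part of Hama): writes the first lines of a file to a smaller file. */
final class SampleInput {

    /** Copies at most maxLines lines from inPath to outPath and returns how many were copied. */
    static int copyFirstLines(String inPath, String outPath, int maxLines) throws IOException {
        BufferedReader reader = new BufferedReader(new FileReader(inPath));
        try {
            BufferedWriter writer = new BufferedWriter(new FileWriter(outPath));
            try {
                int copied = 0;
                String line;
                // Stop at the line limit or at end of file, whichever comes first.
                while (copied < maxLines && (line = reader.readLine()) != null) {
                    writer.write(line);
                    writer.newLine();
                    copied++;
                }
                return copied;
            } finally {
                writer.close();
            }
        } finally {
            reader.close();
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 3) {
            System.err.println("usage: SampleInput <input> <output> <maxLines>");
            return;
        }
        int copied = copyFirstLines(args[0], args[1], Integer.parseInt(args[2]));
        System.out.println("copied " + copied + " lines");
    }
}
```

If the job succeeds on the small sample but fails on the full file, that would point towards resources or data size rather than the cluster configuration.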

On Wed, Apr 17, 2013 at 10:27 AM, Lyu Xuedong <lx...@gmail.com> wrote:
> Hama: 0.6.0
> Hadoop: 1.0.4
> JDK: 1.6
> OS: Ubuntu 12.04
>
>
> On 04/17/2013 05:47 AM, Edward J. Yoon wrote:
>>
>> Your version?
>>
>> On Wed, Apr 17, 2013 at 12:07 AM, Lvxuedong <lx...@gmail.com> wrote:
>>>
>>> Hi Edward, thank you. But your suggestion does not seem to work. Do you
>>> have any other advice?
>>>
>>> Is java.lang.NullPointerException related to heap size?
>>>
>>>
>>>
>>> On 2013-4-16, at 22:13, "Edward J. Yoon" <ed...@apache.org> wrote:
>>>
>>>> I guess you need to increase the child process JVM heap size.
>>>>
>>>> - conf/hama-site.xml:
>>>>
>>>>   <property>
>>>>     <name>bsp.child.java.opts</name>
>>>>     <value>-Xmx2048m</value>
>>>>   </property>
>>>>
>>
>>
>>
>



-- 
Best Regards, Edward J. Yoon
@eddieyoon

Re: Why my distributed mode does not work?

Posted by Lyu Xuedong <lx...@gmail.com>.
Hama: 0.6.0
Hadoop: 1.0.4
JDK: 1.6
OS: Ubuntu 12.04

On 04/17/2013 05:47 AM, Edward J. Yoon wrote:
> Your version?
>
> On Wed, Apr 17, 2013 at 12:07 AM, Lvxuedong <lx...@gmail.com> wrote:
>> Hi Edward, thank you. But your suggestion does not seem to work. Do you have any other advice?
>>
>> Is java.lang.NullPointerException related to heap size?
>>
>>
>>
>> On 2013-4-16, at 22:13, "Edward J. Yoon" <ed...@apache.org> wrote:
>>
>>> I guess you need to increase the child process JVM heap size.
>>>
>>> - conf/hama-site.xml:
>>>
>>>   <property>
>>>     <name>bsp.child.java.opts</name>
>>>     <value>-Xmx2048m</value>
>>>   </property>
>>>
>
>


Re: Why my distributed mode does not work?

Posted by "Edward J. Yoon" <ed...@apache.org>.
Your version?

On Wed, Apr 17, 2013 at 12:07 AM, Lvxuedong <lx...@gmail.com> wrote:
> Hi Edward, thank you. But your suggestion does not seem to work. Do you have any other advice?
>
> Is java.lang.NullPointerException related to heap size?
>
>
>
> On 2013-4-16, at 22:13, "Edward J. Yoon" <ed...@apache.org> wrote:
>
>> I guess you need to increase the child process JVM heap size.
>>
>> - conf/hama-site.xml:
>>
>>  <property>
>>    <name>bsp.child.java.opts</name>
>>    <value>-Xmx2048m</value>
>>  </property>
>>



-- 
Best Regards, Edward J. Yoon
@eddieyoon

Re: Why my distributed mode does not work?

Posted by Lvxuedong <lx...@gmail.com>.
Hi Edward, thank you. But your suggestion does not seem to work. Do you have any other advice?

Is java.lang.NullPointerException related to heap size?
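
For context on the question: a NullPointerException is thrown when code uses a null reference, whereas running out of heap normally shows up as java.lang.OutOfMemoryError. The logs in this thread do not show where the null comes from, so the following is only a sketch of one possible source, assuming each input line has the "<subject> <predicate> <object> ." form and that a malformed or blank line could otherwise lead to a null being used. The class TripleLineParser and its parse method are hypothetical and are not part of Hama.

```java
/** Hypothetical helper (not part of Hama): splits one RDF triple line into its three terms. */
final class TripleLineParser {

    /**
     * Returns {subject, predicate, object} for a line of the form
     * "<subject> <predicate> <object> .", or null if the line does not match.
     */
    static String[] parse(String line) {
        if (line == null) {
            return null;
        }
        String trimmed = line.trim();
        // Skip blank lines and comment lines.
        if (trimmed.isEmpty() || trimmed.startsWith("#")) {
            return null;
        }
        // A triple statement must end with a terminating dot.
        if (!trimmed.endsWith(".")) {
            return null;
        }
        String body = trimmed.substring(0, trimmed.length() - 1).trim();
        // Split into at most three parts so an object literal containing spaces stays whole.
        String[] parts = body.split("\\s+", 3);
        if (parts.length != 3) {
            return null;
        }
        return parts;
    }

    public static void main(String[] args) {
        String[] lines = {
            "<http://example.org/a> <http://example.org/knows> <http://example.org/b> .",
            "",
            "<http://example.org/a> <http://example.org/knows>"
        };
        for (String line : lines) {
            String[] triple = parse(line);
            if (triple == null) {
                // Checking for null here avoids a NullPointerException further down.
                System.out.println("skipped malformed line: [" + line + "]");
            } else {
                System.out.println(triple[0] + " --" + triple[1] + "--> " + triple[2]);
            }
        }
    }
}
```

Logging the skipped lines, rather than silently dropping them, may help to see whether the larger input file contains records that the small one does not.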



On 2013-4-16, at 22:13, "Edward J. Yoon" <ed...@apache.org> wrote:

> I guess you need to increase the child process JVM heap size.
> 
> - conf/hama-site.xml:
> 
>  <property>
>    <name>bsp.child.java.opts</name>
>    <value>-Xmx2048m</value>
>  </property>
> 
> On Tue, Apr 16, 2013 at 10:55 PM, Lyu Xuedong <lx...@gmail.com> wrote:
>> My project can work well in eclipse, but when I export it as a jar file and
>> submit it to a two-node hama cluster, some errors and fatal occur if the
>> input file is over 64M.
>> I suspect that my configure files of hadoop or hama is somewhere not right,
>> but a pi estimator can run normally on my cluster and
>> GroomServer$BSPPerChild can be seen on each nodes. I debug for a whole day,
>> nothing improved.
>> What in my input file are a large number of RDF triples: "<subject>
>> <predicate> <object> ." My task is to create vertices for subjects and
>> objects. Predicates are subjects' edges.
>> What should I do ?
>> 
>> Terminal output:
>> 
>> 13/04/16 21:13:36 INFO bgp.HamaBgpComplete: Job begain.
>> 13/04/16 21:13:37 INFO bsp.FileInputFormat: Total input paths to process : 2
>> 13/04/16 21:13:38 INFO bsp.BSPJobClient: Running job: job_201304161357_0015
>> 13/04/16 21:13:41 INFO bsp.BSPJobClient: Current supersteps number: 0
>> 13/04/16 21:13:47 INFO bsp.BSPJobClient: Current supersteps number: 2
>> 13/04/16 21:13:53 INFO bsp.BSPJobClient: Current supersteps number: 3
>> 13/04/16 21:13:59 INFO bsp.BSPJobClient: Current supersteps number: 4
>> 13/04/16 21:14:05 INFO bsp.BSPJobClient: Current supersteps number: 5
>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO
>> sync.ZKSyncClient: Initializing ZK Sync Client
>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO
>> sync.ZooKeeperSyncClientImpl: Start connecting to Zookeeper! At
>> hadoop1/1.2.3.4:61002
>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server:
>> Starting SocketReader
>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server: IPC
>> Server Responder: starting
>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server: IPC
>> Server handler 0 on 61002: starting
>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO
>> message.HadoopMessageManagerImpl:  BSPPeer address:hadoop1 port:61002
>> attempt_201304161357_0015_000000_0: 13/04/16 21:13:43 INFO ipc.Server: IPC
>> Server listener on 61002: starting
>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:03 ERROR bsp.BSPTask:
>> Error running bsp setup and bsp function.
>> attempt_201304161357_0015_000000_0: java.lang.NullPointerException
>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server:
>> Stopping server on 61002
>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server: IPC
>> Server handler 0 on 61002: exiting
>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server:
>> Stopping IPC Server listener on 61002
>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO ipc.Server:
>> Stopping IPC Server Responder
>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 INFO
>> metrics.RpcInstrumentation: shut down
>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 ERROR bsp.BSPTask:
>> Shutting down ping service.
>> attempt_201304161357_0015_000000_0: 13/04/16 21:14:04 FATAL bsp.GroomServer:
>> Error running child
>> attempt_201304161357_0015_000000_0: java.lang.NullPointerException
>> attempt_201304161357_0015_000000_0: java.lang.NullPointerException
>> 13/04/16 21:15:11 INFO bsp.BSPJobClient: Job failed.
>> 
>> tasklogs:
>> attempt_201304161357_0015_000000_0.log
>> 13/04/16 21:13:43 INFO sync.ZKSyncClient: Initializing ZK Sync Client
>> 13/04/16 21:13:43 INFO sync.ZooKeeperSyncClientImpl: Start connecting to
>> Zookeeper! At hadoop1/1.2.3.4:61002
>> 13/04/16 21:13:43 INFO ipc.Server: Starting SocketReader
>> 13/04/16 21:13:43 INFO ipc.Server: IPC Server Responder: starting
>> 13/04/16 21:13:43 INFO ipc.Server: IPC Server handler 0 on 61002: starting
>> 13/04/16 21:13:43 INFO message.HadoopMessageManagerImpl:  BSPPeer
>> address:hadoop1 port:61002
>> 13/04/16 21:13:43 INFO ipc.Server: IPC Server listener on 61002: starting
>> 13/04/16 21:14:03 ERROR bsp.BSPTask: Error running bsp setup and bsp
>> function.
>> java.lang.NullPointerException
>> 13/04/16 21:14:04 INFO ipc.Server: Stopping server on 61002
>> 13/04/16 21:14:04 INFO ipc.Server: IPC Server handler 0 on 61002: exiting
>> 13/04/16 21:14:04 INFO ipc.Server: Stopping IPC Server listener on 61002
>> 13/04/16 21:14:04 INFO ipc.Server: Stopping IPC Server Responder
>> 13/04/16 21:14:04 INFO metrics.RpcInstrumentation: shut down
>> 13/04/16 21:14:04 ERROR bsp.BSPTask: Shutting down ping service.
>> 13/04/16 21:14:04 FATAL bsp.GroomServer: Error running child
>> java.lang.NullPointerException
>> java.lang.NullPointerException
>> 
>> attempt_201304161357_0015_000001_0.log
>> 13/04/16 21:13:42 INFO sync.ZKSyncClient: Initializing ZK Sync Client
>> 13/04/16 21:13:42 INFO sync.ZooKeeperSyncClientImpl: Start connecting to
>> Zookeeper! At hadoop1/1.2.3.4:61001
>> 13/04/16 21:13:42 ERROR sync.ZooKeeperSyncClientImpl:
>> org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode =
>> NoNode for /bsp/job_201304161357_0015/peers
>> 13/04/16 21:13:42 INFO ipc.Server: Starting SocketReader
>> 13/04/16 21:13:42 INFO ipc.Server: IPC Server Responder: starting
>> 13/04/16 21:13:42 INFO message.HadoopMessageManagerImpl:  BSPPeer
>> address:hadoop1 port:61001
>> 13/04/16 21:13:42 INFO ipc.Server: IPC Server listener on 61001: starting
>> 13/04/16 21:13:42 INFO ipc.Server: IPC Server handler 0 on 61001: starting
>> 13/04/16 21:14:06 ERROR bsp.BSPPeerImpl: Error while sending messages
>> java.io.IOException: Call to hadoop1/1.2.3.4:61002 failed on local
>> exception: java.io.EOFException
>>    at org.apache.hadoop.ipc.Client.wrapException(Client.java:1103)
>>    at org.apache.hadoop.ipc.Client.call(Client.java:1071)
>>    at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225)
>>    at $Proxy3.put(Unknown Source)
>>    at
>> org.apache.hama.bsp.message.HadoopMessageManagerImpl.transfer(HadoopMessageManagerImpl.java:108)
>>    at org.apache.hama.bsp.BSPPeerImpl.sync(BSPPeerImpl.java:410)
>>    at org.apache.hama.graph.GraphJobRunner.bsp(GraphJobRunner.java:118)
>>    at org.apache.hama.bsp.BSPTask.runBSP(BSPTask.java:166)
>>    at org.apache.hama.bsp.BSPTask.run(BSPTask.java:143)
>>    at
>> org.apache.hama.bsp.GroomServer$BSPPeerChild.main(GroomServer.java:1266)
>> Caused by: java.io.EOFException
>>    at java.io.DataInputStream.readInt(DataInputStream.java:375)
>>    at
>> org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:800)
>>    at org.apache.hadoop.ipc.Client$Connection.run(Client.java:745)
> 
> 
> 
> -- 
> Best Regards, Edward J. Yoon
> @eddieyoon

Re: Why my distributed mode does not work?

Posted by "Edward J. Yoon" <ed...@apache.org>.
I guess you need to increase the JVM heap size of the child processes.

- conf/hama-site.xml:

  <property>
    <name>bsp.child.java.opts</name>
    <value>-Xmx2048m</value>
  </property>
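
One way to confirm that the new heap limit actually reached the child JVM is to print the maximum heap from inside the task. The following is only a minimal sketch: the class name HeapCheck and the helper maxHeapMiB are hypothetical and not part of Hama; it uses only the standard Runtime.maxMemory() call. Calling the helper from your own setup code and checking the task log is an assumption about how you would wire it in.

```java
// Hypothetical helper (not part of Hama): reports the JVM's maximum heap.
public class HeapCheck {

    // Returns the maximum amount of memory the current JVM will attempt
    // to use, converted from bytes to MiB.
    static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        // With -Xmx2048m this is expected to be in the neighborhood of 2048;
        // the exact figure depends on the JVM and garbage collector in use.
        System.out.println("Max heap (MiB): " + maxHeapMiB());
    }
}
```

If the reported value does not change after editing hama-site.xml, the file was probably not picked up by the groom servers, for example because it was changed on only one node or the daemons were not restarted.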

On Tue, Apr 16, 2013 at 10:55 PM, Lyu Xuedong <lx...@gmail.com> wrote:
> My project works well in Eclipse, but when I export it as a jar file and
> submit it to a two-node Hama cluster, errors and a fatal error occur if the
> input file is over 64 MB.
> I suspect that my Hadoop or Hama configuration files are wrong somewhere,
> but a pi estimator runs normally on my cluster, and
> GroomServer$BSPPeerChild can be seen on each node. I debugged for a whole
> day, but nothing improved.
> My input file contains a large number of RDF triples: "<subject>
> <predicate> <object> ." My task is to create vertices for subjects and
> objects. Predicates are the subjects' edges.
> What should I do?



-- 
Best Regards, Edward J. Yoon
@eddieyoon