You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@giraph.apache.org by "Avery Ching (JIRA)" <ji...@apache.org> on 2012/10/05 22:46:02 UTC

[jira] [Updated] (GIRAPH-356) Improve ZooKeeper issues

     [ https://issues.apache.org/jira/browse/GIRAPH-356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Avery Ching updated GIRAPH-356:
-------------------------------

    Summary: Improve ZooKeeper issues  (was: Help debug ZooKeeper issues)
    
> Improve ZooKeeper issues
> ------------------------
>
>                 Key: GIRAPH-356
>                 URL: https://issues.apache.org/jira/browse/GIRAPH-356
>             Project: Giraph
>          Issue Type: Improvement
>            Reporter: Avery Ching
>            Assignee: Avery Ching
>         Attachments: GIRAPH-356.2.patch, GIRAPH-356.patch
>
>
> Currently, if the ZooKeeper process fails, we have little information on why and what happened.  This patch addresses this by keeping the last 100 log lines and dumps when the map fails under a RuntimeException.
> Here is an example of a master task failure when there is an invalid JVM argument passed to ZooKeeper.  The error is much for obvious now.
> 2012-10-04 15:05:28,916 WARN org.apache.giraph.zk.ZooKeeperManager: logZooKeeperOutput: Dumping up to last 100 lines of the ZooKeeper process STDOUT and STDERR.
> 2012-10-04 15:05:28,916 WARN org.apache.giraph.zk.ZooKeeperManager$StreamCollector: Unrecognized option: -BadOpt
> 2012-10-04 15:05:28,916 WARN org.apache.giraph.zk.ZooKeeperManager$StreamCollector: Could not create the Java virtual machine.
> 2012-10-04 15:05:28,919 INFO org.apache.hadoop.mapred.TaskLogsTruncater: Initializing logs' truncater with mapRetainSize=-1 and reduceRetainSize=-1
> 2012-10-04 15:05:28,959 WARN org.apache.hadoop.mapred.Child: Error running child
> java.lang.IllegalStateException: run: Caught an unrecoverable exception onlineZooKeeperServers: Failed to connect in 5 tries!
>                                  at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:591)
>                                  at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:763)
>                                  at org.apache.hadoop.mapred.MapTask.run(MapTask.java:369)
>                                  at org.apache.hadoop.mapred.Child$4.run(Child.java:259)
>                                  at java.security.AccessController.doPrivileged(Native Method)
>                                  at javax.security.auth.Subject.doAs(Subject.java:396)
>                                  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
>                                  at org.apache.hadoop.mapred.Child.main(Child.java:253)
> Caused by: java.lang.IllegalStateException: onlineZooKeeperServers: Failed to connect in 5 tries!
>        at org.apache.giraph.zk.ZooKeeperManager.onlineZooKeeperServers(ZooKeeperManager.java:721)
>        at org.apache.giraph.graph.GraphMapper.setup(GraphMapper.java:328)
>        at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:573)
>        ... 7 more
> 2012-10-04 15:05:28,963 INFO org.apache.hadoop.mapred.Task: Runnning cleanup for the task

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira