You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Leting Wu (JIRA)" <ji...@apache.org> on 2012/06/30 00:06:44 UTC
[jira] [Comment Edited] (MAHOUT-1034) ERROR in Navie Bayes
Training(trainnb)
[ https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404266#comment-13404266 ]
Leting Wu edited comment on MAHOUT-1034 at 6/29/12 10:06 PM:
-------------------------------------------------------------
I tried Mahout 0.6. Does not look well either. My account is one of the ones on the workstation. I can run sudo to change the global setting. New to Mahout and really need help. Thanks.
{noformat}
$ ./examples/bin/classify-20newsgroups.sh
Please select a number to choose the corresponding task to run
1. naivebayes
2. sgd
3. clean -- cleans up the work area in /tmp/mahout-work-lwu
Enter your choice : 1
ok. You chose 1 and we'll use naivebayes
creating work directory at /tmp/mahout-work-lwu
Downloading 20news-bydate
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 13.7M 100 13.7M 0 0 186k 0 0:01:15 0:01:15 --:--:-- 441k
Extracting...
Preparing Training Data
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using HADOOP_HOME=/opt/hadoop
HADOOP_CONF_DIR=/opt/hadoop/conf
MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar
12/06/29 14:56:50 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only
12/06/29 14:56:51 INFO driver.MahoutDriver: Program took 1143 ms (Minutes: 0.01905)
Preparing Test Data
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using HADOOP_HOME=/opt/hadoop
HADOOP_CONF_DIR=/opt/hadoop/conf
MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar
12/06/29 14:56:52 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only
12/06/29 14:56:53 INFO driver.MahoutDriver: Program took 794 ms (Minutes: 0.013233333333333333)
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.
rmr: DEPRECATED: Please use 'rm -r' instead.
12/06/29 14:56:55 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888;
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.
rmr: DEPRECATED: Please use 'rm -r' instead.
12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888;
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.
12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (165149691) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
put: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888;
{noformat}
was (Author: rhinewlt):
I tried Mahout 0.6. Does not look well either. My account is one of the ones on the workstation. I can run sudo to change the global setting.
{noformat}
$ ./examples/bin/classify-20newsgroups.sh
Please select a number to choose the corresponding task to run
1. naivebayes
2. sgd
3. clean -- cleans up the work area in /tmp/mahout-work-lwu
Enter your choice : 1
ok. You chose 1 and we'll use naivebayes
creating work directory at /tmp/mahout-work-lwu
Downloading 20news-bydate
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 13.7M 100 13.7M 0 0 186k 0 0:01:15 0:01:15 --:--:-- 441k
Extracting...
Preparing Training Data
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using HADOOP_HOME=/opt/hadoop
HADOOP_CONF_DIR=/opt/hadoop/conf
MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar
12/06/29 14:56:50 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only
12/06/29 14:56:51 INFO driver.MahoutDriver: Program took 1143 ms (Minutes: 0.01905)
Preparing Test Data
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using HADOOP_HOME=/opt/hadoop
HADOOP_CONF_DIR=/opt/hadoop/conf
MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar
12/06/29 14:56:52 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only
12/06/29 14:56:53 INFO driver.MahoutDriver: Program took 794 ms (Minutes: 0.013233333333333333)
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.
rmr: DEPRECATED: Please use 'rm -r' instead.
12/06/29 14:56:55 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888;
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.
rmr: DEPRECATED: Please use 'rm -r' instead.
12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888;
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.
12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (165149691) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
put: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888;
{noformat}
> ERROR in Navie Bayes Training(trainnb)
> --------------------------------------
>
> Key: MAHOUT-1034
> URL: https://issues.apache.org/jira/browse/MAHOUT-1034
> Project: Mahout
> Issue Type: Bug
> Components: Classification
> Affects Versions: 0.7
> Environment: Ubuntu 11.04
> Reporter: Leting Wu
> Priority: Critical
>
> When run either examples/classify-20newsgrouops.sh or ash-email-examples.sh, trainnb always fails:
> {noformat}
> INFO mapred.JobClient: Task Id : attempt_201206281546_0003_m_000000_0, Status : FAILED
> java.lang.IllegalArgumentException
> at com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
> at org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
> at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
> at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:647)
> at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323)
> at org.apache.hadoop.mapred.Child$4.run(Child.java:270)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1177)
> at org.apache.hadoop.mapred.Child.main(Child.java:264)
> {noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira