You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Leting Wu (JIRA)" <ji...@apache.org> on 2012/06/30 00:06:44 UTC

[jira] [Comment Edited] (MAHOUT-1034) ERROR in Navie Bayes Training(trainnb)

    [ https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404266#comment-13404266 ] 

Leting Wu edited comment on MAHOUT-1034 at 6/29/12 10:06 PM:
-------------------------------------------------------------

I tried Mahout 0.6. Does not look well either. My account is one of the ones on the workstation. I can run sudo to change the global setting. New to Mahout and really need help. Thanks.

{noformat}
$ ./examples/bin/classify-20newsgroups.sh
Please select a number to choose the corresponding task to run
1. naivebayes
2. sgd
3. clean -- cleans up the work area in /tmp/mahout-work-lwu
Enter your choice : 1
ok. You chose 1 and we'll use naivebayes
creating work directory at /tmp/mahout-work-lwu
Downloading 20news-bydate
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 13.7M  100 13.7M    0     0   186k      0  0:01:15  0:01:15 --:--:--  441k
Extracting...
Preparing Training Data
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using HADOOP_HOME=/opt/hadoop
HADOOP_CONF_DIR=/opt/hadoop/conf
MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar
12/06/29 14:56:50 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only
12/06/29 14:56:51 INFO driver.MahoutDriver: Program took 1143 ms (Minutes: 0.01905)
Preparing Test Data
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using HADOOP_HOME=/opt/hadoop
HADOOP_CONF_DIR=/opt/hadoop/conf
MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar
12/06/29 14:56:52 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only
12/06/29 14:56:53 INFO driver.MahoutDriver: Program took 794 ms (Minutes: 0.013233333333333333)
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.

rmr: DEPRECATED: Please use 'rm -r' instead.
12/06/29 14:56:55 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
	at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
	at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; 
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.

rmr: DEPRECATED: Please use 'rm -r' instead.
12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
	at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
	at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; 
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.

12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (165149691) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
	at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
	at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
put: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; 
{noformat}
                
      was (Author: rhinewlt):
    I tried Mahout 0.6. Does not look well either. My account is one of the ones on the workstation. I can run sudo to change the global setting.

{noformat}
$ ./examples/bin/classify-20newsgroups.sh
Please select a number to choose the corresponding task to run
1. naivebayes
2. sgd
3. clean -- cleans up the work area in /tmp/mahout-work-lwu
Enter your choice : 1
ok. You chose 1 and we'll use naivebayes
creating work directory at /tmp/mahout-work-lwu
Downloading 20news-bydate
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 13.7M  100 13.7M    0     0   186k      0  0:01:15  0:01:15 --:--:--  441k
Extracting...
Preparing Training Data
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using HADOOP_HOME=/opt/hadoop
HADOOP_CONF_DIR=/opt/hadoop/conf
MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar
12/06/29 14:56:50 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only
12/06/29 14:56:51 INFO driver.MahoutDriver: Program took 1143 ms (Minutes: 0.01905)
Preparing Test Data
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using HADOOP_HOME=/opt/hadoop
HADOOP_CONF_DIR=/opt/hadoop/conf
MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar
12/06/29 14:56:52 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only
12/06/29 14:56:53 INFO driver.MahoutDriver: Program took 794 ms (Minutes: 0.013233333333333333)
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.

rmr: DEPRECATED: Please use 'rm -r' instead.
12/06/29 14:56:55 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
	at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
	at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; 
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.

rmr: DEPRECATED: Please use 'rm -r' instead.
12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
	at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
	at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; 
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.

12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (165149691) connection to localhost/127.0.0.1:8888 from lwu,5,main]
java.lang.NullPointerException
	at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852)
	at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781)
put: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; 
{noformat}
                  
> ERROR in Navie Bayes Training(trainnb)
> --------------------------------------
>
>                 Key: MAHOUT-1034
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1034
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.7
>         Environment: Ubuntu 11.04
>            Reporter: Leting Wu
>            Priority: Critical
>
> When run either examples/classify-20newsgrouops.sh or ash-email-examples.sh, trainnb always fails:
> {noformat}
> INFO mapred.JobClient: Task Id : attempt_201206281546_0003_m_000000_0, Status : FAILED
> java.lang.IllegalArgumentException
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
> 	at org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
> 	at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
> 	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:647)
> 	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323)
> 	at org.apache.hadoop.mapred.Child$4.run(Child.java:270)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at javax.security.auth.Subject.doAs(Subject.java:396)
> 	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1177)
> 	at org.apache.hadoop.mapred.Child.main(Child.java:264)
> {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira