Posted to dev@mahout.apache.org by "alan krumholz (JIRA)" <ji...@apache.org> on 2013/09/05 21:37:51 UTC
[jira] [Created] (MAHOUT-1327)
org.apache.mahout.clustering.classify.ClusterClassifier.readFromSeqFiles is
using a new instance of the Configuration object to read the file from the
Path instead of using the Configuration object passed to the method
alan krumholz created MAHOUT-1327:
-------------------------------------
Summary: org.apache.mahout.clustering.classify.ClusterClassifier.readFromSeqFiles is using a new instance of the Configuration object to read the file from the Path instead of using the Configuration object passed to the method
Key: MAHOUT-1327
URL: https://issues.apache.org/jira/browse/MAHOUT-1327
Project: Mahout
Issue Type: Bug
Components: Clustering
Affects Versions: 0.8, 0.7
Reporter: alan krumholz
Priority: Critical
When you call KMeansDriver.run with a Configuration object that points to HDFS:
Configuration conf = new Configuration();
conf.addResource(new Path("C:\\hdp-win\\hadoop\\hadoop-1.1.0-SNAPSHOT\\conf\\core-site.xml"));
conf.addResource(new Path("C:\\hdp-win\\hadoop\\hadoop-1.1.0-SNAPSHOT\\conf\\hdfs-site.xml"));
it eventually calls org.apache.mahout.clustering.classify.ClusterClassifier.readFromSeqFiles,
which throws the exception below. The problem does not occur when the conf object points to the local file system:
java.lang.IndexOutOfBoundsException: Index: 0, Size: 0
at java.util.ArrayList.RangeCheck(ArrayList.java:547)
at java.util.ArrayList.get(ArrayList.java:322)
at org.apache.mahout.clustering.classify.ClusterClassifier.readFromSeqFiles(ClusterClassifier.java:215)
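I think this happens because the method creates a new Configuration instance to read the file from the Path instead of using the Configuration object passed to it. The "Index: 0, Size: 0" in the trace indicates that the list being read was empty, which is consistent with the lookup going to the wrong file system.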
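
The suspected pattern can be illustrated with a small, self-contained sketch. It does not use Mahout or Hadoop classes: Conf, readClusters, firstClusterBuggy, firstClusterFixed and the hdfs://namenode:8020 address are all hypothetical stand-ins chosen for illustration, and the sketch only assumes that a freshly created configuration defaults to the local file system. It shows how ignoring the caller's configuration can yield an empty list and the same kind of IndexOutOfBoundsException as in the trace.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class ConfigPassingSketch {

    /** Hypothetical stand-in for a Hadoop-style configuration (not the real class). */
    public static final class Conf {
        private final Map<String, String> values = new HashMap<String, String>();

        public Conf() {
            // Assumption: a fresh configuration points at the local file system.
            values.put("fs.default.name", "file:///");
        }

        public void set(String key, String value) {
            values.put(key, value);
        }

        public String get(String key) {
            return values.get(key);
        }
    }

    /** Stand-in reader: the cluster files exist only on the (made-up) HDFS instance. */
    public static List<String> readClusters(Conf conf) {
        List<String> clusters = new ArrayList<String>();
        if ("hdfs://namenode:8020".equals(conf.get("fs.default.name"))) {
            clusters.addAll(Arrays.asList("cluster-0", "cluster-1"));
        }
        return clusters;
    }

    /** Suspected pattern: the conf parameter is ignored and a new instance is used. */
    public static String firstClusterBuggy(Conf conf) {
        Conf fresh = new Conf();             // caller's HDFS settings are lost here
        return readClusters(fresh).get(0);   // empty list, so get(0) throws
    }

    /** Proposed pattern: read with the configuration the caller passed in. */
    public static String firstClusterFixed(Conf conf) {
        return readClusters(conf).get(0);
    }

    public static void main(String[] args) {
        Conf conf = new Conf();
        conf.set("fs.default.name", "hdfs://namenode:8020");

        System.out.println("fixed variant read: " + firstClusterFixed(conf));
        try {
            firstClusterBuggy(conf);
        } catch (IndexOutOfBoundsException e) {
            System.out.println("buggy variant failed: " + e);
        }
    }
}
```

If this diagnosis is right, the corresponding change in readFromSeqFiles would be to pass the conf argument through to whatever reads the sequence files, rather than constructing a new Configuration inside the method.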