Posted to dev@mahout.apache.org by "Guo Ruijing (JIRA)" <ji...@apache.org> on 2014/09/02 10:22:21 UTC

[jira] [Commented] (MAHOUT-1612) NullPointerException happens during JSON output format for clusterdumper

    [ https://issues.apache.org/jira/browse/MAHOUT-1612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14118020#comment-14118020 ] 

Guo Ruijing commented on MAHOUT-1612:
-------------------------------------

Detailed log:

14/08/26 17:29:05 INFO common.AbstractJob: Command line arguments: {--dictionaryType=[text], --distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure], --endPhase=[2147483647], --input=[output/clusters-*-final], --output=[/tmp/report], --outputFormat=[JSON], --pointsDir=[output/clusteredPoints], --startPhase=[0], --tempDir=[temp]}
Exception in thread "main" java.lang.NullPointerException
at org.apache.mahout.utils.clustering.JsonClusterWriter.getTopFeaturesList(JsonClusterWriter.java:118)
at org.apache.mahout.utils.clustering.JsonClusterWriter.write(JsonClusterWriter.java:73)
at org.apache.mahout.utils.clustering.AbstractClusterWriter.write(AbstractClusterWriter.java:115)
at org.apache.mahout.utils.clustering.AbstractClusterWriter.write(AbstractClusterWriter.java:102)
at org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:192)
at org.apache.mahout.utils.clustering.ClusterDumper.run(ClusterDumper.java:157)
at org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:101)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
at org.apache.hadoop.util.ProgramDriver.run(ProgramDriver.java:145)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:153)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
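
The trace alone does not show which reference is null at JsonClusterWriter.java:118. One plausible reading, offered only as an assumption, is that the command line above passes no dictionary file (only --dictionaryType), so a term dictionary array could be null when feature indices are mapped to terms. The sketch below is not Mahout code; the class and method names (NullSafeTermLookup, termFor, labels) are hypothetical and only illustrate a null-safe lookup that falls back to the feature index.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration, NOT the actual JsonClusterWriter source.
public class NullSafeTermLookup {

    // Returns the dictionary term for a feature index, or the index itself
    // as a string when no dictionary was loaded or the entry is missing.
    static String termFor(String[] dictionary, int index) {
        if (dictionary == null
                || index < 0
                || index >= dictionary.length
                || dictionary[index] == null) {
            return String.valueOf(index);
        }
        return dictionary[index];
    }

    // Maps a list of feature indices to labels using the null-safe lookup.
    static List<String> labels(String[] dictionary, int[] indices) {
        List<String> out = new ArrayList<>();
        for (int i : indices) {
            out.add(termFor(dictionary, i));
        }
        return out;
    }

    public static void main(String[] args) {
        // No dictionary supplied: labels fall back to the raw indices.
        System.out.println(labels(null, new int[] {0, 3}));
        // Dictionary supplied, but one entry is missing.
        System.out.println(labels(new String[] {"alpha", null}, new int[] {0, 1}));
    }
}
```

If this assumption holds, a guard of this kind would let JSON output proceed without a dictionary, the way the reporter expects from the CSV and TEXT formats.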

> NullPointerException happens during JSON output format for clusterdumper
> ------------------------------------------------------------------------
>
>                 Key: MAHOUT-1612
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1612
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.9
>            Reporter: Guo Ruijing
>
> 1. download datafile from:
> http://archive.ics.uci.edu/ml/databases/synthetic_control/synthetic_control.data
> 2. put data file on hdfs:
> hdfs dfs -mkdir testdata
> hdfs dfs -put synthetic_control.data testdata/
> 3. run a mahout clustering job:
> mahout org.apache.mahout.clustering.syntheticcontrol.kmeans.Job
> 4. run clusterdump with JSON format:
> mahout clusterdump -i output/clusters*-final -p output/clusteredPoints -o /tmp/report -of JSON
> expected:
> clusterdump with JSON format should succeed, the same as with CSV and TEXT
> actual:
> clusterdump with JSON format throws a NullPointerException



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)